Title:
SiSU - SiSU information Structuring Universe / Structured information, Serialized Units - Markup Samples, Output Examples
Creator:
Ralph Amissah
Rights:
Copyright (C) Ralph Amissah 2007, part of SiSU documentation, License GPL 3
Type:
information
Subject:
ebook, epublishing, electronic book, electronic publishing, electronic document, electronic citation, data structure, citation systems, search
Date created:
Date issued:
Date available:
Date modified:
Date:
2007-09-16
SiSU - SiSU information Structuring Universe / Structured information,
Serialized Units - Markup Samples, Output Examples, Ralph Amissah
1
SiSU Markup and Output Examples
2
1. Markup and Output Examples
3
1.1 Markup examples
4
Current markup examples and document output samples are provided at
< http://www.jus.uio.no/sisu/SiSU/2.html>
5
Some markup with syntax highlighting may be found under < http://www.jus.uio.no/sisu/sample/syntax>
but is not as up to date.
6
For some documents hardly any markup is required at all, other than a
header and an indication of the levels to be taken into account by the
program in generating its output.
7
1.2 A few book (and other) examples
8
[aukio.png] "Aukio, by Leena Krohn" 1
1. Reproduced with the kind permission of author and artist Leena Krohn,
< http://www.kaapeli.fi/krohn>
"Aukio" is from the work "Sphinx or Robot" < http://www.jus.uio.no/sisu/sphinx_or_robot.leena_krohn.1996>
which is included as a book example in this section, together with
another of the author's works, "Tainaron" < http://www.jus.uio.no/sisu/tainaron.leena_krohn.1998>
9
"The Wealth of Networks" , Yochai Benkler
10
"The Wealth of Networks", Yochai Benkler
11
document manifest 2
2. < http://www.jus.uio.no/sisu/sisu_manual/the_wealth_of_networks.yochai_benkler/sisu_manifest.html>
12
html, segmented text
13
html, scroll, document in one
14
pdf, landscape
15
pdf, portrait
16
open document
17
xhtml scroll
18
xml, sax
19
xml, dom
20
plain text utf-8
21
concordance
22
dcc, document content certificate (digests)
23
markup source text
24
zipped markup source pod
25
"Free Culture" , Lawrence Lessig
26
"Free Culture", Lawrence Lessig
27
document manifest 3
3. < http://www.jus.uio.no/sisu/sisu_manual/free_culture.lawrence_lessig/sisu_manifest.html>
28
html, segmented text
29
html, scroll, document in one
30
pdf, landscape
31
pdf, portrait
32
open document
33
xhtml scroll
34
xml, sax
35
xml, dom
36
plain text utf-8
37
concordance
38
dcc, document content certificate (digests)
39
markup source text
40
zipped markup source pod
41
"Free as in Freedom: Richard Stallman's Crusade for Free
Software" , by Sam Williams
42
"Free as in Freedom: Richard Stallman's Crusade for Free Software", by
Sam Williams
43
document manifest 4
4. < http://www.jus.uio.no/sisu/sisu_manual/free_as_in_freedom.richard_stallman_crusade_for_free_software.sam_williams/sisu_manifest.html>
44
html, segmented text
45
html, scroll, document in one
46
pdf, landscape
47
pdf, portrait
48
open document
49
xhtml scroll
50
xml, sax
51
xml, dom
52
plain text utf-8
53
concordance
54
dcc, document content certificate (digests)
55
markup source text
56
zipped markup source pod
57
"Free For All: How Linux and the Free Software Movement Undercut the
High Tech Titans" , by Peter Wayner
58
"Free For All: How Linux and the Free Software Movement Undercut the
High Tech Titans", by Peter Wayner
59
document manifest 5
5. < http://www.jus.uio.no/sisu/sisu_manual/free_for_all.peter_wayner/sisu_manifest.html>
60
html, segmented text
61
html, scroll, document in one
62
pdf, landscape
63
pdf, portrait
64
open document
65
xhtml scroll
66
xml, sax
67
xml, dom
68
plain text utf-8
69
concordance
70
dcc, document content certificate (digests)
71
markup source text
72
zipped markup source pod
73
"The Cathedral and the Bazaar" , by Eric S. Raymond
74
"The Cathedral and the Bazaar", by Eric S. Raymond
75
document manifest 6
6. < http://www.jus.uio.no/sisu/sisu_manual/the_cathedral_and_the_bazaar.eric_s_raymond/sisu_manifest.html>
76
html, segmented text
77
html, scroll, document in one
78
pdf, landscape
79
pdf, portrait
80
open document
81
xhtml scroll
82
xml, sax
83
xml, dom
84
plain text utf-8
85
concordance
86
dcc, document content certificate (digests)
87
markup source text
88
zipped markup source pod
89
"Accelerando" , Charles Stross
90
"Accelerando", Charles Stross
91
document manifest 7
7. < http://www.jus.uio.no/sisu/sisu_manual/accelerando.charles_stross/sisu_manifest.html>
92
html, segmented text
93
html, scroll, document in one
94
pdf, landscape
95
pdf, portrait
96
open document
97
xhtml scroll
98
xml, sax
99
xml, dom
100
plain text utf-8
101
concordance
102
dcc, document content certificate (digests)
103
markup source text
104
zipped markup source pod
105
"Tainaron" , Leena Krohn
106
"Tainaron", Leena Krohn
107
document manifest 8
8. < http://www.jus.uio.no/sisu/sisu_manual/tainaron.leena_krohn.1998/sisu_manifest.html>
108
html, segmented text
109
html, scroll, document in one
110
pdf, landscape
111
pdf, portrait
112
open document
113
xhtml scroll
114
xml, sax
115
xml, dom
116
plain text utf-8
117
concordance
118
dcc, document content certificate (digests)
119
markup source text
120
zipped markup source pod
121
"Sphinx or Robot" , Leena Krohn
122
[i_sor.png] "Sphinx or Robot by Leena Krohn"
123
"Sphinx or Robot", Leena Krohn
124
document manifest 9
9. < http://www.jus.uio.no/sisu/sisu_manual/sphinx_or_robot.leena_krohn.1996/sisu_manifest.html>
125
html, segmented text
126
html, scroll, document in one
127
pdf, landscape
128
pdf, portrait
129
open document
130
xhtml scroll
131
xml, sax
132
xml, dom
133
plain text utf-8
134
concordance
135
dcc, document content certificate (digests)
136
markup source text
137
zipped markup source pod
138
"War and Peace" , Leo Tolstoy, PG Etext 2600
139
"War and Peace", Leo Tolstoy 10
10. < http://www.jus.uio.no/sisu/war_and_peace.leo_tolstoy/toc.html>
The ascii text was taken from Project Gutenberg. The markup
transforms required are trivial. Of interest, in this instance I am
saved by having alternative syntaxes (structural modes) for marking up
endnotes, as it was possible to do a simple search and replace to
make the Project Gutenberg ascii presentation suitable for SiSU,
using the older endnote markup style. This example instructs the
program to use regular expressions, in this example the words: none;
none; BOOK|FIRST|SECOND; CHAPTER; occurring at the beginning of a line,
to identify what should be treated as different levels of heading in a
document (and used to make the table of contents). Note that there was
very little markup required after the document headers and
Project Gutenberg legal notices. As I presume the legal notices are
similar in Project Gutenberg documents (and I could not bear to think
of preparing the same legal notices twice), I moved those to the "skin"
for the Project, and these are now represented in the markup by the
<:insert1> and <:insert2> markers; the legal notices are
available for similar insertion into the next Project Gutenberg text
prepared for SiSU, should there be one. I did a
stylesheet/skin for the Gutenberg Project, ... I may have to remove.
140
document manifest 11
11. < http://www.jus.uio.no/sisu/sisu_manual/war_and_peace.leo_tolstoy/sisu_manifest.html>
141
html, segmented text
142
html, scroll, document in one
143
pdf, landscape
144
pdf, portrait
145
open document
146
xhtml scroll
147
xml, sax
148
xml, dom
149
plain text utf-8
150
concordance
151
dcc, document content certificate (digests)
152
markup source text
153
zipped markup source pod
154
"Don Quixote" , Miguel de Cervantes [Saavedra], translated by
John Ormsby, PG Etext 996
155
"Don Quixote", Miguel de Cervantes [Saavedra]
156
document manifest 12
12. < http://www.jus.uio.no/sisu/sisu_manual/don_quixote.miguel_de_cervantes/sisu_manifest.html>
157
html, segmented text
158
html, scroll, document in one
159
pdf, landscape
160
pdf, portrait
161
open document
162
xhtml scroll
163
xml, sax
164
xml, dom
165
plain text utf-8
166
concordance
167
dcc, document content certificate (digests)
168
markup source text
169
zipped markup source pod
170
"Gulliver's Travels" , Jonathan Swift, transcribed from the 1892
George Bell and Sons edition by David Price, PG Etext 829
171
"Gulliver's Travels", Jonathan Swift
172
document manifest 13
13. < http://www.jus.uio.no/sisu/sisu_manual/gullivers_travels.jonathan_swift/sisu_manifest.html>
173
html, segmented text
174
html, scroll, document in one
175
pdf, landscape
176
pdf, portrait
177
open document
178
xhtml scroll
179
xml, sax
180
xml, dom
181
plain text utf-8
182
concordance
183
dcc, document content certificate (digests)
184
markup source text
185
zipped markup source pod
186
"Alice's Adventures in Wonderland" , Lewis Carroll, PG Etext 11
187
"Alice's Adventures in Wonderland", Lewis Carroll
188
document manifest 14
14. < http://www.jus.uio.no/sisu/sisu_manual/alices_adventures_in_wonderland.lewis_carroll/sisu_manifest.html>
189
html, segmented text
190
html, scroll, document in one
191
pdf, landscape
192
pdf, portrait
193
open document
194
xhtml scroll
195
xml, sax
196
xml, dom
197
plain text utf-8
198
concordance
199
dcc, document content certificate (digests)
200
markup source text
201
zipped markup source pod
202
"Through The Looking-Glass" , Lewis Carroll, PG Etext 12
203
"Through The Looking-Glass", Lewis Carroll
204
document manifest 15
15. < http://www.jus.uio.no/sisu/sisu_manual/through_the_looking_glass.lewis_carroll/sisu_manifest.html>
205
html, segmented text
206
html, scroll, document in one
207
pdf, landscape
208
pdf, portrait
209
open document
210
xhtml scroll
211
xml, sax
212
xml, dom
213
plain text utf-8
214
concordance
215
dcc, document content certificate (digests)
216
markup source text
217
zipped markup source pod
218
"Alice's Adventures in Wonderland" and "Through The
Looking-Glass" , Lewis Carroll, PG Etexts 11 and 12
219
"Alice's Adventures in Wonderland" and "Through The Looking-Glass",
Lewis Carroll
220
document manifest 16
16. < http://www.jus.uio.no/sisu/sisu_manual/alices_adventures_in_wonderland_and_through_the_looking_glass.lewis_carroll/sisu_manifest.html>
221
html, segmented text
222
html, scroll, document in one
223
pdf, landscape
224
pdf, portrait
225
open document
226
xhtml scroll
227
xml, sax
228
xml, dom
229
plain text utf-8
230
concordance
231
dcc, document content certificate (digests)
232
markup source text
233
zipped markup source pod
234
"Gnu Public License 2" , (GPL 2) Free Software Foundation
235
"Gnu Public License 2", (GPL 2) Free Software Foundation
236
document manifest 17
17. < http://www.jus.uio.no/sisu/sisu_manual/gpl2.fsf/sisu_manifest.html>
237
html, segmented text
238
html, scroll, document in one
239
pdf, landscape
240
pdf, portrait
241
open document
242
xhtml scroll
243
xml, sax
244
xml, dom
245
plain text utf-8
246
concordance
247
dcc, document content certificate (digests)
248
markup source text
249
zipped markup source pod
250
"Gnu Public License v3 - Third discussion draft" , (GPLv3) Free
Software Foundation
251
"Gnu Public License 3 - Third discussion draft", (GPL v3 draft3) Free
Software Foundation
252
document manifest 18
18. < http://www.jus.uio.no/sisu/sisu_manual/gpl3_draft3.fsf/sisu_manifest.html>
253
html, segmented text
254
html, scroll, document in one
255
pdf, landscape
256
pdf, portrait
257
open document
258
xhtml scroll
259
xml, sax
260
xml, dom
261
plain text utf-8
262
concordance
263
dcc, document content certificate (digests)
264
markup source text
265
zipped markup source pod
266
"Debian Social Contract"
267
"Debian Social Contract"
268
document manifest 19
19. < http://www.jus.uio.no/sisu/sisu_manual/debian_social_contract_v1.1/sisu_manifest.html>
269
html, segmented text
270
html, scroll, document in one
271
pdf, landscape
272
pdf, portrait
273
open document
274
xhtml scroll
275
xml, sax
276
xml, dom
277
plain text utf-8
278
concordance
279
dcc, document content certificate (digests)
280
markup source text
281
zipped markup source pod
282
"Debian Constitution v1.3" , (simple/default markup)
283
"Debian Constitution v1.3"
284
document manifest 20
20. < http://www.jus.uio.no/sisu/sisu_manual/debian_constitution_v1.3/sisu_manifest.html>
285
html, segmented text
286
html, scroll, document in one
287
pdf, landscape
288
pdf, portrait
289
open document
290
xhtml scroll
291
xml, sax
292
xml, dom
293
plain text utf-8
294
concordance
295
dcc, document content certificate (digests)
296
markup source text
297
zipped markup source pod
298
"Debian Constitution v1.3" , (markup adjusted for output to more
closely match the original)
299
"Debian Constitution v1.3", (markup adjusted for output to more closely
match the original)
300
document manifest 21
21. < http://www.jus.uio.no/sisu/sisu_manual/debian_constitution_v1.3.adjusted/sisu_manifest.html>
301
html, segmented text
302
html, scroll, document in one
303
pdf, landscape
304
pdf, portrait
305
open document
306
xhtml scroll
307
xml, sax
308
xml, dom
309
plain text utf-8
310
concordance
311
dcc, document content certificate (digests)
312
markup source text
313
zipped markup source pod
314
"Debian Constitution v1.2" , (simple/default markup)
315
"Debian Constitution v1.2 (more translations)"
316
document manifest 22
22. < http://www.jus.uio.no/sisu/sisu_manual/debian_constitution_v1.2/sisu_manifest.html>
317
html, segmented text
318
html, scroll, document in one
319
pdf, landscape
320
pdf, portrait
321
open document
322
xhtml scroll
323
xml, sax
324
xml, dom
325
plain text utf-8
326
concordance
327
dcc, document content certificate (digests)
328
markup source text
329
zipped markup source pod
330
"Debian Constitution v1.2" , (markup adjusted for output to more
closely match the original)
331
"Debian Constitution (more translations)", (markup adjusted for output
to more closely match the original)
332
document manifest 23
23. < http://www.jus.uio.no/sisu/sisu_manual/debian_constitution_v1.2.adjusted/sisu_manifest.html>
333
html, segmented text
334
html, scroll, document in one
335
pdf, landscape
336
pdf, portrait
337
open document
338
xhtml scroll
339
xml, sax
340
xml, dom
341
plain text utf-8
342
concordance
343
dcc, document content certificate (digests)
344
markup source text
345
zipped markup source pod
346
"A Uniform Sales Terminology" , Vikki Rogers and Albert Kritzer
347
"A Uniform Sales Terminology", Vikki Rogers and Albert Kritzer
348
document manifest 24
24. < http://www.jus.uio.no/sisu/sisu_manual/a_uniform_international_sales_terminology.vikki_rogers.and.albert_kritzer/sisu_manifest.html>
349
html, segmented text
350
html, scroll, document in one
351
pdf, landscape
352
pdf, portrait
353
open document
354
xhtml scroll
355
xml, sax
356
xml, dom
357
plain text utf-8
358
concordance
359
dcc, document content certificate (digests)
360
markup source text
361
zipped markup source pod
362
"The Autonomous Contract" 1997 - markup sample
363
"The Autonomous Contract" 1997 - markup sample
364
document manifest 25
25. < http://www.jus.uio.no/sisu/sisu_manual/the_autonomous_contract.amissah.19970710/sisu_manifest.html>
365
html, segmented text
366
html, scroll, document in one
367
pdf, landscape
368
pdf, portrait
369
open document
370
xhtml scroll
371
xml, sax
372
xml, dom
373
plain text utf-8
374
concordance
375
dcc, document content certificate (digests)
376
markup source text
377
zipped markup source pod
378
"The Autonomous Contract Revisited" - markup sample
379
"The Autonomous Contract Revisited" - markup sample 26
26. < http://www.jus.uio.no/sisu/autonomy_markup0/toc.html>
Alternative markup variations revolving around endnotes:
(i) As above, markup with embedded endnotes, and a header list of
words/phrases to emphasise < http://www.jus.uio.no/sisu/sample/syntax/autonomy_markup0.sst.html>
< http://www.jus.uio.no/sisu/sample/markup/autonomy_markup0.sst>
(ii) Again markup with embedded endnotes, but with font faces changed
within paragraphs rather than in the header as in (i) < http://www.jus.uio.no/sisu/sample/syntax/autonomy_markup1.sst.html>
< http://www.jus.uio.no/sisu/sample/markup/autonomy_markup1.sst>
(iii) Markup with endnote placemarks within paragraphs, the
endnotes following the paragraph that contains them < http://www.jus.uio.no/sisu/sample/syntax/autonomy_markup2.sst.html>
< http://www.jus.uio.no/sisu/sample/markup/autonomy_markup2.sst>
(iv) Another alternative is to place the marked up endnotes
sequentially at the end of the text. This also works. The paragraph
variant (iii) is perhaps easier to check visually should there be
missing endnotes, but variant (iv) may better suit the conversion of
documents that have already been prepared in another form.
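The relation between the placemark variants (iii, iv) and the embedded
variants (i, ii) can be sketched as a small conversion. This is an
illustration only, not part of SiSU: it assumes that a placemark is
written ~^ , that each endnote line begins with ^~ , and that an
embedded endnote is written ~{ ... }~ ; check these markers against the
markup samples linked above. The function name embed_endnotes is
hypothetical.

```python
import re


def embed_endnotes(text):
    """Move '^~ note' lines into the body at the matching '~^' placemarks.

    The markers used here (~^, ^~, ~{ }~) are assumptions made for this
    sketch, not a statement of what SiSU itself accepts.
    """
    lines = text.splitlines()
    # Endnote lines, in order of appearance, with the leading marker removed.
    notes = [line[2:].strip() for line in lines if line.startswith("^~")]
    body = "\n".join(line for line in lines if not line.startswith("^~"))

    # A count mismatch means an endnote or a placemark is missing.
    placemarks = len(re.findall(r"~\^", body))
    if placemarks != len(notes):
        raise ValueError(
            "%d placemarks but %d endnotes" % (placemarks, len(notes))
        )

    remaining = iter(notes)
    # Replace each placemark, left to right, with the next endnote.
    return re.sub(r"~\^", lambda _m: "~{ %s }~" % next(remaining), body)


source = "First claim.~^ Second claim.~^\n\n^~ Note one.\n^~ Note two.\n"
converted = embed_endnotes(source)
print(converted)
```

The count check mirrors the remark above about missing endnotes: the
conversion refuses to guess when placemarks and endnotes do not pair up.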
380
document manifest 27
27. < http://www.jus.uio.no/sisu/sisu_manual/autonomy_markup0/sisu_manifest.html>
381
html, segmented text
382
html, scroll, document in one
383
pdf, landscape
384
pdf, portrait
385
open document
386
xhtml scroll
387
xml, sax
388
xml, dom
389
plain text utf-8
390
concordance
391
dcc, document content certificate (digests)
392
markup source text
393
zipped markup source pod
394
"United Nations Convention on Contracts for the International Sale
of Goods"
395
"United Nations Convention on Contracts for the International Sale of
Goods" 28
28. < http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/toc.html>
This example instructs the program to use regular expressions,
in this example the words: Part, Chapter, Section, Article occurring at
the beginning of a line, to identify what should be treated as
different levels of heading in a document (and used to make the table
of contents).
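A minimal sketch of the idea, in Python rather than in SiSU's own code:
one regular expression per heading level, anchored at the beginning of
a line, is used to pick out headings and indent them into a table of
contents. The function names and the sample lines are made up for
illustration; a level given as None has no pattern (compare the "none"
entries in the War and Peace example).

```python
import re


def compile_level_patterns(words_per_level):
    """Return one compiled regex per heading level (None for 'no pattern')."""
    patterns = []
    for words in words_per_level:
        if words is None:
            patterns.append(None)
        else:
            # Match the word(s) only at the start of a line, as a whole word.
            patterns.append(re.compile(r"^(?:%s)\b" % words))
    return patterns


def find_headings(lines, patterns):
    """Yield (level, text) for each line that begins with a level's word."""
    for line in lines:
        for level, pattern in enumerate(patterns, start=1):
            if pattern is not None and pattern.match(line):
                yield level, line.strip()
                break


# Made-up sample lines, not quoted from the Convention.
sample = [
    "Part I",
    "Chapter I",
    "Article 1",
    "Some body text that is not a heading.",
    "Chapter II",
    "Article 7",
]
patterns = compile_level_patterns(["Part", "Chapter", "Section", "Article"])
toc = list(find_headings(sample, patterns))
for level, text in toc:
    print("  " * (level - 1) + text)
```

A body line that happens to begin with one of the words (for example
"Part of the goods ...") would also be taken as a heading, so this kind
of matching suits documents whose headings follow a regular pattern.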
396
document manifest 29
29. < http://www.jus.uio.no/sisu/sisu_manual/un_contracts_international_sale_of_goods_convention_1980/sisu_manifest.html>
397
html, segmented text
398
html, scroll, document in one
399
pdf, landscape
400
pdf, portrait
401
open document
402
xhtml scroll
403
xml, sax
404
xml, dom
405
plain text utf-8
406
concordance
407
dcc, document content certificate (digests)
408
markup source text
409
zipped markup source pod
410
PECL the "Principles of European Contract Law"
411
"Principles of European Contract Law"
412
document manifest 30
30. < http://www.jus.uio.no/sisu/sisu_manual/eu_contract_principles_parts_1_to_3_2002/sisu_manifest.html>
413
html, segmented text
414
html, scroll, document in one
415
pdf, landscape
416
pdf, portrait
417
open document
418
xhtml scroll
419
xml, sax
420
xml, dom
421
plain text utf-8
422
concordance
423
dcc, document content certificate (digests)
424
markup source text
425
zipped markup source pod
426
1.3 SQL - PostgreSQL, SQLite
427
A sample search form is available at < http://search.sisudoc.org>
428
A few canned searches, showing object numbers. Search for:
429
English documents matching Linux OR Debian
430
GPL OR Richard Stallman
431
invention OR innovation in English language
432
copyright in English language documents
433
Note that the searches done in this form are case sensitive.
434
Expand those same searches, showing the matching text in each document:
435
English documents matching Linux OR Debian
436
GPL OR Richard Stallman
437
invention OR innovation in English language
438
copyright in English language documents
439
Note that you may have results displayed either as the documents matched
together with the object number locations within each matched document
that meet the search criteria, or as the names of the documents matched
along with the objects (paragraphs) that meet the search criteria. 31
31. Of this feature, when it was demonstrated to an IBM software
innovations evaluator in 2004, he said, to paraphrase: this could be of
interest to us. We have large document management systems; you can
search hundreds of thousands of documents and we can tell you which
documents meet your search criteria, but there is no way we can tell
you, without opening each document, where within each your matches are
found.
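The two result modes can be sketched with an in-memory SQLite database.
The table layout (doc_objects with title, ocn and body columns), the
function names and the sample rows are all hypothetical and are not
SiSU's actual schema; the point is only that storing one row per object
lets a search report where in each document the matches fall. The
SQLite function instr() is used because it compares case-sensitively,
in keeping with the note above on case-sensitive searches.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical layout: one row per object (paragraph), keyed by object number.
conn.execute("CREATE TABLE doc_objects (title TEXT, ocn INTEGER, body TEXT)")
conn.executemany(
    "INSERT INTO doc_objects VALUES (?, ?, ?)",
    [
        ("Sample A", 1, "Debian is a free operating system."),
        ("Sample A", 2, "This paragraph mentions neither term."),
        ("Sample B", 5, "Linux is a kernel."),
        ("Sample B", 9, "The word linux in lower case is not matched."),
    ],
)

# instr() is case-sensitive, unlike SQLite's default LIKE for ASCII text.
MATCH = "instr(body, ?) > 0 OR instr(body, ?) > 0"


def locations(conn, term_a, term_b):
    """Documents matched, each with the object numbers that match."""
    rows = conn.execute(
        "SELECT title, ocn FROM doc_objects WHERE " + MATCH
        + " ORDER BY title, ocn",
        (term_a, term_b),
    )
    found = {}
    for title, ocn in rows:
        found.setdefault(title, []).append(ocn)
    return found


def matching_text(conn, term_a, term_b):
    """Documents matched, with the text of each matching object."""
    return conn.execute(
        "SELECT title, ocn, body FROM doc_objects WHERE " + MATCH
        + " ORDER BY title, ocn",
        (term_a, term_b),
    ).fetchall()


print(locations(conn, "Linux", "Debian"))
for title, ocn, body in matching_text(conn, "Linux", "Debian"):
    print(title, ocn, body)
```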
440
1.4 Lex Mercatoria as an example
441
There is quite a bit to peruse if you explore the site Lex Mercatoria:
442
< http://www.lexmercatoria.org/>
32
32. < http://www.jus.uio.no/lm/index>
443
or perhaps:
444
< http://lexmercatoria.org/treaties.and.organisations/lm.chronological>
33
33. < http://www.jus.uio.no/lm/treaties.and.organisations/lm.chronological>
445
1.5 For good measure the markup for a document with lots of (simple)
tables
446
SiSU is not optimised for table making, but does handle simple
tables.
447
SiSU marked up file with tables 34
34. < http://www.jus.uio.no/sisu/sample/syntax/un_conventions_membership_status.sst.html>
< http://www.jus.uio.no/sisu/sample/markup/un_conventions_membership_status.sst>
448
Output of table file example 35
35. < http://www.jus.uio.no/lm/un_conventions_membership_status/toc.html>
449
1.6 And a link to the output of a reported case
450
< http://www.jus.uio.no/lm/england.fothergill.v.monarch.airlines.hl.1980/toc.html>
451
Endnotes
0