Title: SiSU - SiSU information Structuring Universe / Structured information, Serialized Units - Markup Samples, Output Examples
Creator: Ralph Amissah
Rights: Copyright (C) Ralph Amissah 2007, part of SiSU documentation, License GPL 3
Type: information
Subject: ebook, epublishing, electronic book, electronic publishing, electronic document, electronic citation, data structure, citation systems, search
Date created: 2002-11-12
Date issued: 2002-11-12
Date available: 2002-11-12
Date modified: 2007-09-16
Date: 2007-09-16
SiSU - SiSU information Structuring Universe / Structured information, Serialized Units - Markup Samples, Output Examples,
Ralph Amissah
SiSU Markup and Output Examples

1. Markup and Output Examples

1.1 Markup examples

Current markup examples and document output samples are provided at <http://www.jus.uio.no/sisu/SiSU/2.html>

Some markup with syntax highlighting may be found under <http://www.jus.uio.no/sisu/sample/syntax> but is not as up to date.

Some documents require hardly any markup at all, other than a header and an indication of the heading levels the program should take into account in generating its output.

1.2 A few book (and other) examples

[aukio.png] "Aukio, by Leena Krohn"
Reproduced with the kind permission of author and artist Leena Krohn, <http://www.kaapeli.fi/krohn>. "Aukio" is from the work "Sphinx or Robot" <http://www.jus.uio.no/sisu/sphinx_or_robot.leena_krohn.1996> which is included as a book example in this section, together with another of the author's works, "Tainaron" <http://www.jus.uio.no/sisu/tainaron.leena_krohn.1998>

"The Wealth of Networks", Yochai Benkler
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/the_wealth_of_networks.yochai_benkler/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"Free Culture", Lawrence Lessig
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/free_culture.lawrence_lessig/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"Free as in Freedom: Richard Stallman's Crusade for Free Software", by Sam Williams
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/free_as_in_freedom.richard_stallman_crusade_for_free_software.sam_williams/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"Free For All: How Linux and the Free Software Movement Undercut the High Tech Titans", by Peter Wayner
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/free_for_all.peter_wayner/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"The Cathedral and the Bazaar", by Eric S. Raymond
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/the_cathedral_and_the_bazaar.eric_s_raymond/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"Accelerando", Charles Stross
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/accelerando.charles_stross/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"Tainaron", Leena Krohn
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/tainaron.leena_krohn.1998/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"Sphinx or Robot", Leena Krohn
[i_sor.png] "Sphinx or Robot by Leena Krohn"
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/sphinx_or_robot.leena_krohn.1996/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"War and Peace", Leo Tolstoy, PG Etext 2600
<http://www.jus.uio.no/sisu/war_and_peace.leo_tolstoy/toc.html>
The ASCII text was taken from Project Gutenberg, and the markup transforms required are trivial. Of interest, in this instance I am saved by having alternative syntaxes (structural modes) for marking up endnotes, as it was possible to do a simple search and replace to make the Project Gutenberg ASCII presentation suitable for SiSU, using the older endnote markup style. This example instructs the program to use regular expressions (here the words none; none; BOOK|FIRST|SECOND; CHAPTER occurring at the beginning of a line) to identify what should be treated as different levels of heading in the document (and used to make the table of contents). Very little markup was required after the document headers and Project Gutenberg legal notices. As I presume the legal notices are similar across Project Gutenberg documents (and I could not bear to think of preparing the same legal notices twice), I moved them to the "skin" for the project; they are now represented in the markup by the <:insert1> and <:insert2> markers, and are available for similar insertion into the next Project Gutenberg text prepared for SiSU, should there be one. I did a stylesheet/skin for the Gutenberg Project, ... I may have to remove.
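As a rough illustration of the heading detection described above, the following Python sketch classifies lines by the quoted words. It is not SiSU's own code: the function names, the treatment of "none" as "no pattern for this level", the numbering of levels from 1, and the sample lines are all assumptions made for the example.

```python
import re

# Per-level words as quoted in the text; "none" is assumed to mean
# that no pattern is given for that level.
LEVEL_WORDS = ["none", "none", "BOOK|FIRST|SECOND", "CHAPTER"]


def compile_levels(level_words):
    """Return (level, compiled regex) pairs for levels that have a pattern."""
    compiled = []
    for level, words in enumerate(level_words, start=1):
        if words.lower() == "none":
            continue
        # Match only at the beginning of a line, on a whole word.
        compiled.append((level, re.compile(r"^(?:%s)\b" % words)))
    return compiled


def heading_level(line, compiled):
    """Return the heading level of a line, or None for ordinary text."""
    for level, pattern in compiled:
        if pattern.match(line):
            return level
    return None


# Illustrative sample lines (hypothetical input, not quoted from the markup file).
sample = [
    "BOOK ONE: 1805",
    "CHAPTER I",
    "An ordinary paragraph of text.",
    "CHAPTER II",
    "FIRST EPILOGUE",
]
compiled = compile_levels(LEVEL_WORDS)
levels = [heading_level(line, compiled) for line in sample]
print(levels)
```

The match is case sensitive and anchored to the start of the line, so a sentence that merely mentions a chapter in passing is left as ordinary text.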
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/war_and_peace.leo_tolstoy/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"Don Quixote", Miguel de Cervantes [Saavedra], translated by John Ormsby, PG Etext 996
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/don_quixote.miguel_de_cervantes/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"Gulliver's Travels", Jonathan Swift, transcribed from the 1892 George Bell and Sons edition by David Price, PG Etext 829
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/gullivers_travels.jonathan_swift/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"Alice's Adventures in Wonderland", Lewis Carroll, PG Etext 11
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/alices_adventures_in_wonderland.lewis_carroll/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"Through The Looking-Glass", Lewis Carroll, PG Etext 12
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/through_the_looking_glass.lewis_carroll/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"Alice's Adventures in Wonderland" and "Through The Looking-Glass", Lewis Carroll, PG Etexts 11 and 12
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/alices_adventures_in_wonderland_and_through_the_looking_glass.lewis_carroll/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"GNU General Public License 2", (GPL 2) Free Software Foundation
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/gpl2.fsf/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"GNU General Public License v3 - Third discussion draft", (GPL v3 draft3) Free Software Foundation
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/gpl3_draft3.fsf/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"Debian Social Contract"
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/debian_social_contract_v1.1/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"Debian Constitution v1.3", (simple/default markup)
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/debian_constitution_v1.3/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"Debian Constitution v1.3", (markup adjusted for output to more closely match the original)
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/debian_constitution_v1.3.adjusted/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"Debian Constitution v1.2 (more translations)", (simple/default markup)
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/debian_constitution_v1.2/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"Debian Constitution v1.2 (more translations)", (markup adjusted for output to more closely match the original)
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/debian_constitution_v1.2.adjusted/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"A Uniform Sales Terminology", Vikki Rogers and Albert Kritzer
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/a_uniform_international_sales_terminology.vikki_rogers.and.albert_kritzer/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"The Autonomous Contract" 1997 - markup sample
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/the_autonomous_contract.amissah.19970710/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"The Autonomous Contract Revisited" - markup sample
<http://www.jus.uio.no/sisu/autonomy_markup0/toc.html>
alternative markup variations revolving around endnotes
(i) as above, markup with embedded endnotes, and header list of words/phrases to emphasise
<http://www.jus.uio.no/sisu/sample/syntax/autonomy_markup0.sst.html>
<http://www.jus.uio.no/sisu/sample/markup/autonomy_markup0.sst>
(ii) Again markup with embedded endnotes, but font faces changed within paragraphs rather than in header as in i
<http://www.jus.uio.no/sisu/sample/syntax/autonomy_markup1.sst.html>
<http://www.jus.uio.no/sisu/sample/markup/autonomy_markup1.sst>
(iii) Markup with endnote placemarks within paragraphs, the endnotes following the paragraph that contains them
<http://www.jus.uio.no/sisu/sample/syntax/autonomy_markup2.sst.html>
<http://www.jus.uio.no/sisu/sample/markup/autonomy_markup2.sst>
(iv) Another alternative is to place the marked up endnotes sequentially and at the end of the text. This also works. The paragraph variant iii is perhaps easier to visually check should there be missing endnotes; but this variant iv may better suit the conversion of alternatively pre-prepared documents.
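The check for missing endnotes mentioned for variant iii can also be done mechanically. The Python sketch below counts placemarks in each paragraph and compares the count with the number of endnotes that follow it. The marker strings `~^` and `^~`, the blank-line separation of blocks, and the function name are assumptions made for the example; adjust them to the markup actually in use.

```python
# Assumed marker strings, not confirmed by the text above.
PLACEMARK = "~^"   # assumed in-paragraph endnote placemark
NOTE_START = "^~"  # assumed prefix of an endnote that follows a paragraph


def check_endnotes(text):
    """Return (paragraph_index, placemarks, notes) for each mismatch.

    Blocks are taken to be separated by blank lines. A block starting
    with NOTE_START is counted as an endnote of the preceding paragraph.
    """
    blocks = [b.strip() for b in text.split("\n\n") if b.strip()]
    counts = []  # one [placemarks, notes] pair per paragraph
    for block in blocks:
        if block.startswith(NOTE_START):
            if counts:
                counts[-1][1] += 1
            else:
                counts.append([0, 1])  # endnote with no paragraph before it
        else:
            counts.append([block.count(PLACEMARK), 0])
    return [(i, p, n) for i, (p, n) in enumerate(counts) if p != n]


# Hypothetical sample: the first paragraph lacks one endnote,
# the third has one endnote too many.
sample = "\n\n".join([
    "First claim~^ and second claim.~^",
    "^~ Note for the first claim.",
    "No notes here.",
    "A third claim.~^",
    "^~ Its note.",
    "^~ A stray extra note.",
])
problems = check_endnotes(sample)
print(problems)
```

A check like this only works for variant iii, where each endnote sits next to its paragraph; for variant iv only the overall totals could be compared.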
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/autonomy_markup0/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

"United Nations Convention on Contracts for the International Sale of Goods"
<http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/toc.html>
This example instructs the program to use regular expressions (here the words Part, Chapter, Section and Article, occurring at the beginning of a line) to identify what should be treated as different levels of heading in the document (and used to make the table of contents).
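To show how headings found this way could feed a table of contents, here is a small Python sketch. It is an illustration under stated assumptions rather than SiSU's implementation: the function name, the indentation scheme, and the sample lines are hypothetical.

```python
import re

# Words quoted in the text, ordered from the highest heading level down.
HEADING_WORDS = ["Part", "Chapter", "Section", "Article"]


def build_toc(lines, heading_words=HEADING_WORDS, indent="  "):
    """Return indented entries for lines that begin with a heading word."""
    patterns = [re.compile(r"^%s\b" % re.escape(w)) for w in heading_words]
    toc = []
    for line in lines:
        for depth, pattern in enumerate(patterns):
            if pattern.match(line):
                # Deeper levels get more indentation.
                toc.append(indent * depth + line.strip())
                break
    return toc


# Hypothetical input lines in the style of a convention text.
sample = [
    "Part I - Sphere of Application and General Provisions",
    "Chapter I - Sphere of Application",
    "Article 1",
    "(ordinary paragraph text)",
    "Article 2",
    "Part III - Sale of Goods",
    "Chapter II - Obligations of the Seller",
    "Section I - Delivery of the goods and handing over of documents",
    "Article 31",
]
toc = build_toc(sample)
print("\n".join(toc))
```

One limitation of so simple a rule: an ordinary sentence that happens to begin with one of the words (for example "Part of the goods ...") would also be picked up as a heading, so the source text needs a quick check after conversion.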
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/un_contracts_international_sale_of_goods_convention_1980/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

PECL the "Principles of European Contract Law"
document manifest: <http://www.jus.uio.no/sisu/sisu_manual/eu_contract_principles_parts_1_to_3_2002/sisu_manifest.html>
html, segmented text; html, scroll, document in one; pdf, landscape; pdf, portrait; open document; xhtml scroll; xml, sax; xml, dom; plain text utf-8; concordance; dcc, document content certificate (digests); markup source text; zipped markup source pod

1.3 SQL - PostgreSQL, SQLite

A sample search form is available at <http://search.sisudoc.org>

A few canned searches, showing object numbers. Search for:
English documents matching Linux OR Debian
GPL OR Richard Stallman
invention OR innovation in English language
copyright in English language documents

Note that the searches done in this form are case sensitive.

Expand those same searches, showing the matching text in each document:
English documents matching Linux OR Debian
GPL OR Richard Stallman
invention OR innovation in English language
copyright in English language documents

Note that you may set the results to show either the documents matched together with the object number locations within each document that meet the search criteria, or the names of the documents matched along with the objects (paragraphs) that meet the search criteria. (Of this feature, when it was demonstrated to an IBM software innovations evaluator in 2004, he said, to paraphrase: this could be of interest to us. We have large document management systems; you can search hundreds of thousands of documents and we can tell you which documents meet your search criteria, but there is no way we can tell you, without opening each document, where within each your matches are found.)

1.4 Lex Mercatoria as an example

There is quite a bit to peruse if you explore the site Lex Mercatoria:
<http://www.lexmercatoria.org/> (<http://www.jus.uio.no/lm/index>)
or perhaps:
<http://lexmercatoria.org/treaties.and.organisations/lm.chronological> (<http://www.jus.uio.no/lm/treaties.and.organisations/lm.chronological>)

1.5 For good measure the markup for a document with lots of (simple) tables

SiSU is not optimised for table making, but does handle simple tables.

SiSU marked up file with tables:
<http://www.jus.uio.no/sisu/sample/syntax/un_conventions_membership_status.sst.html>
<http://www.jus.uio.no/sisu/sample/markup/un_conventions_membership_status.sst>
Output of table file example:
<http://www.jus.uio.no/lm/un_conventions_membership_status/toc.html>

1.6 And a link to the output of a reported case

<http://www.jus.uio.no/lm/england.fothergill.v.monarch.airlines.hl.1980/toc.html>