Title: SiSU - SiSU information Structuring Universe / Structured information, Serialized Units - Markup Samples, Output Examples
Creator: Ralph Amissah
Rights: Copyright (C) Ralph Amissah 2007, part of SiSU documentation, License GPL 3
Type: information
Subject: ebook, epublishing, electronic book, electronic publishing, electronic document, electronic citation, data structure, citation systems, search
Date created: 2002-11-12
Date issued: 2002-11-12
Date available: 2002-11-12
Date modified: 2007-09-16
Date: 2007-09-16
filename: sisu_examples.sst
version number: 1.25
version date: 2007/09/08
1 SiSU - SiSU information Structuring Universe / Structured information, Serialized Units - Markup Samples, Output Examples,
Ralph Amissah
2 SiSU Markup and Output Examples 3 1. Markup and Output Examples 4 1.1 Markup examples 5 Current markup examples and document output samples are provided at <http://www.jus.uio.no/sisu/SiSU/2.html> 6 Some markup with syntax highlighting may be found under <http://www.jus.uio.no/sisu/sample/syntax> but is not as up to date. 7 For some documents hardly any markup at all is required at all, other than a header, and an indication that the levels to be taken into account by the program in generating its output are. 8 1.2 A few book (and other) examples 9 [aukio.png] "Aukio, by Leena Krohn" 1 1 Reproduced with the kind permission of author and artist Leena Krohn, <http://www.kaapeli.fi/krohn> "Aukio" is from the work "Sphinx or Robot" <http://www.jus.uio.no/sisu/sphinx_or_robot.leena_krohn.1996> which is included as a book example in this section, together with another of the author's works, "Tainaron" <http://www.jus.uio.no/sisu/tainaron.leena_krohn.1998> 10 "The Wealth of Networks", Yochai Benkler 11 "The Wealth of Networks", Yochai Benkler 12 document manifest 2 2 <http://www.jus.uio.no/sisu/sisu_manual/the_wealth_of_networks.yochai_benkler/sisu_manifest.html> 13 html, segmented text 14 html, scroll, document in one 15 pdf, landscape 16 pdf, portrait 17 open document 18 xhtml scroll 19 xml, sax 20 xml, dom 21 plain text utf-8 22 concordance 23 dcc, document content certificate (digests) 24 markup source text 25 zipped markup source pod 26 "Free Culture", Lawrence Lessig 27 "Free Culture", Lawrence Lessig 28 document manifest 3 3 <http://www.jus.uio.no/sisu/sisu_manual/free_culture.lawrence_lessig/sisu_manifest.html> 29 html, segmented text 30 html, scroll, document in one 31 pdf, landscape 32 pdf, portrait 33 open document 34 xhtml scroll 35 xml, sax 36 xml, dom 37 plain text utf-8 38 concordance 39 dcc, document content certificate (digests) 40 markup source text 41 zipped markup source pod 42 "Free as in Freedom: Richard Stallman's Crusade for Free Software", by Sam Williams 43 "Free as in Freedom: Richard Stallman's Crusade for Free Software", by Sam Williams 44 document manifest 4 4 <http://www.jus.uio.no/sisu/sisu_manual/free_as_in_freedom.richard_stallman_crusade_for_free_software.sam_williams/sisu_manifest.html> 45 html, segmented text 46 html, scroll, document in one 47 pdf, landscape 48 pdf, portrait 49 open document 50 xhtml scroll 51 xml, sax 52 xml, dom 53 plain text utf-8 54 concordance 55 dcc, document content certificate (digests) 56 markup source text 57 zipped markup source pod 58 "Free For All: How Linux and the Free Software Movement Undercut the High Tech Titans", by Peter Wayner 59 "Free For All: How Linux and the Free Software Movement Undercut the High Tech Titans", by Peter Wayner 60 document manifest 5 5 <http://www.jus.uio.no/sisu/sisu_manual/free_for_all.peter_wayner/sisu_manifest.html> 61 html, segmented text 62 html, scroll, document in one 63 pdf, landscape 64 pdf, portrait 65 open document 66 xhtml scroll 67 xml, sax 68 xml, dom 69 plain text utf-8 70 concordance 71 dcc, document content certificate (digests) 72 markup source text 73 zipped markup source pod 74 "The Cathedral and the Bazaar", by Eric S. Raymond 75 "The Cathedral and the Bazaar", by Eric S. Raymond 76 document manifest 6 6 <http://www.jus.uio.no/sisu/sisu_manual/the_cathedral_and_the_bazaar.eric_s_raymond/sisu_manifest.html> 77 html, segmented text 78 html, scroll, document in one 79 pdf, landscape 80 pdf, portrait 81 open document 82 xhtml scroll 83 xml, sax 84 xml, dom 85 plain text utf-8 86 concordance 87 dcc, document content certificate (digests) 88 markup source text 89 zipped markup source pod 90 "Accelerando", Charles Stross 91 "Accelerando", Charles Stross 92 document manifest 7 7 <http://www.jus.uio.no/sisu/sisu_manual/accelerando.charles_stross/sisu_manifest.html> 93 html, segmented text 94 html, scroll, document in one 95 pdf, landscape 96 pdf, portrait 97 open document 98 xhtml scroll 99 xml, sax 100 xml, dom 101 plain text utf-8 102 concordance 103 dcc, document content certificate (digests) 104 markup source text 105 zipped markup source pod 106 "Tainaron", Leena Krohn 107 "Tainaron", Leena Krohn 108 document manifest 8 8 <http://www.jus.uio.no/sisu/sisu_manual/tainaron.leena_krohn.1998/sisu_manifest.html> 109 html, segmented text 110 html, scroll, document in one 111 pdf, landscape 112 pdf, portrait 113 open document 114 xhtml scroll 115 xml, sax 116 xml, dom 117 plain text utf-8 118 concordance 119 dcc, document content certificate (digests) 120 markup source text 121 zipped markup source pod 122 "Sphinx or Robot", Leena Krohn 123 [i_sor.png] "Sphinx or Robot by Leena Krohn" 124 "Sphinx or Robot", Leena Krohn 125 document manifest 9 9 <http://www.jus.uio.no/sisu/sisu_manual/sphinx_or_robot.leena_krohn.1996/sisu_manifest.html> 126 html, segmented text 127 html, scroll, document in one 128 pdf, landscape 129 pdf, portrait 130 open document 131 xhtml scroll 132 xml, sax 133 xml, dom 134 plain text utf-8 135 concordance 136 dcc, document content certificate (digests) 137 markup source text 138 zipped markup source pod 139 "War and Peace", Leo Tolstoy, PG Etext 2600 140 "War and Peace", Leo Tolstoy 10 10 <http://www.jus.uio.no/sisu/war_and_peace.leo_tolstoy/toc.html>
The ascii text was taken from Project Gutenberg. The markup transforms required are trivial. Of interest, in this instance I am saved by having alternative syntaxes/(structural modes) for marking up endnotes... as it was possible to do a simple search and replace to make the Project Gutenberg ascii presentation suitable for SiSU, using the older endnote markup style. This example instructs the program to use regular expressions, in this example the words: none; none; BOOK|FIRST|SECOND; CHAPTER; occurring at the beginning of a line, to identify what should be treated as different levels of heading in a document (and used to make the table of contents). Note that there was very little markup required after the document headers and Project Gutenberg legal notices. As I presume the legal notices are similar in Project Gutenberg documents, (and I could not bear to think of preparing the same legal notices twice), I moved those to the "skin" for the Project, and these are now represented in the markup by <:insert1> and <:insert2> and the legal notices are available for similar insertion into the next Project Gutenberg text prepared for SiSU, should there be one.
I did a stylesheet/skin for the Gutenberg Project, ... I may have to remove. The markup transforms required are trivial. Of interest, in this instance I am saved by having alternative syntaxes/(structural modes) for marking up endnotes... as it is possible to do a simple search and replace to make Project Gutenberg ascii presentations suitable for SiSU using the older endnote markup style. There is very little markup required after the document headers and Project Gutenberg legal notices. As I presume the legal notices are similar in Project Gutenberg documents, (and I could not bear to think of preparing the same legal notices twice), I moved those to the "skin" for the Project, and these are now represented in the markup by the <:insert1> and <:insert2> markers and the legal notices are available for similar insertion into the next Project Gutenberg text prepared for SiSU, should there be one.
141 document manifest 11 11 <http://www.jus.uio.no/sisu/sisu_manual/war_and_peace.leo_tolstoy/sisu_manifest.html> 142 html, segmented text 143 html, scroll, document in one 144 pdf, landscape 145 pdf, portrait 146 open document 147 xhtml scroll 148 xml, sax 149 xml, dom 150 plain text utf-8 151 concordance 152 dcc, document content certificate (digests) 153 markup source text 154 zipped markup source pod 155 "Don Quixote", Miguel de Cervantes [Saavedra], translated by John Ormsby, PG Etext 996 156 "Don Quixote", Miguel de Cervantes [Saavedra] 157 document manifest 12 12 <http://www.jus.uio.no/sisu/sisu_manual/don_quixote.miguel_de_cervantes/sisu_manifest.html> 158 html, segmented text 159 html, scroll, document in one 160 pdf, landscape 161 pdf, portrait 162 open document 163 xhtml scroll 164 xml, sax 165 xml, dom 166 plain text utf-8 167 concordance 168 dcc, document content certificate (digests) 169 markup source text 170 zipped markup source pod 171 "Gulliver's Travels", Jonathan Swift, transcribed from the 1892 George Bell and Sons edition by David Price, PG Etext 829 172 "Gulliver's Travels", Jonathan Swift 173 document manifest 13 13 <http://www.jus.uio.no/sisu/sisu_manual/gullivers_travels.jonathan_swift/sisu_manifest.html> 174 html, segmented text 175 html, scroll, document in one 176 pdf, landscape 177 pdf, portrait 178 open document 179 xhtml scroll 180 xml, sax 181 xml, dom 182 plain text utf-8 183 concordance 184 dcc, document content certificate (digests) 185 markup source text 186 zipped markup source pod 187 "Alice's Adventures in Wonderland", Lewis Carroll, PG Etext 11 188 "Alice's Adventures in Wonderland", Lewis Carroll 189 document manifest 14 14 <http://www.jus.uio.no/sisu/sisu_manual/alices_adventures_in_wonderland.lewis_carroll/sisu_manifest.html> 190 html, segmented text 191 html, scroll, document in one 192 pdf, landscape 193 pdf, portrait 194 open document 195 xhtml scroll 196 xml, sax 197 xml, dom 198 plain text utf-8 199 concordance 200 dcc, document content certificate (digests) 201 markup source text 202 zipped markup source pod 203 "Through The Looking-Glass", Lewis Carroll, PG Etext 12 204 "Through The Looking-Glass", Lewis Carroll 205 document manifest 15 15 <http://www.jus.uio.no/sisu/sisu_manual/through_the_looking_glass.lewis_carroll/sisu_manifest.html> 206 html, segmented text 207 html, scroll, document in one 208 pdf, landscape 209 pdf, portrait 210 open document 211 xhtml scroll 212 xml, sax 213 xml, dom 214 plain text utf-8 215 concordance 216 dcc, document content certificate (digests) 217 markup source text 218 zipped markup source pod 219 "Alice's Adventures in Wonderland" and "Through The Looking-Glass", Lewis Carroll, PG Etexts 11 and 12 220 "Alice's Adventures in Wonderland" and "Through The Looking-Glass", Lewis Carroll 221 document manifest 16 16 <http://www.jus.uio.no/sisu/sisu_manual/alices_adventures_in_wonderland_and_through_the_looking_glass.lewis_carroll/sisu_manifest.html> 222 html, segmented text 223 html, scroll, document in one 224 pdf, landscape 225 pdf, portrait 226 open document 227 xhtml scroll 228 xml, sax 229 xml, dom 230 plain text utf-8 231 concordance 232 dcc, document content certificate (digests) 233 markup source text 234 zipped markup source pod 235 "Gnu Public License 2", (GPL 2) Free Software Foundation 236 "Gnu Public License 2", (GPL 2) Free Software Foundation 237 document manifest 17 17 <http://www.jus.uio.no/sisu/sisu_manual/gpl2.fsf/sisu_manifest.html> 238 html, segmented text 239 html, scroll, document in one 240 pdf, landscape 241 pdf, portrait 242 open document 243 xhtml scroll 244 xml, sax 245 xml, dom 246 plain text utf-8 247 concordance 248 dcc, document content certificate (digests) 249 markup source text 250 zipped markup source pod 251 "Gnu Public License v3 - Third discussion draft", (GPLv3) Free Software Foundation 252 "Gnu Public License 3 - Third discussion draft", (GPL v3 draft3) Free Software Foundation 253 document manifest 18 18 <http://www.jus.uio.no/sisu/sisu_manual/gpl3_draft3.fsf/sisu_manifest.html> 254 html, segmented text 255 html, scroll, document in one 256 pdf, landscape 257 pdf, portrait 258 open document 259 xhtml scroll 260 xml, sax 261 xml, dom 262 plain text utf-8 263 concordance 264 dcc, document content certificate (digests) 265 markup source text 266 zipped markup source pod 267 "Debian Social Contract" 268 "Debian Social Contract" 269 document manifest 19 19 <http://www.jus.uio.no/sisu/sisu_manual/debian_social_contract_v1.1/sisu_manifest.html> 270 html, segmented text 271 html, scroll, document in one 272 pdf, landscape 273 pdf, portrait 274 open document 275 xhtml scroll 276 xml, sax 277 xml, dom 278 plain text utf-8 279 concordance 280 dcc, document content certificate (digests) 281 markup source text 282 zipped markup source pod 283 "Debian Constitution v1.3", (simple/default markup) 284 "Debian Constitution v1.3" 285 document manifest 20 20 <http://www.jus.uio.no/sisu/sisu_manual/debian_constitution_v1.3/sisu_manifest.html> 286 html, segmented text 287 html, scroll, document in one 288 pdf, landscape 289 pdf, portrait 290 open document 291 xhtml scroll 292 xml, sax 293 xml, dom 294 plain text utf-8 295 concordance 296 dcc, document content certificate (digests) 297 markup source text 298 zipped markup source pod 299 "Debian Constitution v1.3", (markup adjusted for output to more closely match the original) 300 "Debian Constitution v1.3", (markup adjusted for output to more closely match the original) 301 document manifest 21 21 <http://www.jus.uio.no/sisu/sisu_manual/debian_constitution_v1.3.adjusted/sisu_manifest.html> 302 html, segmented text 303 html, scroll, document in one 304 pdf, landscape 305 pdf, portrait 306 open document 307 xhtml scroll 308 xml, sax 309 xml, dom 310 plain text utf-8 311 concordance 312 dcc, document content certificate (digests) 313 markup source text 314 zipped markup source pod 315 "Debian Constitution v1.2", (simple/default markup) 316 "Debian Constitution v1.2 (more translations)" 317 document manifest 22 22 <http://www.jus.uio.no/sisu/sisu_manual/debian_constitution_v1.2/sisu_manifest.html> 318 html, segmented text 319 html, scroll, document in one 320 pdf, landscape 321 pdf, portrait 322 open document 323 xhtml scroll 324 xml, sax 325 xml, dom 326 plain text utf-8 327 concordance 328 dcc, document content certificate (digests) 329 markup source text 330 zipped markup source pod 331 "Debian Constitution v1.2", (markup adjusted for output to more closely match the original) 332 "Debian Constitution (more translations)", (markup adjusted for output to more closely match the original) 333 document manifest 23 23 <http://www.jus.uio.no/sisu/sisu_manual/debian_constitution_v1.2.adjusted/sisu_manifest.html> 334 html, segmented text 335 html, scroll, document in one 336 pdf, landscape 337 pdf, portrait 338 open document 339 xhtml scroll 340 xml, sax 341 xml, dom 342 plain text utf-8 343 concordance 344 dcc, document content certificate (digests) 345 markup source text 346 zipped markup source pod 347 "A Uniform Sales Terminology", Vikki Rogers and Albert Kritzer 348 "A Uniform Sales Terminology", Vikki Rogers and Albert Kritzer 349 document manifest 24 24 <http://www.jus.uio.no/sisu/sisu_manual/a_uniform_international_sales_terminology.vikki_rogers.and.albert_kritzer/sisu_manifest.html> 350 html, segmented text 351 html, scroll, document in one 352 pdf, landscape 353 pdf, portrait 354 open document 355 xhtml scroll 356 xml, sax 357 xml, dom 358 plain text utf-8 359 concordance 360 dcc, document content certificate (digests) 361 markup source text 362 zipped markup source pod 363 "The Autonomous Contract" 1997 - markup sample 364 "The Autonomous Contract" 1997 - markup sample 365 document manifest 25 25 <http://www.jus.uio.no/sisu/sisu_manual/the_autonomous_contract.amissah.19970710/sisu_manifest.html> 366 html, segmented text 367 html, scroll, document in one 368 pdf, landscape 369 pdf, portrait 370 open document 371 xhtml scroll 372 xml, sax 373 xml, dom 374 plain text utf-8 375 concordance 376 dcc, document content certificate (digests) 377 markup source text 378 zipped markup source pod 379 "The Autonomous Contract Revisited" - markup sample 380 "The Autonomous Contract Revisited" - markup sample 26 26 <http://www.jus.uio.no/sisu/autonomy_markup0/toc.html>
alternative markup variations revolving around endnotes
(i) as above, markup with embedded endnotes, and header list of words/phrases to emphasise
<http://www.jus.uio.no/sisu/sample/syntax/autonomy_markup0.sst.html>
<http://www.jus.uio.no/sisu/sample/markup/autonomy_markup0.sst>
(ii) Again markup with embedded endnotes, but font faces changed within paragraphs rather than in header as in i
<http://www.jus.uio.no/sisu/sample/syntax/autonomy_markup1.sst.html>
<http://www.jus.uio.no/sisu/sample/markup/autonomy_markup1.sst>
(iii) Markup with endnote placemarks within paragraphs, the endnotes following the paragraph that contains them <http://www.jus.uio.no/sisu/sample/syntax/autonomy_markup2.sst.html>
<http://www.jus.uio.no/sisu/sample/markup/autonomy_markup2.sst>
(iv) Another alternative is to place the marked up endnotes sequentially and at the end of the text. This also works. The paragraph variant iii is perhaps easier to visually check should there be missing endnotes; but this variant iv may better suit the conversion of alternatively pre-prepared documents.
381 document manifest 27 27 <http://www.jus.uio.no/sisu/sisu_manual/autonomy_markup0/sisu_manifest.html> 382 html, segmented text 383 html, scroll, document in one 384 pdf, landscape 385 pdf, portrait 386 open document 387 xhtml scroll 388 xml, sax 389 xml, dom 390 plain text utf-8 391 concordance 392 dcc, document content certificate (digests) 393 markup source text 394 zipped markup source pod 395 "United Nations Convention on Contracts for the International Sale of Goods" 396 "United Nations Convention on Contracts for the International Sale of Goods" 28 28 <http://www.jus.uio.no/sisu/un_contracts_international_sale_of_goods_convention_1980/toc.html>
This example instructs the program to use regular expressions, in this example the words: Part, Chapter, Section, Article occurring at the beginning of a line, to identify what should be treated as different levels of heading in a document (and used to make the table of contents).
This example instructs the program to use regular expressions, in this example the words: Part, Chapter, Section, Article occurring at the beginning of a line, to identify what should be treated as different levels of heading in a document (and used to make the table of contents).
397 document manifest 29 29 <http://www.jus.uio.no/sisu/sisu_manual/un_contracts_international_sale_of_goods_convention_1980/sisu_manifest.html> 398 html, segmented text 399 html, scroll, document in one 400 pdf, landscape 401 pdf, portrait 402 open document 403 xhtml scroll 404 xml, sax 405 xml, dom 406 plain text utf-8 407 concordance 408 dcc, document content certificate (digests) 409 markup source text 410 zipped markup source pod 411 PECL the "Principles of European Contract Law" 412 "Principles of European Contract Law" 413 document manifest 30 30 <http://www.jus.uio.no/sisu/sisu_manual/eu_contract_principles_parts_1_to_3_2002/sisu_manifest.html> 414 html, segmented text 415 html, scroll, document in one 416 pdf, landscape 417 pdf, portrait 418 open document 419 xhtml scroll 420 xml, sax 421 xml, dom 422 plain text utf-8 423 concordance 424 dcc, document content certificate (digests) 425 markup source text 426 zipped markup source pod 427 1.3 SQL - PostgreSQL, SQLite 428 A Sample search form is available at <http://search.sisudoc.org> 429 A few canned searches, showing object numbers. Search for: 430 English documents matching Linux OR Debian 431 GPL OR Richard Stallman 432 invention OR innovation in English language 433 copyright in English language documents 434 Note that the searches done in this form are case sensitive. 435 Expand those same searches, showing the matching text in each document: 436 English documents matching Linux OR Debian 437 GPL OR Richard Stallman 438 invention OR innovation in English language 439 copyright in English language documents 440 Note you may set results either for documents matched and object number locations within each matched document meeting the search criteria; or display the names of the documents matched along with the objects (paragraphs) that meet the search criteria.31 31 of this feature when demonstrated to an IBM software innovations evaluator in 2004 he said to paraphrase: this could be of interest to us. We have large document management systems, you can search hundreds of thousands of documents and we can tell you which documents meet your search criteria, but there is no way we can tell you without opening each document where within each your matches are found. 441 1.4 Lex Mercatoria as an example 442 There is quite a bit to peruse if you explore the site Lex Mercatoria: 443 <http://www.lexmercatoria.org/> 32 32 <http://www.jus.uio.no/lm/index> 444 or perhaps: 445 <http://lexmercatoria.org/treaties.and.organisations/lm.chronological> 33 33 <http://www.jus.uio.no/lm/treaties.and.organisations/lm.chronological> 446 1.5 For good measure the markup for a document with lots of (simple) tables 447 SiSU is not optimised for table making, but does handle simple tables. 448 SiSU marked up file with tables 34 34 <http://www.jus.uio.no/sisu/sample/syntax/un_conventions_membership_status.sst.html>
<http://www.jus.uio.no/sisu/sample/markup/un_conventions_membership_status.sst>
449 Output of table file example 35 35 <http://www.jus.uio.no/lm/un_conventions_membership_status/toc.html> 450 1.6 And a link to the output of a reported case 451 <http://www.jus.uio.no/lm/england.fothergill.v.monarch.airlines.hl.1980/toc.html> 0 Endnotes