[XSL-LIST Mailing List Archive Home] [By Thread] [By Date]

Re: [xsl] Looking for "real-world" XML documents


Subject: Re: [xsl] Looking for "real-world" XML documents
From: "Syd Bauman s.bauman@xxxxxxx" <xsl-list-service@xxxxxxxxxxxxxxxxxxxxxx>
Date: Fri, 31 Oct 2014 18:27:07 -0000

Sorry to be late to the party. But Dimitre, you are of course welcome
to the entire WWP corpus, if you like. 

Other thoughts:

 * Just last week the TAPAS project announced they are open for
   (rudimentary) business. Navigate to any document at
   http://www.tapasproject.org/ and there's a "Download TEI" button.

 * Any of the DocSouth collections is available in XML (TEI P4). See
   http://docsouth.unc.edu/docsouthdata/

 * Sticking with the "south" theme, lots of TEI P5 manuscript
   descriptions are available from
   http://aso.obdurodon.org/browse.php.

 * The voting record of the United States Senate in XML is quite a
   pain to find; e.g.:
   http://www.senate.gov/legislative/LIS/roll_call_votes/vote1122/vote_112_2_00172.xml

 * There's this Extremely good conference which keeps its proceedings
   online, and a lot of the papers include the source XML. See 
   http://conferences.idealliance.org/extreme/biblio.html
   and
   http://www.balisage.net/Proceedings/index.html
   E.g.
   http://www.balisage.net/Proceedings/vol10/xml/Novatchev01/BalisageVol10-Novatchev01.xml
   (Alright, that particular one is probably not helpful to you :-)

 * If you'd like some pretty bad OCR-software-generated XML, each
   document in the collection at http://ufdc.ufl.edu/grimm has a link
   to the "TEI".

 * And, of course, if you want lots of examples of really awful XML,
   many of the .plist files on a Mac OS X system will be in the XML
   syntax.

-- 
 Syd Bauman, EMT-Paramedic
 Senior XML Programmer/Analyst
 Northeastern University Women Writers Project
 s.bauman@xxxxxxx or
 Syd_Bauman@xxxxxxxxxxxxxxxx


Current Thread
Keywords