GPO: GPO Makes Available Statute Compilations In USLM XML Format

GPO: GPO Makes Available Statute Compilations In USLM XML Format. “The U.S. Government Publishing Office (GPO) and its legislative data partners in the U.S. House of Representatives and the U.S. Senate have made Statute Compilations available in USLM XML format. This new format makes documents easier to use, read, and download. The public can access the compilations on GPO’s trusted digital repository govinfo, the one-stop site for information published by the Government.” USLM stands for United States Legislative Markup. You can read the schema user guide here.

The Register: Google Groups kills RSS support without notice

The Register: Google Groups kills RSS support without notice. “Google has either turned off RSS support in Google Groups without telling anyone, or has failed to notice that RSS in Groups no longer functions. RSS, which stands for either RDF Site Summary or Really Simple Syndication, is an open content syndication protocol. It allows people to subscribe to feeds from websites and receive syndicated content from them through an app capable of reading XML-based data.”

National Library of New Zealand: Papers Past data has been set free

National Library of New Zealand: Papers Past data has been set free . “Papers Past is the National Library’s fully text searchable website containing over 150 newspapers from New Zealand and the Pacific, as well as magazines, journals and government reports. As a result of the data being released, people can now access the data from 78 New Zealand newspapers from the Albertland Gazette to the Victoria Times, all published before 1900. The data itself consists of the METS/ALTO XML files for each issue. The XML files sit in the back of Papers Past and are what allows you to locate keywords within articles.”

The Future of the Past: Modernizing the New York Times Archive

The New York Times: The Future of the Past: Modernizing the New York Times Archive “In 2014, we launched a redesign of our entire digital platform that gave readers a more modern, fluid, and mobile-friendly experience through improvements such as faster performance, responsive layouts, and dynamic page rendering. While our new design upgraded reader experience for new articles, engineering and resource challenges prevented us from migrating previously published articles into this new design…. Today we are thrilled to announce that, thanks to a cross-team migration effort, nearly every article published since 2004 is available to our readers in the new and improved design.” Lots of great “under the hood” stuff in this article.

IRS Releases Form 990 Information as AWS Data Set

The IRS has made available a huge amount of form 990 data. The data are available on Amazon Web Services as a public data set, and it looks like it’s XML, so this is not a tool you can go searching through. It’s more like a back end of data for an API or something. Or you if were doing something small scale, you could probably use Google Sheets to importXML certain data, if you were able to use the URL standards to build the URL for the data you wanted.

Archives.gov Announces a Read/Write API

Archives.gov has a new read/write API. “The dataset for our catalog API contains all archival descriptions, authority records, digitized records (the images, videos, and so on) and their file metadata, all NARA web pages, and public contributions (tags, transcriptions, and comments). The API will allow developers to retrieve all of this metadata in specified formats (JSON or XML) for any given record or search results set.” A read API is pretty great, but wow, a read-write API?

Developing XML Standards for Sheet Music

The Register (oh, Register, I do so love your headlines) has an overview of the efforts being made to develop an XML standard for sheet music. “The Music Notation Community Group consists of representatives from some of the biggest names in the music notation software business who’ve come together to create a standardised way to display western music notation in your browser. The group is off to a strong start, having set out a list of what it believes are achievable goals in the next six to 12 months. However, don’t look for the W3C to endorse MusicXML, as the proposed standard is known.”