Undark: In ToxicDocs.org, a Treasure Trove of Industry Secrets . “The site officially launched last Friday with an initial 20 million pages of material focused on six toxic substances: asbestos, benzene, lead, polychlorinated biphenyl (PCB), polyvinyl chloride, and silica, and millions more pages are coming.” The whole article is worth a read; in particular, the problems solved to process five million pages of documents with OCR. “A recent batch of about 1.5 million pages only required about three days to convert to OCR.” Yow!
The Intercept: 100,000 Pages Of Chemical Industry Secrets Gathered Dust In An Oregon Barn For Decades — Until Now
The Intercept: 100,000 Pages Of Chemical Industry Secrets Gathered Dust In An Oregon Barn For Decades — Until Now. “FOR DECADES, SOME of the dirtiest, darkest secrets of the chemical industry have been kept in Carol Van Strum’s barn. Creaky, damp, and prowled by the occasional black bear, the listing, 80-year-old structure in rural Oregon housed more than 100,000 pages of documents obtained through legal discovery in lawsuits against Dow, Monsanto, the Environmental Protection Agency, the U.S. Forest Service, the Air Force, and pulp and paper companies, among others. As of today, those documents and others that have been collected by environmental activists will be publicly available through a project called the Poison Papers.”
UC San Francisco: UCSF Chemical Industry Documents Archive Goes Live. “The UCSF Truth Tobacco Industry Documents Archive is well-known an widely used by tobacco control researchers and advocates… Few people realize that the tobacco documents are now part of the larger multi-industry UCSF Industry Documents Library that has included documents from Pharma for several years. Now we have added a third collection of documents, the new Chemical Industry Documents Archive that has been launched with nearly 2,000 documents and more to come in May and beyond.”