Northeastern: Northeastern Library Pioneers New Methods Of Big Data Scholarship In Effort To Digitize History. “Dan Cohen has a vision for the future of the Northeastern library. Cohen, dean of the libraries and vice provost for information collaboration, wants to transform Northeastern’s vast archive of print and photographic data into a standardized digital form that will allow scholars to use modern Big Data techniques to analyze 300-year-old information. And now, thanks to a grant from the National Endowment for the Humanities, Cohen will have the resources he needs to transform that vision into reality.”
Simon Willison: Analyzing US Election Russian Facebook Ads . “Two interesting data sources have emerged in the past few weeks concerning the Russian impact on the 2016 US elections. FiveThirtyEight published nearly 3 million tweets from accounts associated with the Russian ‘Internet Research Agency’—see my article and searchable tweet archive here. Separately, the House Intelligence Committee Minority released 3,517 Facebook ads that were reported to have been bought by the Russian Internet Research Agency as a set of redacted PDF files.” Mr. Willison created some tools for exploring the data, as well as creating ancillary utilities.
FiveThirtyEight: Why We’re Sharing 3 Million Russian Troll Tweets. “FiveThirtyEight has obtained nearly 3 million tweets from accounts associated with the Internet Research Agency. To our knowledge, it’s the fullest empirical record to date of Russian trolls’ actions on social media, showing a relentless and systematic onslaught. In concert with the researchers who first pulled the tweets, FiveThirtyEight is uploading them to GitHub so that others can explore the data for themselves.”
Newswise: Berkeley Lab-Developed Digital Library is a Game Changer for Environmental Research. “… storing, accessing and incorporating environmental data into models is challenging due to the diversity of the datasets, which include measurement of properties associated with bedrock, groundwater, soils, vegetation and atmospheric compartments of environmental systems. Now accessing archival data generated by environmental field, experimental and modeling activities has gotten much easier with the April 1 launch of ESS-DIVE (Environmental System Science – Data Infrastructure for a Virtual Ecosystem)—a digital archive that serves as a repository for hundreds of U.S. Department of Energy (DOE)-funded research projects under the agency’s Environmental System Science umbrella, which includes the Subsurface Biogeochemical Research and Terrestrial Ecosystem Sciences programs. The digital library also serves datasets that were previously stored in DOE’s Carbon Dioxide Information Analysis Center archive.”
Calvin News: Calvin Prof Using AI To Hear Whisper In Twitter’s Whirlwind. “When looking at Twitter, computer science professor Keith Vander Linden formerly saw noise: a continuous roar of chaotic 280-character messages. From this tumult, however, he now discerns meaningful patterns: ‘if you look at enough tweets,’ says Vander Linden, ‘with the right kind of statistical models, you can derive a signal from that, you can find out information about what people are saying about stuff, and from that you can infer what they are thinking.'”
Forbes: Are Toilets The New Twitter? Using Smart City Data To Measure Interest. “One of the most common uses of Twitter by marketers and researchers is to gauge public interest and reaction to major events at scale. Social media makes such analyses relatively trivial through simple keyword searches and volume trendlines. However, last month during the World Cup, Tokyo’s waterworks bureau reminded us that as cities become increasingly instrumented, the myriad other signals in our daily digital exhaust offer powerful alternative signals that stretch beyond the digital divide.”
Library of Congress: Inside, Inside Baseball: A Look at the Construction of the Dataset Featuring the Smithsonian’s National Museum of African American History and Culture and the Library of Congress Digital Collections. “After weeks of preparations and four days of fast-pitched ideation and creation, this Friday LC Labs will unveil the efforts of ‘Inside Baseball’ – a collaboration between the Library of Congress, the Smithsonian National Museum of African American History and Culture, and JSTOR Labs. Joining the Baseball Americana batting lineup, this week of flash-building and design-thinking will debut new visualizations and prototypes to bring baseball-related digital collections to center field!”