Simon Willison: Analyzing US Election Russian Facebook Ads

Simon Willison: Analyzing US Election Russian Facebook Ads. “Two interesting data sources have emerged in the past few weeks concerning the Russian impact on the 2016 US elections. FiveThirtyEight published nearly 3 million tweets from accounts associated with the Russian ‘Internet Research Agency’—see my article and searchable tweet archive here. Separately, the House Intelligence Committee Minority released 3,517 Facebook ads that were reported to have been bought by the Russian Internet Research Agency as a set of redacted PDF files.” Mr. Willison built tools for exploring the data, along with some ancillary utilities.

FiveThirtyEight: Why We’re Sharing 3 Million Russian Troll Tweets

FiveThirtyEight: Why We’re Sharing 3 Million Russian Troll Tweets. “FiveThirtyEight has obtained nearly 3 million tweets from accounts associated with the Internet Research Agency. To our knowledge, it’s the fullest empirical record to date of Russian trolls’ actions on social media, showing a relentless and systematic onslaught. In concert with the researchers who first pulled the tweets, FiveThirtyEight is uploading them to GitHub so that others can explore the data for themselves.”

ZDNet: Dropbox still has questions to answer after claims of improper data sharing

ZDNet: Dropbox still has questions to answer after claims of improper data sharing. “In case you missed it, the highlights of a research study by Northwestern University published on Harvard Business Review revealed Dropbox had given them ‘access to project-folder-related data’ over a two-year period from about 400,000 users across 1,000 universities. The researchers initially claimed Dropbox gave them raw data, which they anonymized, but their report was updated after ZDNet reported Monday that Dropbox said it anonymized the data before handing it over.”

Newswise: Berkeley Lab-Developed Digital Library is a Game Changer for Environmental Research

Newswise: Berkeley Lab-Developed Digital Library is a Game Changer for Environmental Research. “… storing, accessing and incorporating environmental data into models is challenging due to the diversity of the datasets, which include measurement of properties associated with bedrock, groundwater, soils, vegetation and atmospheric compartments of environmental systems. Now accessing archival data generated by environmental field, experimental and modeling activities has gotten much easier with the April 1 launch of ESS-DIVE (Environmental System Science – Data Infrastructure for a Virtual Ecosystem)—a digital archive that serves as a repository for hundreds of U.S. Department of Energy (DOE)-funded research projects under the agency’s Environmental System Science umbrella, which includes the Subsurface Biogeochemical Research and Terrestrial Ecosystem Sciences programs. The digital library also serves datasets that were previously stored in DOE’s Carbon Dioxide Information Analysis Center archive.”

Signal: Social Media Helps Detect Nuclear Agreement Violations

Signal: Social Media Helps Detect Nuclear Agreement Violations. “Researchers at North Carolina (NC) State University have developed a new computational model that draws on normally incompatible data sets, such as satellite imagery and social media posts, to answer questions about what is happening in targeted locations. The model identifies violations of nuclear nonproliferation agreements. The data can include traditional sources, such as Geiger counter readings or multispectral data from satellite imagery, but many may be nontraditional and diverse, including Flickr and Twitter posts.”

Calvin News: Calvin Prof Using AI To Hear Whisper In Twitter’s Whirlwind

Calvin News: Calvin Prof Using AI To Hear Whisper In Twitter’s Whirlwind. “When looking at Twitter, computer science professor Keith Vander Linden formerly saw noise: a continuous roar of chaotic 280-character messages. From this tumult, however, he now discerns meaningful patterns: ‘if you look at enough tweets,’ says Vander Linden, ‘with the right kind of statistical models, you can derive a signal from that, you can find out information about what people are saying about stuff, and from that you can infer what they are thinking.’”

Library of Congress: Inside, Inside Baseball: A Look at the Construction of the Dataset Featuring the Smithsonian’s National Museum of African American History and Culture and the Library of Congress Digital Collections

Library of Congress: Inside, Inside Baseball: A Look at the Construction of the Dataset Featuring the Smithsonian’s National Museum of African American History and Culture and the Library of Congress Digital Collections. “After weeks of preparations and four days of fast-pitched ideation and creation, this Friday LC Labs will unveil the efforts of ‘Inside Baseball’ – a collaboration between the Library of Congress, the Smithsonian National Museum of African American History and Culture, and JSTOR Labs. Joining the Baseball Americana batting lineup, this week of flash-building and design-thinking will debut new visualizations and prototypes to bring baseball-related digital collections to center field!”