Information operations on Twitter: principles, process, and disclosure (Twitter Blog)

Twitter Blog: Information operations on Twitter: principles, process, and disclosure. “In October 2018, we published the first comprehensive archive of Tweets and media associated with known state-backed information operations on Twitter. Since its launch, thousands of researchers from across the globe have downloaded datasets, which contain more than 30 million Tweets and over 1 terabyte of media, using our archive to conduct their own investigations and to share their insights and independent analysis with the world. Today, we’re adding six additional datasets to our archive, covering coordinated, state-backed activities originating from four jurisdictions. All accounts have been removed from Twitter.”

Twitterbots: Anatomy of a Propaganda Campaign (Symantec)

Symantec: Twitterbots: Anatomy of a Propaganda Campaign. “One of the main talking points of the 2016 U.S. presidential election campaign involved attempts to surreptitiously influence public opinion using social media campaigns. In the months after the election, it quickly became apparent that a sophisticated propaganda operation had been directed against American voters. Not surprisingly, news of these campaigns caused widespread public concern, prompting social media firms to launch investigations into whether their services had been misused. In October 2018, Twitter released a massive dataset of content posted on its service by the Internet Research Agency (IRA), a Russian company responsible for the largest propaganda campaign directed against the U.S.”

Engadget: Microsoft discreetly wiped its massive facial recognition database

Engadget: Microsoft discreetly wiped its massive facial recognition database. “Microsoft has been vocal about its desire to properly regulate facial recognition technology. The company’s president, Brad Smith, appealed directly to Congress last year to take steps to manage the tech, which he says has ‘broad societal ramifications and potential for abuse.’ Such are the company’s concerns that it even blocked the sales of the tech to California police forces. Now, Microsoft is continuing its crusade by quietly deleting its MS Celeb database, which contains more than 10 million images of some 100,000 people.”

Newswise: Cornell team, EPA to partner on emissions big data project

Newswise: Cornell team, EPA to partner on emissions big data project. “ITHACA, N.Y. – A team from Cornell University associate professor Max Zhang’s lab will work with the Environmental Protection Agency (EPA) over the next year on a machine learning model designed to predict fossil fuel emissions. The project was a winning entry in the EPA-sponsored EmPOWER Air Data Challenge.”

In the Library’s Web Archives: Totally Tabular Data (Library of Congress)

Library of Congress: In the Library’s Web Archives: Totally Tabular Data. “Analysis of data from the Library of Congress Web Archives is useful in documenting how various forms of tabular data have become part of digital government publishing. We introduced the file datasets and our methods of creating them in recent posts about our PDF and audio datasets, and are glad to be continuing the series with the release of three CSV/TSV/XLS datasets. A wide range of government agencies are making tabular data available in these formats and many of these files have been archived in the Library of Congress Web Archives. Essentially, all of what we find in these files is—structurally speaking—tabular data. Still, exploring these sample datasets illustrates some of the varied and creative ways that these kinds of files have been used by different federal agencies.”

Penn Medicine News: Penn Medicine Releases Free, ‘Self-Service’ AI Tool for Data Analytics

Penn Medicine News: Penn Medicine Releases Free, ‘Self-Service’ AI Tool for Data Analytics. “The Penn Medicine Institute for Biomedical Informatics has launched a free, open-source automated machine learning system for data analysis that is designed for anyone to use, from a high school student looking to gain insight on their baseball team’s statistics, to trained researchers looking for associations between cancer and environmental factors.”

Phys .org: New tool for understanding cells in health and disease

Phys .org: New tool for understanding cells in health and disease . “The freely accessible platform, which includes software and several gold standard datasets compares the performance of thousands of different single-cell analysis options, enabling researchers to identify the best method for the questions they wish to answer.”