When Numbers Lie: How to Spot Fake Data in Music and Why It Matters (Complex)

Complex: When Numbers Lie: How to Spot Fake Data in Music and Why It Matters. “Landing a high stream count or massive social media following can be very lucrative for jumpstarting an artist’s career—it can lead directly to label attention, playlist inclusion, press coverage, sync placements, and more. But for the fans, media, and others on the outskirts of the music industry, it can be a challenge to decipher if and how these numbers translate to actual offline interest.”

Hongkiat: 10 Free Data Recovery Tools For Windows PC

Hongkiat: 10 Free Data Recovery Tools For Windows PC. “When you delete a file, you don’t actually ‘delete’ it. The system marks the area the file resides as reusable, removing the file from its index. This means that there is a period where it is possible to restore the file. To help you with your deleted data dilemma, we have for you 10 free data recovery tools for Windows, which will help from simple file recovery to a total partition rescue.”

Big data: With great data comes great responsibility (Open Access Government)

Open Access Government: Big data: With great data comes great responsibility. “Personal data stored within large repositories of companies are regularly exfiltrated in data breaches. Almost every individual in Western society has been subject to their data being exposed in almost always multiple data breaches. Nation-state actors have been exposed to conduct cyber-espionage on individuals and companies. Although it is 2021, George Orwell’s novel 1984 is a distinct possibility nowadays. So, we must ask ourselves: How do we combine Western civil liberties with the advance of ubiquitous data collection technologies?”

MIT: Online hub for research and teaching brings digital humanities to the fore

MIT: Online hub for research and teaching brings digital humanities to the fore. “… a new collaboration between the MIT Programs in Digital Humanities (DH Lab) and the MIT Libraries is helping foster relationships among scholars with intersecting interests in computational culture. Since September 2020, the DH Lab has partnered with the libraries to present Digital Teaching and Research Collaborative Sessions, a weekly series of virtual events that provide a regular, informal space for faculty and researchers to connect with DH Lab staff, MIT librarians, and with one another. Recordings of these sessions are now available on the MIT Libraries’ YouTube channel.”

The Next Web: We need more AI product owners, not data scientists

The Next Web: We need more AI product owners, not data scientists. “In this article, I share my perspective on the importance and required skills of AI POs [AI Product Owner]. Instead of staffing AI Product development teams only with Data Scientists, AI POs increase the chances of successfully developing AI Products. The article describes how to build successful AI Product Teams, the role of an AI PO, and study resources to become one. First, let’s analyze the roles of a traditional Product Owner and an AI Product Owner.”

Phys .org: Researchers can store the Declaration of Independence in a single molecule

Phys .org: Researchers can store the Declaration of Independence in a single molecule. “Just how much space would you need to store all of the world’s data? A building? A block? A city? The amount of global data is estimated to be around 44 zettabytes. A 15-million-square-foot warehouse can hold 1 billion gigabytes, or .001 zettabyte. So you would need 44,000 such warehouses—which would cover nearly the entire state of West Virginia. John Chaput is hoping to change all that.”

International Monetary Fund: Let’s Build A Better Data Economy

International Monetary Fund: Let’s Build A Better Data Economy. “Most transactions involving personal data are unbeknownst to users, who likely aren’t even aware that they have taken place, let alone that they have given permission. This gives rise to what is known in economics as an externality: the cost of privacy loss is not fully considered when an exchange of data is undertaken. The consequence is that the market’s opacity probably leads to too much data being collected, with too little of the value being shared with individuals.”

Datamation: The Huge Data Problems That Prevented A Faster Pandemic Response

Datamation: The Huge Data Problems That Prevented A Faster Pandemic Response. “Early in the year, when it first became clear we faced a pandemic, technology companies worldwide stepped up and pledged resources that should have been able to find the best ways to balance economic impact and safety. But while the response was faster than the 1918 Flu Virus response, it wasn’t that much faster, and tons of mistakes should have been avoidable given we have massive modeling capability. At the heart of the problem wasn’t the lack of data; it was the inability to get to that data and analyze it in a timely basis. Let’s talk about what went wrong and what companies and governments should be doing to speed up the response, so the next outbreak isn’t as catastrophic.”

Carnegie Mellon University: New Tool Simplifies Data Sharing, Preserves Privacy

Carnegie Mellon University: New Tool Simplifies Data Sharing, Preserves Privacy. “Meet Company X. Company X makes a popular product that lots of people – millions, in fact – use on a daily basis. One day, Company X decides it would like to improve some of the hardware in its product, which is manufactured by Vendor Y. To make these improvements, the company would need to share data with Vendor Y about how its customers use the product. Unfortunately, that data may contain personal information about Company X’s customers, so sharing it would be an invasion of their privacy. Company X doesn’t want to do that, so they abandon the improvement opportunity.”

StateScoop: Minneapolis’ new website ‘turns us all into data scientists,’ CIO says

StateScoop: Minneapolis’ new website ‘turns us all into data scientists,’ CIO says. “Minneapolis DataSource contains dashboards for four categories of public data, including elections, public health, community safety, and housing and development. But [city CIO Fadi] Fadhil said the city is working to include more categories and dashboards through ‘constant automation’ of data collection around the city.”

EurekAlert: Making raw data more usable

EurekAlert: Making raw data more usable. “Computers play a significant role in data science and analysis, but despite their speed and accuracy, they are unable to understand nuance and mitigating factors that could make raw data more usable. Gautam Das, a computer science professor at The University of Texas at Arlington, is leading a team of researchers working to address that shortcoming by increasing the role of humans in the data science pipeline.”

The Hustle: The company that wants to preserve our data for 500+ years

The Hustle: The company that wants to preserve our data for 500+ years . “Deep in the Norweigan arctic, on the ice-encrusted island of Spitsbergen, life stands still. The surrounding lands of the Svalbard archipelago are sparse and desolate. It is a place where there is a 1:10 polar bear to human ratio, where the sun doesn’t rise for 4 months per year, and the northern lights dance across the sky. But on the side of a mountain in Spitsbergen, there’s an abandoned coal mine. And inside — some 250 meters below the Earth’s surface — you’ll find a steel vault that contains an archive of film encoded with hundreds of thousands of open-source projects from around the world.”

InformationWeek: Why Data Science Isn’t an Exact Science

InformationWeek: Why Data Science Isn’t an Exact Science. “‘When we’re doing data science effectively, we’re using statistics to model the real world, and it’s not clear that the statistical models we develop accurately describe what’s going on in the real world,’ said Ben Moseley, associate professor of operations research at Carnegie Mellon University’s Tepper School of Business. ‘We might define some probability distribution, but it isn’t even clear the world acts according to some probability distribution.’”

EurekAlert: Computer servers now able to retrieve data much faster

EurekAlert: Computer servers now able to retrieve data much faster. “Current data storage systems use only one storage server to process information, making them slow to retrieve information to display for the user. A backup server only becomes active if the main storage server fails. The new approach, called FLAIR, optimizes data storage systems by using all the servers within a given network. Therefore, when a user makes a data request, if the main server is full, another server automatically activates to fill it.”