Newswise: Wrangling big data into real-time, actionable intelligence

Newswise: Wrangling big data into real-time, actionable intelligence. “Sandia researchers worked with students at the University of Illinois Urbana-Champaign, an Academic Alliance partner, to develop analytical and decision-making algorithms for streaming data sources and integrated them into a nearly real-time distributed data processing framework using big data tools and computing resources at Sandia. The framework takes disparate data from multiple sources and generates usable information that can be acted on in nearly real time.”

CNBC: Florida is scooping up huge amounts of data on schoolchildren, including security camera footage and discipline records, and researchers are worried

CNBC: Florida is scooping up huge amounts of data on schoolchildren, including security camera footage and discipline records, and researchers are worried. “Researchers from the Aspen Institute are raising concerns about a Florida initiative meant to collect and collate huge amounts of data on schoolchildren in the state, according to a report released Thursday. Florida schools are now required to collect, store and crunch data on students in the name of predicting school shootings.”

Simon Willison: Tracking PG&E outages by scraping to a git repo

Simon Willison: Tracking PG&E outages by scraping to a git repo. “PG&E have cut off power to several million people in northern California, supposedly as a precaution against wildfires. As it happens, I’ve been scraping and recording PG&E’s outage data every 10 minutes for the past 4+ months. This data got really interesting over the past two days! The original data lives in a GitHub repo (more importantly in the commit history of that repo).”
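
For readers curious how this kind of scrape-to-git tracking might be put together, here is a minimal Python sketch. It is not Willison's actual code. The function names and the `outages.json` filename are hypothetical, and it assumes the scraped data has already been parsed into plain Python dicts and lists. The idea is to write each snapshot in a stable, line-oriented form and commit only when the content changed, so the commit history doubles as a change log.

```python
import json
import subprocess
from pathlib import Path


def render_snapshot(data) -> str:
    """Serialize with sorted keys and indentation so git diffs stay line-oriented."""
    return json.dumps(data, indent=2, sort_keys=True) + "\n"


def write_if_changed(path: Path, data) -> bool:
    """Write the snapshot to disk; return True only when the file content changed."""
    text = render_snapshot(data)
    if path.exists() and path.read_text(encoding="utf-8") == text:
        return False
    path.write_text(text, encoding="utf-8")
    return True


def commit_snapshot(repo_dir: Path, filename: str, message: str) -> None:
    """Stage and commit one file; assumes repo_dir is already a git repository."""
    subprocess.run(["git", "add", filename], cwd=repo_dir, check=True)
    subprocess.run(["git", "commit", "-m", message], cwd=repo_dir, check=True)
```

A scheduled job (every 10 minutes, say) could fetch the data, call `write_if_changed(repo_dir / "outages.json", data)`, and call `commit_snapshot` only when that returns `True`. Sorting the keys matters here: without a stable ordering, two identical scrapes could serialize differently and produce spurious commits.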

Harvard Business Review: Most Analytics Projects Don’t Require Much Data

Harvard Business Review: Most Analytics Projects Don’t Require Much Data. “In their headlong rush into advanced data science, big data, machine learning, and artificial intelligence, too many companies have ignored ‘small data.’ This is a huge miss. The relative ease, ubiquity, and power of small data projects carry profound implications for all employees, managers, and leaders at all levels, in every department, in every organization.”

National Institutes of Health: Five Petabytes of Sequence Read Archive Data Now in the Cloud

National Institutes of Health: Five Petabytes of Sequence Read Archive Data Now in the Cloud. “The National Center for Biotechnology Information (NCBI) at the National Library of Medicine (NLM) recently moved the five petabytes of public SRA data to the cloud with support from the National Institutes of Health (NIH) Science and Technology Research Infrastructure for Discovery, Experimentation, and Sustainability (STRIDES) Initiative. These data include a variety of genomes, gene expression data, and more.”

Nature: Make more digital twins

Nature: Make more digital twins. “Digital twins — precise, virtual copies of machines or systems — are revolutionizing industry. Driven by data collected from sensors in real time, these sophisticated computer models mirror almost every facet of a product, process or service. Many major companies already use digital twins to spot problems and increase efficiency. Half of all corporations might be using them by 2021, one analyst predicts.”

University of Colorado Boulder: Anyone can look up school data with new online tool

University of Colorado Boulder: Anyone can look up school data with new online tool. “The database, first made available online in 2016 in a format designed mainly for researchers, is built from 350 million reading and math test scores from 3rd to 8th grade students during 2008-2016 in every public school in the nation. It also includes district-level measures of racial and socioeconomic composition, segregation patterns, and other educational conditions.”