ZDNet: Who’s the greatest golfer of all time? This data-led project might have the answer

ZDNet: Who’s the greatest golfer of all time? This data-led project might have the answer. “What do you do if a global pandemic means you can’t stage one of the world’s most famous golf tournaments? For The R&A, organisers of The Open, the answer was to use a combination of data and video to create a virtual tournament of golfing greats from the past 50 years.”

InformationWeek: Why Data Science Isn’t an Exact Science

InformationWeek: Why Data Science Isn’t an Exact Science. “‘When we’re doing data science effectively, we’re using statistics to model the real world, and it’s not clear that the statistical models we develop accurately describe what’s going on in the real world,’ said Ben Moseley, associate professor of operations research at Carnegie Mellon University’s Tepper School of Business. ‘We might define some probability distribution, but it isn’t even clear the world acts according to some probability distribution.’”

New York Times: Hoping to Understand the Virus, Everyone Is Parsing a Mountain of Data

New York Times: Hoping to Understand the Virus, Everyone Is Parsing a Mountain of Data. “Six months since the first cases were detected in the United States, more people have been infected by far than in any other country, and the daily rundown of national numbers on Friday was a reminder of a mounting emergency: more than 73,500 new cases, 1,100 deaths and 939,838 tests, as well as 59,670 people currently hospitalized for the virus. Americans now have access to an expanding set of data to help them interpret the coronavirus pandemic.”

Arizona State University: Data analytics can predict global warming trends, heat waves

Arizona State University: Data analytics can predict global warming trends, heat waves. “New research from Arizona State University and Stanford University is augmenting meteorological studies that predict global warming trends and heat waves, adding human-originated factors into the equation.”

Bing Blogs: Extracting Covid-19 insights from Bing search data

Bing Blogs: Extracting Covid-19 insights from Bing search data. “As is true for many other topics, search engine query logs may be able to give insight into the information gaps associated with Covid-19…. We are pleased to announce that we have already made Covid-19 query data freely available on GitHub as the Bing search dataset for Coronavirus intent, with scheduled updates every month over the course of the pandemic. This dataset includes explicit Covid-19 search queries containing terms such as corona, coronavirus, and covid, as well as implicit Covid-19 queries that are used to access the same set of web page search results (using the technique of random walks on the click graph).”
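
For anyone wondering what “random walks on the click graph” might look like in practice, here is a minimal sketch. It is not Bing’s implementation, which the excerpt does not describe. It assumes a toy click log mapping (query, URL) pairs to click counts, starts from an explicit seed query, and computes the exact probability of landing on each query after one query-to-URL-to-query step. The function name, the queries, the example domains, and the counts are all made up for illustration.

```python
from collections import defaultdict


def two_step_walk(clicks, seeds):
    """Probability of reaching each query after a query -> URL -> query walk.

    clicks: dict mapping (query, url) -> click count (hypothetical toy log).
    seeds:  set of explicit seed queries; each must appear in clicks.
    """
    # Click totals per query and per URL, used to normalize transitions.
    query_total = defaultdict(int)
    url_total = defaultdict(int)
    for (query, url), count in clicks.items():
        query_total[query] += count
        url_total[url] += count

    # Step 1: start uniformly on the seeds, move to a URL in proportion
    # to how often that URL was clicked for the seed query.
    url_prob = defaultdict(float)
    for (query, url), count in clicks.items():
        if query in seeds:
            url_prob[url] += (1 / len(seeds)) * count / query_total[query]

    # Step 2: move from each reached URL back to a query in proportion
    # to how often that query led to a click on the URL.
    query_prob = defaultdict(float)
    for (query, url), count in clicks.items():
        if url in url_prob:
            query_prob[query] += url_prob[url] * count / url_total[url]
    return dict(query_prob)


# Made-up click log: the queries, domains, and counts are illustrative only.
clicks = {
    ("coronavirus symptoms", "health.example/symptoms"): 8,
    ("coronavirus symptoms", "news.example/outbreak"): 2,
    ("loss of smell", "health.example/symptoms"): 4,
    ("stimulus check", "gov.example/payments"): 5,
}

scores = two_step_walk(clicks, seeds={"coronavirus symptoms"})
for query, prob in sorted(scores.items(), key=lambda item: -item[1]):
    print(f"{query}: {prob:.3f}")
```

In this toy log, “loss of smell” never mentions the virus but shares a clicked page with the seed query, so the walk reaches it and it would count as an implicit query. “stimulus check” shares no clicked page with the seed and gets no score. A real pipeline would presumably take more steps and apply a score threshold, but those details are assumptions here.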

TechCrunch: Aclima and Google release a new air quality data set for researchers to investigate California pollution

TechCrunch: Aclima and Google release a new air quality data set for researchers to investigate California pollution. “As part of the Collision from Home conference, Aclima chief executive Davida Herzl released a new data set made in conjunction with Google. Free to the scientific community, the data is the culmination of four years of data collection and aggregation resulting in 42 million air quality measurements throughout the state of California.”

BetaNews: IBM launches open source tool to help COVID-19 data analysis

BetaNews: IBM launches open source tool to help COVID-19 data analysis. “COVID notebooks is designed to help with tasks including obtaining authoritative data on the current status of the outbreak, cleaning up the most serious data-quality problems, collating the data into a format amenable to easy analysis with tools like Pandas and Scikit-Learn, and building an initial set of example reports and graphs.”

Analytics Vidhya: 10+ Simple Yet Powerful Excel Tricks for Data Analysis

Analytics Vidhya: 10+ Simple Yet Powerful Excel Tricks for Data Analysis. “I’ve always admired the immense power of Excel. This software is not only capable of doing basic data computations, but you can also perform data analysis using it. It is widely used for many purposes including the likes of financial modeling and business planning. It can become a good stepping stone for people who are new to the world of business analytics.”

The Atlantic: How Virginia Juked Its COVID-19 Data

The Atlantic: How Virginia Juked Its COVID-19 Data. “The United States’ ability to test for the novel coronavirus finally seems to be improving. As recently as late April, the country rarely reported more than 150,000 new test results each day. The U.S. now routinely claims to conduct more than 300,000 tests a day, according to state-level data compiled by the COVID Tracking Project at The Atlantic. But these rosy numbers may conceal a problem: A lack of federal guidelines has created huge variation in how states are reporting their COVID-19 data and in what kind of data they provide to the public.”

First Draft: How to analyze Facebook data for misinformation trends and narratives

First Draft: How to analyze Facebook data for misinformation trends and narratives. “There is a mountain of data that can help us examine topics such as the spread of 5G conspiracy theories or where false narratives around Covid-19 cures came from. It can help us analyze cross-border narratives and identify which online communities most frequently discuss certain issues. While Twitter’s public data is accessible through its Application Programming Interface (API), it can be much more complicated for researchers to access platforms such as Facebook and Instagram. Facebook-owned platform CrowdTangle is the most easily accessible tool to handle three of the most important social networks — Facebook, Instagram, and Reddit — and it is free for journalists and researchers.”

Harvard Gazette: Real-time data to address real-time problems

Harvard Gazette: Real-time data to address real-time problems. “Called the Opportunity Insights Economic Tracker, the tool was created as a public resource to help policymakers assess the effects of the downturn in different regions of the U.S. with the most up-to-date information possible. With a more complete and current picture of the nation’s economic standing, policymakers should then be able to make evidence-based decisions as they move to reopen the nation. The tool provides lawmakers real-time analysis of data such as consumer spending and job postings, which normally takes them several weeks to get.”

Datamation: Expert Tips for Data Analytics: COVID-19 to Dark Data

Datamation: Expert Tips for Data Analytics: COVID-19 to Dark Data. “Register for this live video webinar – Thursday, May 7, 9 AM PT Ask the experts – get your Data Analytics questions answered by two industry experts. In a wide ranging conversation with two of data analytic’s top thought leaders, we’ll delve into some key questions in analytics today.” I’m pretty sure this is free, but not 100% positive.

FierceBiotech: Life science companies combine to form COVID-19 research database

FierceBiotech: Life science companies combine to form COVID-19 research database. “A group of major CRO, life science, data analytics, publishing and healthcare companies joined forces to release a pro bono research database to build up and integrate a central hub on the latest data out for COVID-19. On the technical side, it’s a secure repository of HIPAA-compliant, de-identified and limited patient-level data sets that will be ‘made available to public health and policy researchers to extract insights to help combat the COVID-19 pandemic,’ according to the group.”

Analytics India: A Beginner’s Guide To Using Google Colab

Analytics India: A Beginner’s Guide To Using Google Colab. “We are all familiar with the pop-up alerts of ‘memory-error’ while trying to work with a large dataset of machine learning (ML) or deep learning algorithms on Jupyter notebooks. On top of that, owning a decent GPU from an existing cloud provider has remained out of bounds due to the financial investment it entails. The machines at our disposal, unfortunately, do not have the unlimited computational ability. But the wait is finally over as we can now build large ML models without selling our properties. The credit goes to Google for launching the Colab – an online platform that allows anyone to train models with large datasets, absolutely free.”

ZDNet: Verizon introduces open-source, big data coronavirus search engine

ZDNet: Verizon introduces open-source, big data coronavirus search engine. “As we struggle to get a grip on exactly how COVID-19 makes us ill and what we can do about it, researchers have created over 50,000 articles. That’s a lot of information! So, how do you make sense of it all? Verizon Media is doing it by using Vespa. This is an open-source, big data processing program to create a coronavirus academic research search engine: CORD-19 Search.”