BusinessDay: Africa must reap the benefits of its own data

BusinessDay: Africa must reap the benefits of its own data. “Today, an AI algorithm is not a competitive advantage but data is. The World Economic Forum calls data the new ‘oxygen’, while Chinese AI specialist Kai-Fu Lee calls it the new ‘oil’. Africa’s population is increasing faster than in any region in the world. The continent has a population of 1.3-billion people and a total nominal GDP of $2.3-trillion. This increase in the population is in effect an increase in data, and if data is the new oil, it is akin to an increase in oil reserve.”

Google AI Blog: Announcing Open Images V5 and the ICCV 2019 Open Images Challenge

Google AI Blog: Announcing Open Images V5 and the ICCV 2019 Open Images Challenge. “In 2016, we introduced Open Images, a collaborative release of ~9 million images annotated with labels spanning thousands of object categories. Since then we have rolled out several updates, culminating with Open Images V4 in 2018. In total, that release included 15.4M bounding-boxes for 600 object categories, making it the largest existing dataset with object location annotations, as well as over 300k visual relationship annotations. Today we are happy to announce Open Images V5, which adds segmentation masks to the set of annotations, along with the second Open Images Challenge, which will feature a new instance segmentation track based on this data.”

The Healthy Cognitive Aging Project: A major data resource for cognitive epidemiology (National Institute on Aging)

National Institute on Aging: The Healthy Cognitive Aging Project: A major data resource for cognitive epidemiology . “Anybody who has ever loved, lived with, or cared for a person with Alzheimer’s disease or its related dementias knows that its effects are multifaceted, complex, and often difficult to predict. That’s why NIA’s longitudinal aging studies are so important—they can provide prospective data on these as-yet incompletely understood points. This week, we are proud to announce the first public release of data from the Healthy Cognitive Aging Project (HCAP), a nationally representative study that will help shed light on how and when cognitive decline progresses in older adults.”

Google AI Blog: Announcing Google-Landmarks-v2: An Improved Dataset for Landmark Recognition & Retrieval

Google AI Blog: Announcing Google-Landmarks-v2: An Improved Dataset for Landmark Recognition & Retrieval. “Last year we released Google-Landmarks, the largest world-wide landmark recognition dataset available at that time. … this year we are releasing Google-Landmarks-v2, a completely new, even larger landmark recognition dataset that includes over 5 million images (2x that of the first release) of more than 200 thousand different landmarks (an increase of 7x).”

Phys .org: Preserving the contents of the New York Public Library in a teaspoon of protein, without energy, for millions of years

Phys .org: Preserving the contents of the New York Public Library in a teaspoon of protein, without energy, for millions of years. “As the data boom continues to boom, more and more information gets filed in less and less space. Even the cloud—whose name promises opaque, endless space—will eventually run out of space, can’t thwart all hackers, and gobbles up energy. Now, a new way to store information could stably house data for millions of years, lives outside the hackable internet, and, once written, uses no energy. All you need is a chemist, some cheap molecules, and your precious information.”

Purdue University: Genetic testing has a data problem. New software can help.

Purdue University: Genetic testing has a data problem. New software can help.. “In recent years, the market for direct-to-consumer genetic testing has exploded. The number of people who used at-home DNA tests more than doubled in 2017, most of them in the U.S. About 1 in 25 American adults now know where their ancestors came from, thanks to companies like AncestryDNA and 23andMe. As the tests become more popular, these companies are grappling with how to store all the accumulating data and how to process results quickly. A new tool called TeraPCA, created by researchers at Purdue University, is now available to help. The results were published in the journal Bioinformatics.”

Washington State University: Amid genomic data explosion, scientists find proliferating errors

Washington State University: Amid genomic data explosion, scientists find proliferating errors. “Washington State University researchers found a troubling number of errors in publicly available genomic data as they conducted a large-scale analysis of protein sequences. The work, published in the journal Frontiers in Microbiology, the world’s most cited microbiology journal, could have important implications for future genomic research.”