TechCrunch: Darrow raises $35M for an AI that parses public documents for class action lawsuit potential . “The [US] may not have the highest per capita amount of lawsuits (that’s Germany), but it has the most of any country overall amid a very active legal industry whose caseload is growing in a market that is worth many tens of billions of dollars. Now, an AI-based startup that’s tapping into those facts for its own business is announcing a round of funding. Darrow — which has developed an AI-based data engine that ingests large amounts of publicly available documents to search for class action litigation potential across areas like data privacy violations and environmental contamination — has raised $35 million.”
Tag Archives: datasets
NREL: NREL Researchers Reveal How Buildings Across United States Do—and Could—Use Energy
NREL: NREL Researchers Reveal How Buildings Across United States Do—and Could—Use Energy . “Buildings are responsible for 40% of total energy use in the United States, including 75% of all electricity use and 35% of the nation’s carbon emissions….To facilitate decarbonization of the U.S. building stock, researchers at the U.S. Department of Energy’s National Renewable Energy Laboratory (NREL) have created a new, meticulously researched data set that details how buildings do—and could—use energy. This data set, called the End-Use Load Profiles, reveals the massive climate impacts that improvements to the U.S. building stock could have.”
Temple University: Temple researchers examine patterns of inequality in banned books
Temple University: Temple researchers examine patterns of inequality in banned books. “Since July 2021, more than 1,500 books of contemporary literature have been banned in the United States. Now a team of Temple researchers is looking for patterns across these books to understand what may be causing them to be targeted. The team is made up of Temple faculty, library staff, and undergraduate and graduate English students who are using text mining to understand patterns of representation in these books.”
FSIS Launches New Data Tool: Recall and Public Health Alert API (USDA Food Safety and Inspection Service)
US Department of Agriculture, Food Safety and Inspection Service: FSIS Launches New Data Tool: Recall and Public Health Alert API. “The U.S. Department of Agriculture’s Food Safety and Inspection Service (FSIS) launched a new feature on its website that enables software developers to access data on recalls and public health alerts through an application programming interface (API).”
ReCANVo: A database of real-world communicative and affective nonverbal vocalizations (Scientific Data)
Scientific Data: ReCANVo: A database of real-world communicative and affective nonverbal vocalizations . “Here, we present ReCANVo: Real-World Communicative and Affective Nonverbal Vocalizations – a novel dataset of non-speech vocalizations labeled by function from minimally speaking individuals. The ReCANVo database contains over 7000 vocalizations spanning communicative and affective functions from eight minimally speaking individuals, along with communication profiles for each participant.”
University of Tübingen: Database with 2,400 prehistoric sites
University of Tübingen: Database with 2,400 prehistoric sites. “Scientists from the research center ROCEEH (“The Role of Culture in Early Expansions of Humans”) have compiled information on 2,400 prehistoric sites and 24,000 assemblages from more than 100 ancient cultures. The digital data collection is available for free to scientists and amateurs and was recently published in the journal PLoS ONE.”
Scientific Reports: GlobalUsefulNativeTrees, a database documenting 14,014 tree species, supports synergies between biodiversity recovery and local livelihoods in landscape restoration
Scientific Reports: GlobalUsefulNativeTrees, a database documenting 14,014 tree species, supports synergies between biodiversity recovery and local livelihoods in landscape restoration. “Developed primarily by combining data from GlobalTreeSearch with the World Checklist of Useful Plant Species (WCUPS), GlobUNT includes 14,014 tree species that can be filtered for ten major use categories, across 242 countries and territories.”
Bureau of Transportation Statistics: BTS Updates Datasets to National Transportation Atlas Database 07/28/2023
Bureau of Transportation Statistics: BTS Updates Datasets to National Transportation Atlas Database 07/28/2023. “The U.S. Department of Transportation’s (USDOT) Bureau of Transportation Statistics (BTS) today released its summer 2023 update to the National Transportation Atlas Database (NTAD), a set of nationwide geographic databases of transportation facilities, networks, and associated infrastructure.”
University of California San Francisco: COVID Tracking Project Records and Resources Now Available
University of California San Francisco: COVID Tracking Project Records and Resources Now Available. “The UCSF Library Archives and Special Collections is pleased to announce that the COVID Tracking Project (CTP) records are available for research. The CTP is a crowdsourced digital archive that was managed by a group of journalists at The Atlantic and approximately 500 volunteers. This committed group gathered, cataloged, and published state-level COVID-19 data over the first fifteen months of the pandemic.”
TechCrunch: Meta, Microsoft and Amazon release open map dataset to rival Google Maps, Apple Maps
TechCrunch: Meta, Microsoft and Amazon release open map dataset to rival Google Maps, Apple Maps. “A group formed by Meta, Microsoft, Amazon and mapping company TomTom is releasing data that could enable developers to build their own maps to take on Google Map and Apple Maps. The group, called the Overture Maps Foundation, was formed last year. Today, the group has released it first open map dataset.”
Scientific Data: The European Tertiary Education Register, the reference dataset on European Higher Education Institutions
Scientific Data: The European Tertiary Education Register, the reference dataset on European Higher Education Institutions . “ETER provides data on nearly 3,500 HEIs in about 40 European countries, including descriptive information, geographical information, students and graduates (with various breakdowns), revenues and expenditures, personnel, and research activities; as of March 2023, data cover the years from 2011–2020.” I grabbed a copy of this yesterday and I’m making an international version of Super Edu Search. Stay tuned.
Canine Chronicle: Morris Animal Foundation’s Data Commons Offers Rich Database
Canine Chronicles: Morris Animal Foundation’s Data Commons Offers Rich Database. “Morris Animal Foundation’s Golden Retriever Lifetime Study was launched in 2012 to better understand the risk factors for cancer and other diseases in dogs. Now, access to big data – over 51 million data points – from the Study is available through the Foundation’s Data Commons, a comprehensive, free resource for researchers interested in receiving and using longitudinal data from the Golden Retriever Lifetime Study to advance veterinary research.”
MIT News: Researchers teach an AI to write better chart captions
MIT News: Researchers teach an AI to write better chart captions. “The MIT researchers found that machine-learning models trained for autocaptioning with their dataset consistently generated captions that were precise, semantically rich, and described data trends and complex patterns. Quantitative and qualitative analyses revealed that their models captioned charts more effectively than other autocaptioning systems.”
MIT News: MIT scientists build a system that can generate AI models for biology research
MIT News: MIT scientists build a system that can generate AI models for biology research. “BioAutoMATED, an open-source, automated machine-learning platform, aims to help democratize artificial intelligence for research labs.”
Coming soon: a new tool to grapple with Chinese economic data (South China Morning Post)
South China Morning Post: Coming soon: a new tool to grapple with Chinese economic data. “A Washington think tank outlined a new tool Wednesday to address a problem that has plagued economists for decades: how to make sense of data from China often suspected of being more politically driven than statistically based.”