Scientific Data: Building the European Social Innovation Database with Natural Language Processing and Machine Learning

Scientific Data: Building the European Social Innovation Database with Natural Language Processing and Machine Learning . “ESID is based on the idea of large-scale collection of unstructured web site text to classify and characterise social innovation projects from around the world. We use advanced machine learning techniques to extract features such as social innovation dimensions, project locations, summaries, and topics, among others. Our models perform as high as 0.90 F1. ESID currently includes 11,468 projects from 159 countries. ESID data is available freely and also presented in a web-based app.”

Leave a Reply

%d bloggers like this: