Oooh, I've got to wipe the drool off the desk. A new tool helps find duplicate content in spreadsheets and databases. “…we are proud to be launching dedupe.io today. It’s a web interface for quickly and automatically finding similar rows in a spreadsheet or database, using machine learning methods. Powered by our open source dedupe library, dedupe.io is customized for your data by the training you give it. From there, it learns the best way to compare records to identify duplicates in your data. It’s built to be simple. Upload your data, provide some training examples, review some of the matches, and we take it from there.” It’s in private beta.
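
To get a feel for what "finding similar rows" means, here is a minimal sketch using only Python's standard library. It is not how dedupe.io or the dedupe library works: those learn how to compare records from your training examples, while this toy version simply scores every pair of rows with `difflib.SequenceMatcher` against a fixed threshold. The helper names (`normalize`, `similar_rows`), the 0.85 threshold, and the sample rows are all made up for illustration.

```python
from difflib import SequenceMatcher
from itertools import combinations


def normalize(row):
    """Lowercase, trim, and join a row's fields into one comparable string."""
    return " ".join(str(field).strip().lower() for field in row)


def similar_rows(rows, threshold=0.85):
    """Return (i, j, score) for each pair of rows scoring at or above threshold.

    The threshold is an arbitrary, hand-picked assumption; a trained tool
    would learn how to weigh fields instead of using one fixed cutoff.
    """
    keys = [normalize(row) for row in rows]
    pairs = []
    for i, j in combinations(range(len(keys)), 2):
        score = SequenceMatcher(None, keys[i], keys[j]).ratio()
        if score >= threshold:
            pairs.append((i, j, round(score, 2)))
    return pairs


# Hypothetical sample data: the first two rows are the same person typed twice.
rows = [
    ("Jane Smith", "123 Main St", "Chicago"),
    ("jane smith", "123 Main St.", "Chicago"),
    ("Robert Jones", "9 Elm Ave", "Boston"),
]
pairs = similar_rows(rows)
print(pairs)
```

Comparing every pair like this gets slow quickly as the row count grows, and a single cutoff misses duplicates that differ in only one important field, which is roughly the gap a trained approach is meant to close.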