Lifehacker: Detect Plagiarism With the Help of This Friendly AI Bot. “The next literary hit to bear the same name as Jane Austen’s 200-year-old masterpiece Emma isn’t a book at all, but a thoroughly modern AI. This Emma — formally Emma Identity — the creation of computer science professor Aleksandr Marchenko, is the world’s first publicly available authorship identification web app. Give her at least 5,000 words of a written piece, and she’ll use more than 50 math parameters to figure out who wrote the lines.”
Loud Techie has a roundup of duplicate content checkers.
I’ll just file this under “unexpected uses for databases” – a guy named Saul Pwanson created a database of crossword puzzles, people started looking at it, and all of a sudden there’s a plagiarism scandal in crossword world. “A group of eagle-eyed puzzlers, using digital tools, has uncovered a pattern of copying in the professional crossword-puzzle world that has led to accusations of plagiarism and false identity. Since 1999, Timothy Parker, editor of one of the nation’s most widely syndicated crosswords, has edited more than 60 individual puzzles that copy elements from New York Times puzzles, often with pseudonyms for bylines, a new database has helped reveal. The puzzles in question repeated themes, answers, grids and clues from Times puzzles published years earlier. Hundreds more of the puzzles edited by Parker are nearly verbatim copies of previous puzzles that Parker also edited.”