Holy sheet: How to pull contact details from a website using only Google Sheets (The Next Web)

The Next Web: Holy sheet: How to pull contact details from a website using only Google Sheets. “Are you in a job where you regularly need to collect contact information from companies? And do you find it an annoying and unnecessarily time-consuming task to sort through ever-changing web designs, to find that one button or bit of information you’re actually looking for? Don’t look any further… help is on the way!” Brilliant use of IMPORTXML / XPath.

Hongkiat: 5 Best Web Scraping Tools to Extract Online Data

Hongkiat: 5 Best Web Scraping Tools to Extract Online Data. “These software look for new data manually or automatically, fetching the new or updated data and storing them for your easy access. For example, one may collect info about products and their prices from Amazon using a scraping tool. In this post, we’re listing the use cases of web scraping tools and the top 10 web scraping tools to collect information, with zero codings.”

Smashing Magazine: The Guide To Ethical Scraping Of Dynamic Websites With Node.js And Puppeteer

Smashing Magazine: The Guide To Ethical Scraping Of Dynamic Websites With Node.js And Puppeteer. “For a lot of web scraping tasks, an HTTP client is enough to extract a page’s data. However, when it comes to dynamic websites, a headless browser sometimes becomes indispensable. In this tutorial, we will build a web scraper that can scrape dynamic websites based on Node.js and Puppeteer.”

The Guardian: How technology unlocked the secretive power of ‘Queen’s consent’

The Guardian: How technology unlocked the secretive power of ‘Queen’s consent’. “Have you ever right-clicked on a webpage and pressed the ‘View Page Source’ button? You’ll see the HTML building blocks: the mark-up incantations used to build the page on your screen. The HTML focuses on presentation: what colour that text should be, how big that image should be, and so on. Web scraping is the art of transforming this semi-structured soup back into the structured data that produced it – in this case, who was speaking in which chamber at what time, and what did they say.”

ZDNet: Facebook link preview feature used as a proxy in website-scraping scheme

ZDNet: Facebook link preview feature used as a proxy in website-scraping scheme. “The technique consisted of using Facebook developer accounts to place calls to Facebook or Facebook Messenger API servers, requesting a link preview for pages a group wanted to scrape. Facebook would fetch the data, assemble it in a link preview, and return it to the data scrappers as an API response, ready to be ingested into the scrapper’s database.” Pretty sure they mean scrapers, but I’m not going to argue with ZDNet.

The Verge: Facebook wants the NYU Ad Observer to quit collecting data about its ad targeting

The Verge: Facebook wants the NYU Ad Observer to quit collecting data about its ad targeting. “Facebook wants a New York University research project to stop collecting data about the social platform’s political ad-targeting, The Wall Street Journal reported. The Ad Observatory, a project of NYU’s engineering school with more than 6,000 volunteers, uses its AdObserver browser extension to scrape data from political ads shown on Facebook. But Facebook says the program is violates its terms of service, which bar scraping.” NYU has published a brief response.

Washington Post: Chinese firm harvests social media posts, data of prominent Americans and military

Washington Post: Chinese firm harvests social media posts, data of prominent Americans and military. “Biographies and service records of aircraft carrier captains and up-and-coming officers in the U.S. Navy. Real-time tweets originating from overseas U.S. military installations. Profiles and family maps of foreign leaders, including their relatives and children. Records of social media chatter among China watchers in Washington. Those digital crumbs, along with millions of other scraps of social media and online data, have been systematically collected since 2017 by a small Chinese company called Shenzhen Zhenhua Data Technology for the stated purpose of providing intelligence to Chinese military, government and commercial clients, according to a copy of the database that was left unsecured on the Internet and retrieved by an Australian cybersecurity consultancy.”

Techdirt: Clearview Hires Prominent First Amendment Lawyer To Argue For Its Right To Sell Scraped Data To Cops

Techdirt: Clearview Hires Prominent First Amendment Lawyer To Argue For Its Right To Sell Scraped Data To Cops. “Clearview — the facial recognition company selling law enforcement agencies (and others) access to billions of photos and personal info scraped from the web — is facing lawsuits over its business model, which appears to violate some states’ data privacy laws. It’s also been hit with cease-and-desist requests from a number of companies whose data has been scraped…. Now, the company appears to be going on the offensive.”

Technical .ly: Volunteer data scrapers helped Philadelphia Lawyers for Social Equity preserve client court records

Technical .ly: Volunteer data scrapers helped Philadelphia Lawyers for Social Equity preserve client court records. “As the first state to implement the Clean Slate Law in 2018, Pennsylvania committed to sealing millions of criminal records. The law was enacted to remove educational and vocational disadvantages for people with eligible records, including those associated with certain misdemeanors and people found not guilty in court. While the law cleared barriers to housing, education and employment for individuals across the state, it indirectly created new technological barriers for Philadelphia Lawyers for Social Equity (PLSE).”

CNET: Facebook sues developer over alleged data scraping abuse

CNET: Facebook sues developer over alleged data scraping abuse. “The social network announced on Thursday that it was filing a lawsuit against Mohammad Zaghar and his website, Massroot8, claiming that the service was grabbing Facebook users’ data without permission. The lawsuit filed in the northern district of California alleged that Zaghar’s website offered its customers the ability to scrape data from their Facebook friends — including their phone numbers, gender, date of birth and email addresses.”