MakeUseOf: What Is Web Scraping? How to Collect Data From Websites. “Think of a type of data and you can probably collect it by scraping the web. Real estate listings, sports data, email addresses of businesses in your area, and even the lyrics from your favorite artist can all be sought out and saved by writing a small script.” This article has a couple of good examples, but it’s mostly an overview (this is not meant as a criticism; it’s an incredibly broad topic that nobody could cover in one article!)
Technotification: Best Python Tools For Machine Learning And Data Science. “… when it comes to Data Science, Python has packages that are rooted specifically for data science job. SciPy, NumPy, and pandas facilitate data analysis and can be easily integrated with web apps. It can include statistical code in production database if needed. And at last, it is an open-source programming language having a lot of resources and high-quality documentation and active community of developers willing to provide advice and assistance through all stages of the development process.”
TechXplore: Researchers compile a new database of executable Python code snippets on GitHub. “A team of researchers at North Carolina State University has recently carried out an empirical analysis of the executable status of Python code snippets shared on GitHub. Their study, pre-published on arXiv, also presents Gistable, a new database of executable Python code snippets on GitHub’s gist system, which could enable reproducible studies in the field of software engineering.”
PR Newswire: Students Unveil New Tool to Help The World Learn to Code (PRESS RELEASE). “The tool, known as Code Shrew, is the brainchild of Ludwik Trammer and Jamie Nunez. It uses a popular programming language called Python combined with drawing and animation capabilities to teach coding in a way that feels like doodling. Written code is reflected immediately on the screen, allowing users to make mental connections between what they type and how it changes their drawings.” The tool appears to be free. I played with it some and liked it enough that I made an appointment on my calendar to go through the lessons.
PC World: The best online courses for learning Python. “If you’re looking to learn coding or want to pick up another programming language, Python is a good choice. One of the terrific things about Python is how closely it resembles the English language, so you’ll often see words like ‘not,’ ‘in,’ and ‘or’ in its scripts. Because of its readability, Python is commonly the first programming language schools teach. It’s a great launchpad for an aspiring coder.”
Motherboard: Motherboard Made a Tool That Archives Websites on Demand. “Archiving services, such as the Wayback Machine, may be a staple of online journalism, but they sometimes have a problem. While, say, Archive.is might preserve one particular webpage, perhaps the Wayback Machine can’t, depending on what sort of restrictions the website developer has put in place. For example, someone stopped copies of MSNBC host Joy Reid’s blog, which hosted a stream of homophobic comments, from displaying in the Wayback Machine. With that in mind, I made a quick tool that can push a single webpage or URL to multiple archiving sites at once, and fire back the newly minted digital copies in response. Hopefully it will help reporters and researchers more efficiently figure out which service will work best for that particular site.”
UK Web Archive Blog: A New Playback Tool for the UK Web Archive. “We are delighted to announce that the UK Web Archive will be working with Rhizome to build a version of pywb (Python Wayback) that we hope will greatly improve the quality of playback for access to our archived content.”