Scraping Websites with Python [electronic resource] / Deza, Alfredo
- Author
- Deza, Alfredo
- Published
- Pragmatic AI Solutions, 2021.
- Edition
- 1st edition.
- Physical Description
- 1 online resource (1 video file, approximately 60 min.)
- Additional Creators
- Gift, Noah and Safari, an O'Reilly Media Company
Access Online
- Summary
- Sometimes scraping is the only way to extract meaningful data when there are no options like an accessible API. Parsing raw HTML can be intimidating and full of failures if you aren't used to existing tooling that can help you parse faster and more efficiently. In this video, learn all the basics including some advanced techniques to parse HTML and extract data with the Scrapy library in Python. k Topics include: * Install, configure, and create a new project with Scrapy, a powerful scraping library written in Python * See what is required to start parsing a website, including looking at raw HTML, tags, and CSS. * Identify data to create a dataset or datasets to perform data science analysis later * Capture parsed data and save it in different formats locally * Ultra fast scraping techniques by using the filesystem directly A few resources that are helpful if you are trying to do scraping, some of them covered in the course: * Scrapy Library * Scrapy Getting started tutorial.
- Subject(s)
- ISBN
- 50114VIDEOPAIML
- Digital File Characteristics
- video file
- Reproduction Note
- Electronic reproduction. Boston, MA : Safari. Available via World Wide Web., 2021.
- Technical Details
- Mode of access: World Wide Web.
- Copyright Note
- © Pragmatic AI Solutions 2021
- Issuing Body
- Made available through: Safari, an O'Reilly Media Company.
View MARC record | catkey: 37458327