Course Outline

 Module I: Fundamentals of Data Acquisition and HTTP Protocol

1. HTTP Communication Mechanisms
2. Structure of HTML Documents
3. Browser Developer Tools and Element Inspection
4. Configuring Request Headers

Module II: Data Extraction Using BeautifulSoup
1. Sending HTTP Requests in Python
2. Analyzing HTML Structure with BeautifulSoup
3. Locating Elements by Tags, Classes, and Attributes
4. Retrieving Text Content and Attribute Values

Module III: Communication with APIs
1. Structure and Rules of JSON Format
2. Overview of Popular Open APIs
3. Executing Queries and Processing Returned Data

Module IV: Browser Automation with Selenium
1. Handling Pages with Dynamic JavaScript Content
2. Identifying Elements by Tags and CSS Classes (CSS_SELECTOR, XPATH)
3. Navigating Web Pages (Clicks, Scrolling, Refreshing the Page)
4. Filling Out Form Fields and Logging into Services

Module V: Practical Applications of Data Acquisition
1. Creating Scripts for Mass Data Collection

 14 Hours

Number of participants


Price Per Participant (Exc. Tax)

Testimonials (5)

Provisional Courses

Related Categories