Course Outline

Module I: Fundamentals of Data Collection and HTTP Protocol

 

1. HTTP Communication Mechanisms

2. Structure of HTML Documents

3. Browser Developer Tools and Element Inspection

5. Configuring Request Headers

 

Module II: Data Extraction Using BeautifulSoup

1. Sending HTTP Requests in Python

2. Analyzing HTML Structure with BeautifulSoup

3. Locating Elements by Tags, Classes, and Attributes

4. Extracting Text Content and Attribute Values

 

Module III: Communicating with APIs

1. Structure and Rules of JSON Format

2. Overview of Popular Open APIs

3. Making Requests and Processing Returned Data

 

Module IV: Browser Automation with Selenium

1. Handling Pages with Dynamic JavaScript Content

2. Identifying Elements by Tags and CSS Classes (CSS_SELECTOR, XPATH)

3. Navigating Web Pages (Clicks, Scrolling, Refreshing the Page)

4. Filling Out Form Fields and Logging into Services

 

Module V: Practical Applications of Data Collection

1. Creating Scripts for Mass Data Collection

 14 Hours

Number of participants


Price Per Participant (Exc. Tax)

Testimonials (5)

Provisional Courses

Related Categories