Data Structures

Web Scraping20 min read

How Websites Work

Understand how websites send HTML to your browser so you know what exactly your scraper is downloading and reading.

David Miller

December 6, 2025

1.5k63

Before scraping, you must understand how a website works.

A website is not magic.
It is just:

a server
sending text (HTML)
to your browser.

Your scraper does the same.

What happens when you open a site

You enter a URL
Browser sends request
Server responds with HTML
Browser renders page

Your scraper will stop at step 3 and read HTML.

What is HTML

HTML is a text document with tags:

<h1>News</h1>
<p>This is a paragraph</p>
<a href="/jobs">Jobs</a>

Tags describe structure, not data meaning.

Key parts for scraping

Tags: h1, p, div, span, a
Attributes: class, id, href
Text inside tags

Static vs Dynamic websites

Static

HTML already contains data.
Easy to scrape.

Dynamic

HTML loads empty, data comes later via JavaScript.
Harder to scrape, needs browser automation.

Graph: static vs dynamic

flowchart TD
  A[Request Page] --> B{Type?}
  B -->|Static| C[HTML has data]
  B -->|Dynamic| D[JS loads data later]

Developer tools (your best friend)

In browser:

Right click → Inspect
See HTML structure
Find tags and classes

This is how you decide what to scrape.

Remember

Scraper reads HTML, not visuals
Learn to inspect elements
Static sites are easier
Dynamic sites need extra tools

#Python#Beginner#Web Basics

More on Web Scraping

Web Scraping Intro

HTTP Requests Basics

HTML Parsing with BeautifulSoup