14.3. Plan 2: Get a soup from a URL
14.3.1. Plan 2: Example
The first step in web scraping is getting the contents of a webpage. To use the BeautifulSoup web scraping library, we first load the webpage into an object called a soup, which is BeautifulSoup's parsed version of the page's HTML.
Here is the code for getting a soup from the Cottage Inn location page.
Goal: Get a soup from one webpage
# Load libraries for web scraping
from bs4 import BeautifulSoup
import requests
# Get a soup from a URL
url = 'https://cottageinn.com/pick-a-location/'
r = requests.get(url)
soup = BeautifulSoup(r.content, 'html.parser')
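To see what a soup holds without depending on a live website, the same BeautifulSoup call can be tried on a small piece of HTML. The sketch below is only an illustration: the HTML is made up and stands in for what `r.content` would contain, and the names `html`, `page_title`, `first_link`, `link_text`, and `link_href` are hypothetical, not part of the plan.

```python
from bs4 import BeautifulSoup

# Made-up HTML standing in for the bytes that r.content would hold
# (an assumption for illustration, not the real Cottage Inn page)
html = (
    b"<html><head><title>Pick a Location</title></head>"
    b"<body><a href='/ann-arbor'>Ann Arbor</a></body></html>"
)

# Same call as in the plan, with html in place of r.content
soup = BeautifulSoup(html, 'html.parser')

# The soup lets us look up parts of the page by tag name
page_title = soup.title.string      # text inside the <title> tag
first_link = soup.find('a')         # first <a> tag in the page
link_text = first_link.get_text()   # text shown for the link
link_href = first_link['href']      # where the link points

print(page_title)
print(link_text, link_href)
```

Once a real page is loaded with `requests.get`, the soup can be searched in the same way.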
14.3.2. Plan 2: When to use this plan
Use this plan when you want to scrape one webpage.
14.3.3. Plan 2: How to use this plan
Replace the URL with the URL of the webpage you want to scrape.
A URL is a web address, like you see in your web browser.
It should be complete (starting with http:// or https://).
In this plan, the URL should be surrounded by quotes (' ').
![Copying a URL from the Cottage Inn location page](../_images/cottageinn_URL.gif)
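The two rules above (a complete address, written inside quotes) can be checked before making a request. This is a minimal sketch using Python's standard `urllib.parse` module; the helper name `looks_like_complete_url` is hypothetical and is not part of the plan or of any library.

```python
from urllib.parse import urlparse

def looks_like_complete_url(url):
    """Return True if url starts with http:// or https:// and names a site."""
    parts = urlparse(url)
    return parts.scheme in ('http', 'https') and parts.netloc != ''

# The URL from the plan is complete
complete = looks_like_complete_url('https://cottageinn.com/pick-a-location/')

# The same address without https:// is not complete
incomplete = looks_like_complete_url('cottageinn.com/pick-a-location/')

print(complete)    # True
print(incomplete)  # False
```

Copying the address straight from the browser's address bar, as in the image above, normally gives a complete URL.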
14.3.4. Plan 2: Exercises
csp-10-2-2: Fill in the plan to get a soup from the University of Michigan Wikipedia page.
# Load libraries for web scraping
from bs4 import BeautifulSoup
import requests
# Get a soup from a URL
url =
r = requests.get(url)
soup = BeautifulSoup(r.content, 'html.parser')