How to Download a Webpage in Python

The following python program demonstrates the use of urllib module to download a webpage to a local folder. Please note that this program downloads the webpage html content only, it doesn't download the linked images or other resources. The following program also demonstrates use of exception handling.

# python 3 only
import urllib.request
from urllib.error import URLError

target_file_path = "/Users/user/downloaded.html" # downloaded page saved here

try:
    response = urllib.request.urlopen('http://www.google.com/')
    html_content = response.read()

    with open(target_file_path,"wb") as fp:
        fp.write(html_content)

except URLError as e:
    print("Unable to download page: "+str(e.reason))

If you are looking for a way to scrap entire web pages including resources, you should look at the scrapy library. If you are more interested in the parsing of the web content, you should look at Beautiful Soup.

Posted in Python

Quick Programming Tips

How to Download a Webpage in Python

Programming Topics