Virtual environments: How to avoid Python dependency hell with venv and pip freeze
Daniel Cooper
Posted on February 3, 2023
As a web dev in the 2005, dependency management was one of the more challenging aspects of my role at a national media company.
One example that comes to mind was the almost literal pain involved in deploying a DLL update to our sprawling .NET web application across a group of three Microsoft IIS servers.
The update process was simple but achingly manual: take each server in turn, manually drag and drop the new DLL into the server's Global Assembly Cache (GAC) folder, and then add a reference to the version number of the DLL in the server's machine.config.
Even when diligently following these steps, a catastrophic "Server Error in '/' Application" would sometimes occur, bringing the whole web application crashing down followed by panicked calls from our editorial teams.
If we were lucky, depending on how many servers had been patched at that point, the web application would stay up every other page refresh or so. Not an ideal customer experience, even for 2005.
The underlying cause was invariably due to an errant reference to an older version of the DLL in a web.config
hidden away in some subfolder within the web application. It was commonly referred to as DLL hell.
Fast forward to 2020 and I was reminded of these past woes when trying to get some old Python code up and running on a new machine.
On running the code, Python complained about umpteen missing packages from NumPy to Beautiful Soup. Dependency hell. Now while it's easy enough to install them manually using pip
, you need to know the exact name and ideally the version.
pip install beautifulsoup
ERROR: Could not find a version that satisfies the requirement beautifulsoup
ERROR: No matching distribution found for beautifulsoup
pip install beautifulsoup
Huh? Oh.. it's actually called beautifulsoup4
. Got it. Ahh, the latest version has broken my code. Sigh. Seem familiar?
Fortunately, there is a better way to handle this scenario, and it involves virtual environments, pip freeze
and a file called requirements.txt
.
Virtual environments. It's good practice to create and activate a virtual environment in your working folder as soon as you start a new project.
The venv
module provides a way to do this. Here's what the docs say:
The venv module provides support for creating lightweight "virtual environments" with their own site directories, optionally isolated from system site directories. Each virtual environment has its own Python binary (which matches the version of the binary that was used to create this environment) and can have its own independent set of installed Python packages in its site directories.
It's easy to create a virtual environment in Linux and macOS:
python -m venv venv
source venv/bin/activate
...although the activate command is platform-specific so check the docs for the Windows syntax.
Now, with a virtual environment in place for your project, the interpreter can see only the packages you've installed in that specific environment. You can see exactly which ones by running pip freeze
:
pip freeze
beautifulsoup4==4.10.0
certifi==2021.5.30
charset-normalizer==2.0.5
idna==3.2
requests==2.26.0
soupsieve==2.2.1
urllib3==1.26.6
From there, simply feed the output from pip freeze
into a file called requirements.txt
.
pip freeze > requirements.txt
This creates a definitive record of your project's required packages and version numbers - in the form of requirements.txt
- which you and others can use to install those dependencies in future within a fresh environment. Here's the install command:
pip install -r requirements.txt
Had I followed these steps way back at the start of this particular Python project that was causing me grief, I could have used the requirements.txt
to get up and running in no time.
Of course, the specific install command above can be useful in other places too. If you happen to build Docker images, you can use it within your Dockerfile
.
In summary... virtual environments, pip freeze
and requirements.txt
all provide a neat way to manage Python packages.
Posted on February 3, 2023
Join Our Newsletter. No Spam, Only the good stuff.
Sign up to receive the latest update from our blog.
Related
February 3, 2023
September 16, 2022