top | item 42188555

Using uv with PyTorch

170 points| charliermarsh | 1 year ago |docs.astral.sh

54 comments

order

kolja005|1 year ago

I recently watched a talk by the author of uv that was surprisingly fascinating [1]. He goes into a few of the more notable hacks that they had to come up with to make it as fast as it is. The most interesting thing for me was that package resolution in python given constraints defined (eg. in requirements.txt) maps to a boolean satisfiability problem which is NP-complete. So uv uses a custom SAT solver to do this. I totally under-appreciated how much goes into this software and I'm bummed I have to use Poetry at work after having watched this talk.

[1] https://www.youtube.com/watch?v=gSKTfG1GXYQ

edit: NP-complete not NP-hard

wenc|1 year ago

I haven’t used Conda since 2021 but recall it had a SAT solver that was very slow especially on degenerate cases.

How does uv’s sat solver compare?

thundergolfer|1 year ago

Kind of an aside as this doc is about the complexities of installing particular PyTorch versions, but will say that uv is way faster at installing PyTorch than pip.

We run internal benchmarks of our custom container image builder and in the 'install torch' benchmark the p50 time saved when using `uv` is 25 seconds! (71.4s vs. 43.74s)

---

Aside 2: Seems there's a missing "involves" in this sentence: "As such, installing PyTorch typically often configuring a project to use the PyTorch index."

cik|1 year ago

Joining your aside to tout the benefits of uv. We use uv combined with a simple proxy I wrote, to cache python dependencies, and then install them in parallel. UV also makes it simple to regenerate a requirements file and know who requires the dependencies, which in turn makes it easy to manage the ecosystem, analyze packages, and determine if we can reduce our footprint.

Between that latter feature, the proxy, the parallelization we've reduced build times across ~100 engineers by a solid 10 minutes. There are other things we do as well, but uv is a must have nowadays.

gdiamos|1 year ago

uv significantly speeds up my pytorch in docker builds

  # Setup virtual env
  ENV VIRTUAL_ENV=/app/.venv
  ENV PATH="$VIRTUAL_ENV/bin:$PATH"
  RUN python3 -m venv $VIRTUAL_ENV
  RUN . $VIRTUAL_ENV/bin/activate

  # install using uv
  RUN pip install uv
  RUN uv pip install torch==${TORCH_VERSION} --index-url https://download.pytorch.org/whl/cpu
The index-url makes it really convenient.

ttyprintk|1 year ago

Apparently, uv respects a key in project.toml:

    [[tool.uv.index]]
    name = "pytorch-cu124"
    url = "https://download.pytorch.org/whl/cu124"
    explicit = true

samtheprogram|1 year ago

Speeds up installation, or speeds up PyTorch in general?

pcwelder|1 year ago

I just want to thank people behind uv. The tool is just amazing for development, packaging and running packages. And it's blazing fast!

minimaxir|1 year ago

So uv caused a bit of an issue with me installing PyTorch over the weekend.

When installed with brew on my MacBook, uv currently has PyTorch 3.13 as a dependency, which is fine. But PyTorch does not currently have a stable wheel that's compatable with Python 3.13! This resulted in very confusing errors. (Solution was to point to the Nightly index)

That's technically PyTorch's fault, but it's indicitave why a specific page on installing PyTorch is necessary, and it's good to know the documentation specifically calls it out.

the_mitsuhiko|1 year ago

Fun fact: that's called out in the doc you're commenting :)

0cf8612b2e1e|1 year ago

I have run into multiple package problems with 3.13 with a non obvious root cause error message. Thankfully, uv makes it trivial to switch out to 3.12

breuleux|1 year ago

I was trying to figure out how to set up a pyproject with uv that could support cuda, rocm and other device types this morning, and next thing I knew, there was a new release adding pretty much exactly what I needed.

The pace of development on uv is really impressive.

mrbonner|1 year ago

Does uv support global Python install now? I need something like Mise for this.

shlomo_z|1 year ago

uv python install 3.xx

chocolight|1 year ago

I've read that Torch was dropping their Conda support, but won't everybody just move to Mamba which is a drop-in replacement of Conda?

Conda (and Mamba) allows to avoid duplicating packages on the disk between environments (not just the downloaded archives, but the resulting expanded files too).

How does uv compare in this regard?

alex_suzuki|1 year ago

In a nutshell, what do I gain from switching to uv from my current workflow, which is: 1) create a venv (`python3.xx -m venv venv`) 2) install packages from a requirements.txt into that venv?

One limitation I know of are the inability to detect stale packages.

Apart from „blazing fast“, which I‘m not convinced it really matters to me as I rarely touch the dependencies, what are the main reasons why uv is gaining traction?

reubenmorais|1 year ago

You get correct version resolution (checking compatibility across the entire tree of deps-of-deps, including against different Python versions) and a lock file which represents a global state of the entire tree and gives you reproducibility of a working setup.

infamia|1 year ago

You might gain a couple of things...

No need to activate venvs and the almost inevitable Python pathing "murder mysteries". uv installs in venvs first, and only then someplace else (e.g., globally).

No more clunky typing `python -m pip install foo` even when you have activated your venv (or you think you have). `uv pip install foo` is nicer and easier to remember.

uv add will add new dependencies to your pyproject.toml so you don't have to.

uv can setup skeletons for new projects in a nice, modern way

For older projects, you can have uv to resolve dependencies as of a certain date. I imagine this is great for older projects, especially with numerous dependencies.

It might remove the need for pyenv or the need to rely on your system provided Python, since uv can install Python for your project.

Cross-platform lock files

I've just started looking in to uv, so maybe my list isn't complete/very good. Some down sides include, it's still green (has some bugs naturally and lacks some features) and some might not trust/like that it's VC backed.

alex_suzuki|1 year ago

PS: one thing I like about my current workflow is no extra tools needed, base python install is all that‘s required.

imjonse|1 year ago

One reason is that it is not just for project venv/deps management but can replace other tools like pipx and pyenv for most scenarios.

antman|1 year ago

Does anyone succeed in packaging a uv pytorch env to a pyinstaller exe or similar? I am having a hard time but I assume it can be automated

Mxbonn|1 year ago

Now that PyTorch is also ending their anaconda package distribution, I think a lot of ml/ds people should give uv a shot.

nickysielicki|1 year ago

I had no idea this was happening. This comment just made my week. Thank God.

paradite|1 year ago

I was just getting used to pipenv and pyenv combo.

Is this worth switching to?

silvester23|1 year ago

I would also say absolutely. We've been using pipenv for ~6 years and have managed to build a pretty good workflow around it. But uv is just _so much faster_. So we've started moving everything over to uv and I don't think we'll ever look back.

Migrating is not super hard, we wrote a small script that moves all the information from a Pipfile to a pyproject.toml and it works like a charm.

jpalomaki|1 year ago

I've been using a combination of pyenv, venv and Poetry in the past.

Now I have switched to uv with new projects. No problems so far. I definitely recommend giving it a go.

baggiponte|1 year ago

Absolutely. No second thoughts.

cbenz|1 year ago

Absolutely. It changed my Python developer life \o/

fernandotakai|1 year ago

yes, 100%. it's not only faster, but it makes sense.