feat: Added cuda version selection to uv build. by BlueCrescent · Pull Request #433 · Modalities/modalities

BlueCrescent · 2026-02-18T18:56:08Z

What does this PR do?

Support different CUDA versions in uv installation.

General Changes

Added different optional dependency version for pytorch which use different index urls when used via uv.
Updated Readme.
Also added B200 to MFU calculator.

Checklist before submitting final PR

My PR is minimal and addresses one issue in isolation
I have merged the latest version of the target branch into this feature branch
I have reviewed my own code w.r.t. correct implementation, missing type hints, proper documentation, etc.
I have run a sample config for model training
I have checked that all tests run through (python tests/tests.py)
I have updated the internal changelog (CHANGELOG_DEV.md)

Copilot

Pull request overview

This pull request adds support for different CUDA versions in uv installation by moving PyTorch from a required dependency to an optional dependency with multiple CUDA-specific variants (cpu, cu126, cu128, cu130). The PR configures uv to use different PyTorch index URLs based on the selected CUDA variant.

Changes:

Moved torch from required dependencies to optional dependencies with CUDA version variants
Added uv-specific configuration for CUDA version conflicts and PyTorch index sources
Updated README installation instructions to reflect the new CUDA version selection

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 5 comments.

File	Description
pyproject.toml	Removed torch from dependencies, added CUDA-specific torch extras (cpu, cu126, cu128, cu130), and configured uv for handling PyTorch installation from different index URLs
README.md	Updated installation commands to include CUDA version selection syntax for uv

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-02-18T19:00:50Z

pyproject.toml

+cpu = ["torch>=2.10,<2.11.0"]
+cu126 = ["torch>=2.10,<2.11.0"]
+cu128 = ["torch>=2.10,<2.11.0"]
+cu130 = ["torch>=2.10,<2.11.0"]


The torch dependency has been moved from the main dependencies to optional dependencies, which means torch is no longer installed by default. Users who install the package without specifying one of the CUDA extras (cpu, cu126, cu128, cu130) will not get PyTorch installed. This is a breaking change that should be documented in the CHANGELOG_DEV.md (which the PR description indicates as incomplete).

Consider whether this is the intended behavior. If users must explicitly choose a CUDA version, this should be clearly communicated in the documentation and changelog.

Copilot · 2026-02-18T19:00:50Z

README.md

 curl -LsSf https://astral.sh/uv/install.sh | sh

-uv sync
+uv sync --extra [cpu|cu126|cu128|cu130]  # Get CUDA version via nvidia-smi


The README shows installation commands using bracket notation [cpu|cu126|cu128|cu130], but this is not valid shell syntax. Users cannot literally type this command and expect it to work. The command should be clarified to show that users need to choose ONE option, for example:

uv sync --extra cpu

uv sync --extra cu126

uv sync --extra cu128

uv sync --extra cu130

Consider using a different notation or providing explicit examples rather than shell pipe syntax inside brackets.

Copilot · 2026-02-18T19:00:50Z

README.md


 # For developers: use [tests,linting] and install pre-commit hooks
-uv sync --extra tests --extra linting
+uv sync --extra [cpu|cu126|cu128|cu130] --extra tests --extra linting


The README shows installation commands using bracket notation [cpu|cu126|cu128|cu130], but this is not valid shell syntax. Users cannot literally type this command and expect it to work. The command should be clarified to show that users need to choose ONE option, for example:

uv sync --extra cpu --extra tests --extra linting

uv sync --extra cu126 --extra tests --extra linting

Consider using a different notation or providing explicit examples rather than shell pipe syntax inside brackets.

Copilot · 2026-02-18T19:00:51Z

pyproject.toml

+cpu = ["torch>=2.10,<2.11.0"]
+cu126 = ["torch>=2.10,<2.11.0"]
+cu128 = ["torch>=2.10,<2.11.0"]
+cu130 = ["torch>=2.10,<2.11.0"]


The flash-attn dependency (line 27) requires torch to be installed, but torch has been moved to optional dependencies. This could cause installation failures for users who don't specify one of the CUDA extras, or for users installing via pip rather than uv.

Consider either:

Moving flash-attn to the optional dependencies as well, with each CUDA extra including it

Adding a note in the installation documentation that one of the CUDA extras must be specified

Ensuring the package can't be installed without specifying a CUDA extra

Copilot · 2026-02-18T19:00:51Z

README.md


 # Install PyTorch, psutil, Ninja and Flash Attention
-pip install "torch<2.11.0"
+pip install "torch<2.11.0"  # Or appropriate version for your CUDA setup.


The comment suggests using "appropriate version for your CUDA setup" but the version constraint "torch<2.11.0" doesn't specify how to select the CUDA version. With the new changes, users should be directed to use the appropriate PyTorch index URL for their CUDA version when installing manually, or they should install with extras like in Option 1. Consider updating this comment to align with the new CUDA version selection approach.

Suggested change

pip install "torch<2.11.0" # Or appropriate version for your CUDA setup.

# For PyTorch, select the correct index URL for your CUDA/CPU setup from https://pytorch.org/get-started/locally/

pip install --index-url https://download.pytorch.org/whl/cu121 "torch<2.11.0"

…optional dependencies

…ting PreTrainedModel.

feat: Added cuda version selection to uv build.

7db8c74

BlueCrescent requested a review from Copilot February 18, 2026 18:56

Copilot started reviewing on behalf of BlueCrescent February 18, 2026 18:56 View session

Copilot AI reviewed Feb 18, 2026

View reviewed changes

BlueCrescent added 4 commits February 19, 2026 11:11

refactor: moved flash attention from dependencies into cuda specific …

af49ae3

…optional dependencies

fix: Added torchvision to dependencies due to import error when impor…

60aa4f4

…ting PreTrainedModel.

fix: set torchvision dependency to also use the correct index

68123ac

feat(utils): Added mfu support for B200.

11c40c3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

feat: Added cuda version selection to uv build.#433

feat: Added cuda version selection to uv build.#433
BlueCrescent wants to merge 5 commits intomainfrom
uv_support_for_different_cuda_versions

BlueCrescent commented Feb 18, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Feb 18, 2026

Uh oh!

Copilot AI Feb 18, 2026

Uh oh!

Copilot AI Feb 18, 2026

Uh oh!

Copilot AI Feb 18, 2026

Uh oh!

Copilot AI Feb 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

	pip install "torch<2.11.0" # Or appropriate version for your CUDA setup.
	# For PyTorch, select the correct index URL for your CUDA/CPU setup from https://pytorch.org/get-started/locally/
	pip install --index-url https://download.pytorch.org/whl/cu121 "torch<2.11.0"

Comments

Conversation

BlueCrescent commented Feb 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

General Changes

Checklist before submitting final PR

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

BlueCrescent commented Feb 18, 2026 •

edited

Loading