Set TORCH_CUDA_ARCH_LIST for SageAttention build
Some checks failed
Build and Push Docker Image / build (push) Has been cancelled

Build runner has no GPU, so specify target architectures explicitly.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
This commit is contained in:
Nick
2025-12-26 12:09:57 +13:00
parent ab73c2d9be
commit bdc1d769e8

View File

@@ -65,6 +65,8 @@ WORKDIR /tmp
ENV EXT_PARALLEL=4 ENV EXT_PARALLEL=4
ENV NVCC_APPEND_FLAGS="--threads 8" ENV NVCC_APPEND_FLAGS="--threads 8"
ENV MAX_JOBS=32 ENV MAX_JOBS=32
# Target RunPod GPU architectures: A100(8.0), RTX 30xx(8.6), RTX 40xx(8.9), H100(9.0)
ENV TORCH_CUDA_ARCH_LIST="8.0;8.6;8.9;9.0"
RUN git clone https://github.com/thu-ml/SageAttention.git && \ RUN git clone https://github.com/thu-ml/SageAttention.git && \
cd SageAttention && \ cd SageAttention && \
pip install --no-build-isolation . && \ pip install --no-build-isolation . && \