Skip to content

Commit

Permalink
Remove outdated TODO
Browse files Browse the repository at this point in the history
  • Loading branch information
danieldk committed Feb 7, 2025
1 parent df582a1 commit fc3ac80
Showing 1 changed file with 0 additions and 1 deletion.
1 change: 0 additions & 1 deletion server/text_generation_server/layers/fp8.py
Original file line number Diff line number Diff line change
Expand Up @@ -23,7 +23,6 @@
quantization = None

try:
# TODO: needs to be ported over to MoE and used on CUDA.
from moe_kernels.fp8_utils import w8a8_block_fp8_matmul, per_token_group_quant_fp8
except ImportError:
w8a8_block_fp8_matmul = None
Expand Down

0 comments on commit fc3ac80

Please sign in to comment.