Fix for FlashAttention RuntimeError & Triton Multi GPU fix.
#17
by
Satandon1999
- opened
Fix based on the discussion here: https://huggingface.co./microsoft/Phi-3-small-8k-instruct/discussions/11
Satandon1999
changed pull request title from
Update positional_embedding.py
to Fix for FlashAttention RuntimeError
Satandon1999
changed pull request title from
Fix for FlashAttention RuntimeError
to Fix for FlashAttention RuntimeError & Triton Multi GPU fix.
@damajercakms . Please review. Thanks.