-
Notifications
You must be signed in to change notification settings - Fork 730
Pull requests: pytorch/FBGEMM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Port batched_dense_vec_jagged_2d_mul and jagged_1d_to_truncated_values to tritonbench
cla signed
fb-exported
meta-exported
#5603
opened Apr 9, 2026 by
q10
Loading…
Use CUB_WRAPPED_NAMESPACE instead of legacy CUB_NS_PREFIX
cla signed
#5601
opened Apr 9, 2026 by
cyyever
Loading…
Remove dead CUDA < 11 workarounds and simplify bf16/CUB guards
cla signed
#5600
opened Apr 9, 2026 by
cyyever
Loading…
simplify ALIGNAS, remove useless attributes and stale CUDA workaround
cla signed
#5599
opened Apr 9, 2026 by
cyyever
Loading…
Replace rocm-smi with amd-smi across ROCm build, CI, and docs
cla signed
module: rocm
#5597
opened Apr 8, 2026 by
adam360x
Loading…
3 tasks done
bf16 scale/bias for INT4
cla signed
fb-exported
meta-exported
#5595
opened Apr 8, 2026 by
jeetkanjani7
Loading…
Fix int32 truncation in tbe_input_combine offset accumulation
cla signed
#5594
opened Apr 8, 2026 by
cyyever
Loading…
Enable more clang-tidy checks on C++20 (#5575)
cla signed
fb-exported
meta-exported
module: rocm
#5588
opened Apr 7, 2026 by
q10
Loading…
Add gflag to select feature names for SSD KV embedding table
cla signed
fb-exported
meta-exported
#5585
opened Apr 7, 2026 by
jnwan
Loading…
Split RowWiseSparseAdagradFused.cc.stripped.o from fbcode//admarket/adfinder:adfinder
cla signed
fb-exported
meta-exported
#5578
opened Apr 6, 2026 by
meta-codesync
bot
Loading…
Fix TBE v2 forward kernel for embedding dim > 1024 (#5326)
cla signed
#5569
opened Apr 2, 2026 by
cyyever
Loading…
Port expand_into_jagged_permute benchmark to tritonbench
cla signed
fb-exported
meta-exported
#5566
opened Apr 1, 2026 by
q10
Loading…
Fix bash scripts to fail correctly for ROCm jobs (#5564)
ciflow/rocm-mi300
cla signed
fb-exported
meta-exported
module: rocm
#5564
opened Mar 31, 2026 by
q10
Loading…
Add AMD/ROCm support for SSD TBE inference
cla signed
fb-exported
meta-exported
module: rocm
#5561
opened Mar 31, 2026 by
goldcoderZ
Loading…
Add TurboSSDInferenceModule for HSTU serving integration
cla signed
fb-exported
meta-exported
#5560
opened Mar 31, 2026 by
goldcoderZ
Loading…
Add AMD/ROCm support for SSD TBE inference (#5559)
cla signed
fb-exported
meta-exported
module: rocm
#5559
opened Mar 31, 2026 by
goldcoderZ
Loading…
Add TurboSSDInferenceModule for HSTU serving integration (#5558)
cla signed
fb-exported
meta-exported
#5558
opened Mar 31, 2026 by
goldcoderZ
Loading…
2D weights support for permute_1D_data_kernel_vec
cla signed
fb-exported
meta-exported
#5557
opened Mar 31, 2026 by
kausv
Loading…
Add streaming_update() and load_snapshot() for inference (#5554)
cla signed
fb-exported
meta-exported
#5554
opened Mar 31, 2026 by
goldcoderZ
Loading…
Add failure logging and alerting for SSD offloading (#5542)
cla signed
fb-exported
meta-exported
#5542
opened Mar 26, 2026 by
Frederick-Zhu
Loading…
Fix DramKV race: hold rlock during inplace update writes
cla signed
#5536
opened Mar 26, 2026 by
cyyever
Loading…
Add tests for group_index_select_dim0 mixed-dtype validation
cla signed
fb-exported
meta-exported
#5507
opened Mar 21, 2026 by
q10
Loading…
Fix Half2 UVM performance regression with vectorized store
cla signed
fb-exported
meta-exported
module: rocm
#5499
opened Mar 19, 2026 by
q10
Loading…
Use atomicAdd for lxu_cache_locking_counter increments/decrements
cla signed
fb-exported
meta-exported
#5479
opened Mar 16, 2026 by
goldcoderZ
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-03-09.