Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
pytorch
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Tracing torch.cuda.empty_cache() on an RTX 4090 - Where Do the 53 MB Go?
Ingero Team
Ingero Team
Ingero Team
Follow
May 28
Tracing torch.cuda.empty_cache() on an RTX 4090 - Where Do the 53 MB Go?
#
gpu
#
cuda
#
pytorch
#
debugging
Comments
Add Comment
5 min read
QAT vs PTQ on our edge vision model: 6 months of A/B data
Marco Rinaldi
Marco Rinaldi
Marco Rinaldi
Follow
May 28
QAT vs PTQ on our edge vision model: 6 months of A/B data
#
machinelearning
#
computervision
#
mlops
#
pytorch
Comments
Add Comment
4 min read
Structured channel pruning got our detector under 12ms on a Jetson
Marco Rinaldi
Marco Rinaldi
Marco Rinaldi
Follow
May 29
Structured channel pruning got our detector under 12ms on a Jetson
#
computervision
#
pytorch
#
machinelearning
#
mlops
Comments
Add Comment
4 min read
Serving 40 LoRA adapters on one base model: the throughput we got
Marcus Chen
Marcus Chen
Marcus Chen
Follow
May 29
Serving 40 LoRA adapters on one base model: the throughput we got
#
machinelearning
#
llm
#
pytorch
#
mlops
Comments
Add Comment
4 min read
torch.compile recompiled our SDXL UNet 38 times in production
Elise Moreau
Elise Moreau
Elise Moreau
Follow
May 29
torch.compile recompiled our SDXL UNet 38 times in production
#
pytorch
#
machinelearning
#
computervision
#
mlops
Comments
Add Comment
4 min read
LLM-as-judge variance broke our DPO training signal for 3 weeks
Marcus Chen
Marcus Chen
Marcus Chen
Follow
May 27
LLM-as-judge variance broke our DPO training signal for 3 weeks
#
machinelearning
#
mlops
#
llm
#
pytorch
Comments
Add Comment
4 min read
The bf16 grad accumulator that killed our SDXL LoRA training
Elise Moreau
Elise Moreau
Elise Moreau
Follow
May 27
The bf16 grad accumulator that killed our SDXL LoRA training
#
machinelearning
#
pytorch
#
mlops
#
computervision
Comments
Add Comment
4 min read
I Built a Diagnostic Toolkit for PyTorch Because I Was Tired of Guessing Why Models Fail
Aditya Mehra
Aditya Mehra
Aditya Mehra
Follow
May 26
I Built a Diagnostic Toolkit for PyTorch Because I Was Tired of Guessing Why Models Fail
#
pytorch
#
python
#
machinelearning
#
opensource
Comments
Add Comment
2 min read
Why Your PyTorch Training Crawls on a Beefy GPU (And How to Fix It)
Alan West
Alan West
Alan West
Follow
May 24
Why Your PyTorch Training Crawls on a Beefy GPU (And How to Fix It)
#
pytorch
#
performance
#
machinelearning
#
gpu
Comments
Add Comment
5 min read
VLM-scored calibration sets for INT8 quantisation, routed through Bifrost
Marco Rinaldi
Marco Rinaldi
Marco Rinaldi
Follow
May 28
VLM-scored calibration sets for INT8 quantisation, routed through Bifrost
#
machinelearning
#
mlops
#
computervision
#
pytorch
Comments
Add Comment
4 min read
Prefix caching in vLLM under multi-tenant agent traffic
Marcus Chen
Marcus Chen
Marcus Chen
Follow
May 26
Prefix caching in vLLM under multi-tenant agent traffic
#
llm
#
mlops
#
infrastructure
#
pytorch
Comments
1
 comment
4 min read
Why your diffusion model is slow at batch size 1 (and what actually helps)
Elise Moreau
Elise Moreau
Elise Moreau
Follow
May 19
Why your diffusion model is slow at batch size 1 (and what actually helps)
#
machinelearning
#
pytorch
#
computervision
#
mlops
Comments
Add Comment
4 min read
Your PyTorch Model File Can Execute Arbitrary Code — Here's How I Built a Scanner to Detect It
Pooja Kiran
Pooja Kiran
Pooja Kiran
Follow
May 19
Your PyTorch Model File Can Execute Arbitrary Code — Here's How I Built a Scanner to Detect It
#
security
#
python
#
machinelearning
#
pytorch
Comments
Add Comment
3 min read
Distilling SAM 2 into a 6MB student for industrial inspection
Marco Rinaldi
Marco Rinaldi
Marco Rinaldi
Follow
May 27
Distilling SAM 2 into a 6MB student for industrial inspection
#
computervision
#
machinelearning
#
pytorch
#
mlops
Comments
Add Comment
4 min read
My high-res image-to-video kept OOMing — turns out I was decoding outside no_grad
shinji shimizu
shinji shimizu
shinji shimizu
Follow
May 26
My high-res image-to-video kept OOMing — turns out I was decoding outside no_grad
#
pytorch
#
ai
#
machinelearning
#
python
Comments
Add Comment
4 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account