Questions tagged [gpu]
In the context of machine learning, questions about Graphics Processing Units (GPUs) typically concern hardware requirements, design considerations, or the level of parallelization involved in implementing and running machine learning algorithms.
171 questions
5 votes · 1 answer · 72 views
Windows Hyper-V full GPU passthrough
I'm trying to fully pass through my GPU to a Hyper-V VM.
However, all guides and tutorials only partition it, resulting in the GPU not appearing as a GPU in the VM's Task Manager performance tab. My ...
2 votes · 1 answer · 135 views
XGBoost GPU version not outperforming CPU on small dataset despite parameter tuning – suggestions needed
I'm currently working on a Parallel and Distributed Computing project where I'm comparing the performance of both XGBoost and CatBoost when trained on CPU vs GPU. The goal is to demonstrate how GPU ...
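A minimal sketch of the kind of CPU-vs-GPU timing comparison the question describes, assuming XGBoost 2.0+ (where the device parameter selects the backend); the synthetic make_classification data is a stand-in, not the asker's dataset:

```python
# Minimal CPU-vs-GPU timing sketch for XGBoost (assumes xgboost >= 2.0,
# where device="cuda" selects the GPU; older versions use tree_method="gpu_hist").
import time

import xgboost as xgb
from sklearn.datasets import make_classification

# Synthetic data stands in for the project's dataset; on small datasets the CPU
# often wins because GPU kernel-launch and transfer overhead dominates.
X, y = make_classification(n_samples=50_000, n_features=50, random_state=0)
dtrain = xgb.DMatrix(X, label=y)

for device in ("cpu", "cuda"):
    params = {"tree_method": "hist", "device": device, "max_depth": 6}
    start = time.perf_counter()
    xgb.train(params, dtrain, num_boost_round=200)
    print(f"{device}: {time.perf_counter() - start:.2f}s")
```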
6 votes · 0 answers · 65 views
Poor availability on Google Cloud Platform
I am trying to set up a VM on GCP, but every time I try to create an instance in Compute Engine, there is an error message saying that the configuration I asked for is not currently available in the ...
0 votes · 0 answers · 18 views
Confusion Matrix Not Synchronized Properly in DDP with PyTorch Lightning
I am working on a typical classification task using the MNIST dataset and training with PyTorch Lightning and DDP. I am encountering an issue where the row sums in the confusion matrix are not ...
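One common pattern for DDP-safe confusion matrices is to let torchmetrics aggregate state across ranks instead of summing per-rank matrices by hand; a sketch under that assumption (the LightningModule and the 10-class MNIST setup are illustrative):

```python
# Sketch of DDP-safe confusion-matrix accumulation with torchmetrics
# (assumes torchmetrics is installed; compute() aggregates state across ranks).
import pytorch_lightning as pl
from torchmetrics.classification import MulticlassConfusionMatrix


class LitClassifier(pl.LightningModule):
    def __init__(self, model):
        super().__init__()
        self.model = model
        # Registered as a submodule, so Lightning moves it to the right device
        # and torchmetrics syncs its state across DDP ranks at compute() time.
        self.val_confmat = MulticlassConfusionMatrix(num_classes=10)

    def validation_step(self, batch, batch_idx):
        x, y = batch
        preds = self.model(x).argmax(dim=1)
        self.val_confmat.update(preds, y)

    def on_validation_epoch_end(self):
        confmat = self.val_confmat.compute()  # aggregated over all ranks
        if self.trainer.is_global_zero:
            print(confmat)                    # row sums should now match counts
        self.val_confmat.reset()
```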
1 vote · 0 answers · 79 views
How to solve the issue with getting free ports in Pytorch DDP?
I am facing issues with getting a free port in the DDP setup block of PyTorch for parallelizing my deep learning training job across multiple GPUs on a Linux HPC cluster.
I am trying to submit a deep ...
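A frequently used workaround is to let the OS pick an unused port before initializing the process group; a sketch assuming a standard torch.distributed setup (the function names are illustrative, not from the question):

```python
# Sketch: let the OS pick a free port for the DDP rendezvous instead of
# hard-coding MASTER_PORT.
import os
import socket

import torch.distributed as dist


def find_free_port() -> int:
    # Binding to port 0 makes the kernel choose an unused port.
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as s:
        s.bind(("", 0))
        return s.getsockname()[1]


def setup(rank: int, world_size: int, port: int):
    os.environ["MASTER_ADDR"] = "localhost"
    os.environ["MASTER_PORT"] = str(port)  # every rank must use the same port
    dist.init_process_group("nccl", rank=rank, world_size=world_size)
```

The port has to be chosen once (for example in the launching process, before spawning workers) and passed to every rank, since all ranks must rendezvous on the same address and port.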
1 vote · 0 answers · 107 views
How to efficiently run a large language model with a 60k+ token context window across multiple GPUs?
I'm working with a large language model (LLM) that requires a large context window of 60,000 to 70,000 tokens for my application. My setup includes five GPUs, with three 16GB GPUs and two 8GB GPUs. I'...
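One approach for mixed-size GPUs is Hugging Face Accelerate's device_map="auto" with an explicit per-device memory budget; a sketch where the checkpoint name and GiB limits are placeholders, and note that the KV cache for a 60k-token context often dominates the budget:

```python
# Sketch: shard an LLM across mixed-size GPUs with Accelerate's device_map
# (the checkpoint name and memory budgets are placeholders, not from the question).
from transformers import AutoModelForCausalLM, AutoTokenizer

max_memory = {
    0: "15GiB", 1: "15GiB", 2: "15GiB",   # the three 16 GB cards
    3: "7GiB", 4: "7GiB",                 # the two 8 GB cards
    "cpu": "64GiB",                       # CPU offload as a fallback
}

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",           # placeholder checkpoint
    device_map="auto",                    # let Accelerate place layers per device
    max_memory=max_memory,
    torch_dtype="auto",
)
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")
```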
1 vote · 0 answers · 140 views
EfficientNetV2-M ONNX model infers significantly slower on small input
When I convert an EfficientNetV2-M model from PyTorch to ONNX with differently sized inputs, I notice strange and unexplained behavior. I was hoping to find an explanation for my observations from ...
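A sketch of how one might time the exported model at two input resolutions with ONNX Runtime to isolate the effect; the file name, shapes, and run counts are assumptions:

```python
# Sketch: time an exported ONNX model at two input sizes with onnxruntime.
# Warm-up runs keep one-time CUDA/cuDNN setup (e.g. algorithm selection)
# out of the measurement.
import time

import numpy as np
import onnxruntime as ort

sess = ort.InferenceSession("efficientnetv2_m.onnx",      # placeholder file name
                            providers=["CUDAExecutionProvider"])
input_name = sess.get_inputs()[0].name

for size in (128, 384):
    x = np.random.rand(1, 3, size, size).astype(np.float32)
    for _ in range(5):                                     # warm-up
        sess.run(None, {input_name: x})
    start = time.perf_counter()
    for _ in range(50):
        sess.run(None, {input_name: x})
    print(f"{size}x{size}: {(time.perf_counter() - start) / 50 * 1e3:.2f} ms/run")
```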
2 votes · 1 answer · 2k views
Advice on deep learning PC build using dual 4090s
I’m an engineering grad student, and I’ve been tasked with finding parts for building a shared workstation for my lab. Our work includes deep learning, computer vision, network analysis, reinforcement ...
1 vote · 0 answers · 132 views
GPU requirements for training vs inference
How do I estimate GPU requirements for model inference vs. model training/fine-tuning?
If they differ, then by roughly what ratio, just as a rule of thumb?
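A commonly quoted back-of-the-envelope (not from the question): fp16 inference needs about 2 bytes per parameter for the weights, while Adam-based mixed-precision training needs roughly 16 bytes per parameter before activations, so training memory is often several times inference memory:

```python
# Back-of-the-envelope GPU-memory estimate (rule of thumb only; it ignores
# activations, KV cache, and framework overhead, which can dominate).
def estimate_gib(n_params: float, bytes_per_param: float) -> float:
    return n_params * bytes_per_param / 1024**3

n_params = 7e9  # e.g. a 7B-parameter model (illustrative)

# fp16 inference: 2 bytes per parameter for the weights alone.
print(f"inference (fp16 weights): ~{estimate_gib(n_params, 2):.0f} GiB")

# Mixed-precision training with Adam: fp16 weights + fp16 grads
# + fp32 master weights + two fp32 optimizer states ~ 16 bytes/param.
print(f"training (Adam, mixed precision): ~{estimate_gib(n_params, 16):.0f} GiB")
```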
0 votes · 0 answers · 69 views
Why can't I increase my GPU utilization?
I have a simple UNet model (~1M params) written in Keras 3.0.1, running with a torch backend. My CUDA version is ...
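A generic first check for low utilization is whether the input pipeline is starving the GPU; a PyTorch-style timing sketch (the model and loader are placeholders, not the asker's Keras code):

```python
# Sketch: check whether the input pipeline, not the GPU, is the bottleneck.
import time

import torch


def profile_epoch(model, loader, device="cuda"):
    data_wait, compute = 0.0, 0.0
    t0 = time.perf_counter()
    for x, y in loader:
        t1 = time.perf_counter()
        data_wait += t1 - t0                 # time spent waiting on the dataloader
        x = x.to(device, non_blocking=True)
        loss = model(x).float().mean()       # stand-in for the real loss
        loss.backward()
        torch.cuda.synchronize()             # make sure GPU work has finished
        t0 = time.perf_counter()
        compute += t0 - t1
    print(f"data wait: {data_wait:.1f}s  compute: {compute:.1f}s")
```

If the data-wait time dominates, more dataloader workers, prefetching, or a larger batch size usually raises utilization more than changing the model does.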
0 votes · 1 answer · 1k views
Transformers Trainer: "RuntimeError: module must have its parameters ... on device cuda:6 (device_ids[0]) but found one of them on device: cuda:0"
I am asking this because I could not fix it with the help of:
Stack Overflow: RuntimeError: module must have its parameters and buffers on device cuda:1 (device_ids[0]) but found one of them on device: cuda:2 ...
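A workaround often suggested for this class of error is to expose only the intended card to the process, so the Trainer sees it as cuda:0; a sketch (this must happen before CUDA is initialized, and it is an assumption about the cause rather than a confirmed fix):

```python
# Sketch: restrict the process to one physical GPU so the Trainer's
# DataParallel/device logic sees it as cuda:0. Equivalent to launching with
# CUDA_VISIBLE_DEVICES=6 python train.py.
import os
os.environ["CUDA_VISIBLE_DEVICES"] = "6"   # physical GPU 6 becomes cuda:0

import torch  # imported after the env var so CUDA picks it up

print(torch.cuda.device_count())           # -> 1; the Trainer will now use cuda:0
```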
2 votes · 1 answer · 209 views
"model.to('cuda:6')" becomes (nvidia-smi) GPU 4, same with any other "cuda:MY_GPU", only "cuda:0" becomes GPU 0. How do I get rid of this mapping?
Strange mapping: example
In the following example, the first column is the device chosen in the code; the second column is the GPU that actually does the work instead:
0:0 1234 MiB
1:2 1234 MiB
2:7 1234 MiB
3:5 2341 MiB
4:1 ...
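The usual explanation for this mismatch is that CUDA enumerates devices "fastest first" while nvidia-smi lists them in PCI-bus order; forcing PCI-bus ordering typically makes the two agree. A sketch, assuming the environment variable is set before any CUDA initialization:

```python
# Sketch: make PyTorch's cuda:N indices match nvidia-smi's GPU numbers.
# By default CUDA may enumerate devices "fastest first", while nvidia-smi
# lists them in PCI-bus order; forcing PCI_BUS_ID ordering aligns the two.
import os
os.environ["CUDA_DEVICE_ORDER"] = "PCI_BUS_ID"   # must be set before CUDA init

import torch

device = torch.device("cuda:6")                  # now refers to nvidia-smi GPU 6
print(torch.cuda.get_device_name(device))
```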
1 vote · 1 answer · 425 views
Holding batch size constant, will a bigger dataset consume more GPU memory?
If you hold (mini) batch size constant (as well as everything else) but increase the number of examples (and therefore the number of training iterations), should you expect a (significant) increase in ...
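The answer usually given is "no": only the resident mini-batch (plus model, gradients, and optimizer state) occupies GPU memory, so dataset size mostly affects host RAM and training time. A toy sketch to check this, with an illustrative model and synthetic data:

```python
# Sketch: peak GPU memory depends on the batch that is resident, not on how
# many batches the DataLoader will eventually serve (toy model and data).
import torch
from torch.utils.data import DataLoader, TensorDataset

model = torch.nn.Linear(1024, 10).cuda()

for n_examples in (10_000, 100_000):
    data = TensorDataset(torch.randn(n_examples, 1024),
                         torch.randint(0, 10, (n_examples,)))
    loader = DataLoader(data, batch_size=64)
    torch.cuda.reset_peak_memory_stats()
    for x, y in loader:
        loss = torch.nn.functional.cross_entropy(model(x.cuda()), y.cuda())
        loss.backward()
        break                                  # one step is enough to see the peak
    peak = torch.cuda.max_memory_allocated() / 2**20
    print(f"{n_examples} examples -> peak {peak:.1f} MiB")   # roughly identical
```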
0 votes · 1 answer · 258 views
How to run our Python scripts utilizing our device's GPU?
My laptop has an NVIDIA GeForce GTX 1650 GPU. I want to utilize this GPU to run my Python script. Any help in the form of code would be really helpful. I have tried researching this so much, but I couldn't ...
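A script is not run "on the GPU" as a whole; a GPU-aware library moves specific tensors or models there. A minimal PyTorch sketch, assuming a CUDA-enabled PyTorch install for the GTX 1650:

```python
# Minimal PyTorch sketch: a GPU-aware library (PyTorch here) places tensors and
# models on the GPU; plain Python code keeps running on the CPU.
import torch

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
print("Using:", device)

model = torch.nn.Linear(1000, 10).to(device)   # parameters now live on the GPU
x = torch.randn(64, 1000, device=device)       # so does this batch
y = model(x)                                   # the matmul executes on the GTX 1650
print(y.shape)
```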
1 vote · 0 answers · 201 views
Using GPU-accelerated libSVM in Python
I have been using libSVM in a Python notebook to classify my dataset; one run takes approximately 5 hours, and 5-fold cross-validation would take almost a day or more.
I am planning to ...
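libSVM itself is CPU-only; GPU implementations with a similar fit/predict interface include ThunderSVM and RAPIDS cuML. A sketch using cuML's SVC, treating the synthetic data and hyperparameters as placeholders (cuML availability depends on the CUDA setup):

```python
# Sketch: replace libSVM/scikit-learn's SVC with a GPU implementation that
# keeps the same fit/predict interface (RAPIDS cuML shown; ThunderSVM's SVC
# is similar). Installing cuML needs a matching CUDA setup; illustrative only.
import numpy as np
from cuml.svm import SVC as cuSVC
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

# Synthetic data stands in for the notebook's dataset.
X, y = make_classification(n_samples=20_000, n_features=100, random_state=0)
X = X.astype(np.float32)                           # cuML prefers float32
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
                                                    random_state=0)

clf = cuSVC(kernel="rbf", C=1.0, gamma="scale")    # same hyperparameters as sklearn's SVC
clf.fit(X_train, y_train)                          # trains on the GPU
print("accuracy:", (clf.predict(X_test) == y_test).mean())
```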