1. Home
2. Questions
3. Unanswered
4. AI Assist
5. Tags
7. Chat
8. Users
10. Companies
Stack Internal

Stack Overflow for Teams is now called Stack Internal. Bring the best of human thought and AI automation together at your work.
Try for free Learn more
Stack Internal
Bring the best of human thought and AI automation together at your work. Learn more

Unanswered Questions

Ask Question

283 questions with no upvoted or accepted answers

My Tags Newest Score No Answers

6 votes

0 answers

618 views

Adversarial Learning for Semantic Segmentation

I am incorporating Adversarial Training for Semantic Segmentation from Adversarial Learning for Semi-Supervised Semantic Segmentation. The idea is like this: The discriminator takes as input a ...

Pluviophile

4,323

modified Nov 17, 2019 at 15:40

3 votes

0 answers

44 views

Loss while fine tuning a transformer based pose estimation model not reducing

I am trying to fine-tune a transformer/encoder based pose estimation model available here at: https://huggingface.co/docs/transformers/en/model_doc/vitpose When passing "labels" attribute to ...

Soham Bhaumik

131

asked May 10 at 9:13

3 votes

1 answer

122 views

How to train next token prediction text generation model using Pytorch Transformer classes?

For learning purposes, I have tried to train a text generation model at a tiny scale in this notebook using RNN/LSTM model. But I am not able to take it further to use transformer model. Can anyone ...

CommunityBot

1

modified Sep 15 at 17:04

3 votes

1 answer

399 views

How is padding masking considered in the Attention Head of a Transformer?

For purely educational purposes, my goal is to implement basic Transformer architecture from scratch. So far I focused on the encoder for classification tasks and assumed that all samples in a batch ...

CommunityBot

1

modified Aug 8 at 17:03

3 votes

0 answers

263 views

Cluster tabular data with text in some columns

Let's say I have a following features in the my dataframe: user_id user_age is_student is_graduate salary resume integer integer binary binary integer text (up to 1000 symbols) And also a few more ...

Mike

31

asked Mar 14, 2022 at 21:06

3 votes

0 answers

312 views

Struggling to understand/implement Transformer Decoder

I'm struggling to understand the decoder in a Transformer model, specifically with regards to some aspects of its architecture as well as how it actually handles the data during training. What I have ...

cuuupid

131

modified Jun 5, 2021 at 0:13

3 votes

0 answers

893 views

What exactly negative/positive value of Captum's Integrated Gradient mean?

I use Captum's Integrated Gradient to interprete my PyTorch's neural network. I know that from github and original paper mentioned that ... Positive attribution score means that the input in that ...

3ORZ

31

asked Jan 8, 2021 at 10:12

3 votes

0 answers

1k views

PyTorch: Train without dataloader (loop trough dataframe instead)

I was wondering if it is bad practice to instead of using built in tools such as dataloader just loop trough each row in a pandas df. Lets say I am doing text classification and my training loop looks ...

Isbister

193

asked Oct 5, 2020 at 20:05

3 votes

1 answer

167 views

How to specify version for dependencies so that each one is compatible and stays within a size limit?

I am trying to deploy a web app to Heroku. The free tier is limited to 500 MB. I am using my resnet34 model as a .pkl file. I create model with it using the fastai ...

CommunityBot

1

modified Sep 30 at 19:07

3 votes

0 answers

160 views

AlexNet Research Paper VS PytTorch and Tensorflow implementation

I'm making my way through Deep Learning research papers, starting with AlexNet, and I found differences in the implementation of PyTorch and Tensorflow that I can't explain. In the research paper, ...

Begoodpy

233

modified Jul 9, 2020 at 17:00

3 votes

0 answers

760 views

Explain FastText model using SHAP values

I have trained fastText model and some fully connected network build on its embeddings. I figured out how to use Lime on it: complete example can be found in Natural Language Processing Is Fun Part 3: ...

desertnaut

2,168

modified Oct 22, 2020 at 22:12

3 votes

1 answer

376 views

Policy Gradient not "learning"

I'm attempting to implement the policy gradient taken from the "Hands-On Machine Learning" book by Geron, which can be found here. The notebook uses Tensorflow and I'm attempting to do it with PyTorch....

CommunityBot

1

modified Nov 14 at 3:09

3 votes

1 answer

288 views

Is it possible to solve Rubik's cube using DQN?

I'm trying to solve Rubik's cube using deep learning and I came across with DQN, so I decided to give it a try. I developed all the code and started training but I got this results: Loss goes up and ...

CommunityBot

1

modified Dec 4 at 17:03

3 votes

0 answers

871 views

Understanding depthwise convolution vs convolution with group parameters in pytorch

So in the mobilenet-v1 network, depthwise conv layers are used. And I understand that as follows. For a input feature map of (C_in, F_in, F_in), we take only 1 ...

lincr

91

modified Apr 7, 2020 at 6:12

3 votes

0 answers

912 views

How can I get testing accuracy using tensorboard for Detectron2?

I'm learning to use Detecron2. I've followed this link to create a custom object detector. My training code - ...

mefahimrahman

131

asked Mar 3, 2020 at 6:45

15 30 50 per page

1

2 3 4 5

…