Questions tagged [loss-functions]
A function used to quantify the difference between observed data and the values predicted by a model. Minimizing a loss function is one way to estimate the model's parameters.
1,191 questions
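The tag description above can be made concrete with a minimal sketch (plain Python, made-up data): the "model" is a single constant μ, the loss is mean squared error, and gradient descent on the loss recovers the sample mean, which is the MSE minimizer.

```python
# Minimal sketch: estimating a parameter by minimizing a loss function.
# Model: a single constant mu. Loss: MSE(mu) = mean((x - mu)^2).
# Gradient descent on the loss converges to the sample mean.
data = [2.0, 4.0, 6.0, 8.0]

mu = 0.0   # initial parameter guess
lr = 0.1   # learning rate
for _ in range(200):
    # d/dmu of mean((x - mu)^2) is mean(2 * (mu - x))
    grad = sum(2 * (mu - x) for x in data) / len(data)
    mu -= lr * grad

print(round(mu, 4))  # converges to the sample mean, 5.0
```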
7 votes · 1 answer · 171 views
Why do “good” loss functions in ML need both Lipschitz continuity and smoothness?
I’m trying to understand the common assumptions in machine-learning optimization theory, where a “well-behaved” loss function is often required to be both L-Lipschitz and β-smooth (i.e., have β-...
0 votes · 0 answers · 16 views
Plotting Training VS Testing Curve
I am using the gradient boosting regressor from scikit-learn with squared error as the loss function. Then I want to plot the training-set vs. test-set curve. Based on what I read, it is used to see the ...
0 votes · 0 answers · 42 views
Multiplying probabilities of weights in Bayesian neural networks to formulate a prior
A key element in Bayesian neural networks is finding the probability of a set of weights, so that it can be applied in Bayes' rule.
I cannot think of many ways of doing this, for P(w) (also sometimes ...
2 votes · 1 answer · 125 views
A question about minimizing $l_2$ norm with regularization
PREMISES: this question likely arises from my very basic knowledge of the field. Please be very detailed in the answer, even if some facts seem trivial. Also, sorry for my poor English.
...
0 votes · 0 answers · 70 views
Why is my loss curve so steep at the beginning?
For different models with the same batch size, the starting loss and the loss after the steep part are very similar; is that normal?
With bigger batch sizes the axis gets rescaled, but the graph still has the same ...
4 votes · 1 answer · 316 views
A detail on how MSE loss works in PyTorch
Given two tensors $x$ and $y$, both of shape $(N,n)$ ($N$ being the number of samples and $n$ the number of dimensions of each sample), the MSE loss is (as I understand it):
$$
\mathrm{MSE}(x,y)=...
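As context for the entry above, a small sketch of the detail usually at issue: with inputs of shape $(N,n)$, PyTorch's `nn.MSELoss` with the default `reduction='mean'` averages the squared differences over all $N \cdot n$ elements, not just over the $N$ samples. The pure-Python sketch below (toy values for illustration) implements that definition without requiring PyTorch.

```python
# Elementwise MSE with 'mean' reduction over ALL elements, matching
# PyTorch's nn.MSELoss(reduction='mean') convention: divide by N * n.
def mse(x, y):
    """x, y: lists of N samples, each a list of n values."""
    diffs = [(xi - yi) ** 2
             for row_x, row_y in zip(x, y)
             for xi, yi in zip(row_x, row_y)]
    return sum(diffs) / len(diffs)  # divides by N * n, not by N

x = [[1.0, 2.0], [3.0, 4.0]]  # N=2 samples, n=2 dimensions
y = [[1.0, 0.0], [3.0, 2.0]]
print(mse(x, y))  # (0 + 4 + 0 + 4) / 4 = 2.0
```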
0 votes · 0 answers · 55 views
MSE Loss: Which target representation allows better focus on minority class learning?
Given these two target representations for the same underlying data:
Target A : Minority class samples (Cluster 5) isolated in distribution tail, majority class samples (Clusters 3+6) shifted toward ...
5 votes · 1 answer · 172 views
Distribution based loss for regression with unbounded data
Currently I am dealing with time-series data concerning the power consumption of machines. Therefore, all target variables technically range from zero to infinity ($y \in [0, \infty)$). The data ...
1 vote · 0 answers · 93 views
KL divergence and deep learning paradigm
My question is regarding the paradigm of deep learning: I do not understand where the cost functions come from. For example, for a classification task, are we treating the encoder as the expected value of ...
4 votes · 2 answers · 511 views
Proper loss functions in machine learning
Many textbooks on the theory of machine learning state that statistical decision theory provides the basis for comparing ML algorithms.
In statistical decision theory, decision rules are compared ...
0 votes · 0 answers · 80 views
What is a suitable loss function for predicting cos(φ) and sin(φ) of circular data using a CNN?
I want to predict an angular parameter ($\phi$) from some signal using a CNN. Due to the architecture of my code, the regression is done on the two targets ($\cos\phi$, $\sin\phi$).
I created a model ...
8 votes · 5 answers · 1k views
Have we been using the wrong objective function when training logistic regression?
The standard objective function when training a logistic regression model is:
Minimize Negative Log Likelihood
This form makes it easier to optimize, but it is mathematically equivalent to the more ...
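The objective named in the entry above can be written out directly. A minimal sketch (1-D toy data, made-up weights) of the negative log-likelihood for a logistic model, i.e. the sum of $-\log p(y_i \mid x_i)$ over the training set:

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def neg_log_likelihood(w, b, xs, ys):
    """Standard logistic-regression objective: sum of -log p(y_i | x_i)."""
    nll = 0.0
    for x, y in zip(xs, ys):
        p = sigmoid(w * x + b)  # predicted P(y = 1 | x)
        nll -= y * math.log(p) + (1 - y) * math.log(1 - p)
    return nll

# Toy 1-D data (hypothetical values for illustration)
xs = [-2.0, -1.0, 1.0, 2.0]
ys = [0, 0, 1, 1]
print(neg_log_likelihood(1.0, 0.0, xs, ys))  # ~0.8804
```

Because the log is monotone, minimizing this quantity is the same as maximizing the product of the per-sample likelihoods; the log form is preferred because sums are numerically stabler and easier to differentiate than products.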
4 votes · 1 answer · 141 views
Loss function that is minimized by an HPD interval with specific coverage?
This answer describes two loss functions for Bayesian credible intervals, each of which is minimized by a particular kind of interval. I am curious whether there exists a loss function on credible ...
1 vote · 0 answers · 69 views
Order sensitivity of scoring rules
This is from another question here.
The theorem below is from Lambert's paper on forecasting (Elicitation and Evaluation of Statistical Forecasts):
$\textbf{Proposition}\quad 1:$ Let $(\Theta = \{\...
1 vote · 0 answers · 29 views
Choice of estimator that minimizes expected loss [duplicate]
Let us say we have an i.i.d. sample of data from a random variable $X$. Suppose an agent must guess the value $x$ of $X$ that will be generated next. The guess is $\hat x$. They will make an error $e:=...