Questions tagged [model-evaluations]

This tag is for questions about how to evaluate a model's performance, not only with standard metrics but also in the context of real-world applications. What counts as a good model can depend on many factors, all of which need to be taken into account to build genuinely useful data science applications.

5 votes
2 answers
65 views

I was working on a dataset which is available on Kaggle. At first, I split my data with a train-test ratio of 90:10. Then I fit 24 different models (6 different regressors with 4 different ...
ArshakParsa
2 votes
0 answers
29 views

I’m working with a large language model that has been configured to behave conservatively in high-stakes contexts: it refuses unsafe or ambiguous user requests, prioritizes client welfare over ...
Rex H
  • 21
0 votes
0 answers
18 views

I'm working on a binary classification problem to identify struggling students. My dataset contains 10 features and 200 samples, and I implemented nested CV. The distribution of the target variable is 58%/...
Youness Belhaj
4 votes
1 answer
44 views

What I’m trying to figure out: I'm working on a machine learning project and would love to hear your thoughts on two things: (A) how to prioritize feature exploration, and (B) whether to fix hyperparameters (...
Ten
  • 41
2 votes
1 answer
70 views

I am working on a use case where I need to get the right answer and send it to the user. I have been struggling for some time to find a reliable metric that tells me when an answer is correct. The ...
Espoir Murhabazi
6 votes
2 answers
217 views

I'm working on a classification task using PyTorch and Optuna. I originally split my dataset into three parts: training, validation, and test. I fit a MinMaxScaler only on the training set and applied ...
Antonio Rossi
4 votes
3 answers
110 views

I read two articles by the same author in which he uses the whole dataset for hyperparameter optimisation with CV and then evaluates the model with the best hyperparameters using leave-one-out on the ...
Lisana Daniel
2 votes
0 answers
71 views

Suppose I have a dataset where, for each observation, we observe the loan's interest rate and whether the customer defaulted (i.e., failed to repay the loan). The interest rate is determined by a ...
Aaron
  • 231
2 votes
0 answers
34 views

I am working on evaluating an explainability method for a text classification model that predicts whether a given text sequence contains hate speech or not. The method outputs token-level importance ...
Marc
  • 21
2 votes
0 answers
143 views

I have a logistic regression model, the output of which is used to make decisions. I am testing an improved version of this model. In testing, it has substantially better log loss than the old model. When ...
user179361
3 votes
1 answer
48 views

I would like help evaluating my classification model. It is a typical model that, for each input, produces a vector of floats representing the probabilities of the labels, and I classify the ...
Keeehi
  • 31
0 votes
0 answers
44 views

I apologize in advance for the noob question; this is the first ML project that I have attempted, although I have some stats background. I am in the data exploration phase for a project, where I am ...
therinoa
0 votes
0 answers
34 views

I am trying to predict weather using QSVM. The dataset I am using is available here: https://www.kaggle.com/datasets/muthuj7/weather-dataset I am using ZZFeatureMap and a linear quantum kernel. ...
ahmad javaid
5 votes
3 answers
140 views

I am learning machine learning by myself. I am applying logistic regression to the Weather Forecast dataset from Kaggle (Weather_data). The goal is to predict Rain according to the given features and the ...
noreli
  • 51
1 vote
0 answers
62 views

I have trained my model for the first time and run inference on random images. When I try a random image with a camera position similar to the ones in my dataset, it detects the river well. But when it’s ...
Dean Debrio
