Skip to main content

Stack Exchange Network

Stack Exchange network consists of 183 Q&A communities including Stack Overflow, the largest, most trusted online community for developers to learn, share their knowledge, and build their careers.

Visit Stack Exchange

Loading…

current community
- Data Science
  
  help chat
- Data Science Meta
your communities

Sign up or log in to customize your list.

more stack exchange communities
company blog
Log in
Sign up

1. Home
2. Questions
3. Unanswered
4. AI Assist
5. Tags
7. Chat
8. Users
10. Companies
Stack Internal

Stack Overflow for Teams is now called Stack Internal. Bring the best of human thought and AI automation together at your work.
Try for free Learn more
Stack Internal
Bring the best of human thought and AI automation together at your work. Learn more

Stack Internal

Knowledge at work

Bring the best of human thought and AI automation together at your work.

Explore Stack Internal

Questions tagged [pipelines]

Ask Question

A pipeline is a sequence of functions (or the equivalent thereof), composed so that the output of one is input for the next, in order to create a compound transformation. Famously, a shell pipeline looks like "command | command2 | command3" (but use the tag "pipe" for this). It's also used in computer architecture to define a sequence of serial stages that execute in parallel over elements being fed into a pipe, in order to increase the overall throughput.

Learn more…
Top users
Synonyms

100 questions

Newest Active Bountied Unanswered

Bountied 0
Unanswered
Frequent
Score
Trending
Week
Month
Unanswered (my tags)

Filter by

No answers

No upvoted or accepted answers

Has bounty

Days old

Sorted by

Newest

Recent activity

Highest score

Most frequent

Bounty ending soon

Trending

Most activity

Tagged with

My watched tags

The following tags:

5 votes

1 answer

52 views

Is there a way to programatically schedule jobs on Airflow or Cron Daemon?

The question is more data engineering related than data science, but since there is no data engineering stack exchange, thought I will shoot it here. Basically, as the title says. So, as part of a ...

python
pipelines
data-engineering
api
etl

Della

485

asked Jul 31 at 12:05

1 vote

0 answers

37 views

Suggestions for constructing data exploration/analysis workflows

As part of a research project, I'm testing various statistical learning algorithms on various acoustics datasets. Instead of tediously typing scripts in python and Jupyter, I want to create a pipeline/...

visualization
pipelines
data-engineering

DangerousTim

11

asked Jul 11 at 16:26

2 votes

0 answers

34 views

Pipeline Orchestration Tool with a large Number of Nodes

We are currently looking for a pipeline orchestration tool to refactor a complex biodata pipeline. However, our since we are dealing with biodata, the orchestration tool would have to manage an ...

data
pipelines

LiKao

121

asked Mar 24 at 9:41

3 votes

0 answers

44 views

Where do the different pipelines start and end?

There is a wide variety of "pipelines" that exists in today's Data Science world: data ("lift & shift," curation, reconciliation?) inference modeling machine learning (as ...

pipelines
data-engineering

d8aninja

151

asked Dec 17, 2024 at 17:11

7 votes

1 answer

828 views

Nested-cross validation pipeline and confidence intervals

I'm hoping someone can help me think through this. I've come across a lot of different resources on nested-cv, but I think I'm confused as to how to go about model selection and the appropriate ...

cross-validation
pipelines
confidence
bootstraping

molecularrunner

73

asked Nov 26, 2024 at 18:00

3 votes

1 answer

532 views

Which software engineering design patterns are most commonly applicable in building pipelines and other DE/DS/ML workflows?

In software engineering, a design pattern is a general, reusable solution to a common problem in software design. It is not a finished piece of code but rather a template or best practice that can be ...

python
pipelines
data-engineering

Robert Long

5,855

asked Oct 27, 2024 at 12:02

0 votes

1 answer

37 views

inconsistent numbers of samples model.fit MultinomialNB

Hello guys I am practicing Naive Bayes but I got an error : ValueError: Found input variables with inconsistent numbers of samples: [1, 4179] Also, I saw some ...

naive-bayes-classifier
tfidf
pipelines

Marco Feregrino

99

asked Aug 11, 2024 at 22:25

1 vote

1 answer

819 views

Separating the features data from the target in X and y before or after a pipeline?

I have the following: train_set, test_set = train_test_split(arbres_df, test_size=0.2, random_state=42) Which is the old ...

machine-learning
training
pipelines

Dimitri

43

asked Aug 7, 2024 at 16:32

1 vote

0 answers

116 views

Integrating MLFlow and SageMaker for a More Robust ML Model Deployment Pipeline

I'm seeking advice on enhancing the deployment pipeline of a machine learning model that's accessed via a FastApi in production. My goal is to replace the existing setup with a more robust and ...

python
data-science-model
pipelines
sagemaker
mlops

Daniel Ben Zaken

11

asked Jan 14, 2024 at 13:04

0 votes

1 answer

79 views

Add tuning stage to DVC pipeline

I have an ML pipeline built with DVC that I use for experiment tracking. This allows running and tracking several experiments. Also, using hydra integration I can grid search hyper parameters. However,...

hyperparameter-tuning
pipelines

giulatona

1

asked Dec 14, 2023 at 19:12

0 votes

1 answer

158 views

What causes a Data Transformation Pipeline Error

I'm making a data transformation pipeline on a dataset, and I am getting an error: all the input array dimensions except for concatenation axis must match exactly, but along dimension 0, the array at ...

dataset
pipelines
transformation

Amy

1

asked May 23, 2023 at 7:13

1 vote

1 answer

919 views

What is the best\correct data split approach over time-series data to compare performance of forecasting future data among ML and DL regressors?

Let's say I have dataset contains a timestamp (non-standard timestamp column without datetime format) as a single feature and count as Label/target to predict ...

python
time-series
regression
forecasting
pipelines

Mario

610

asked Apr 24, 2023 at 21:08

0 votes

0 answers

61 views

Memory Error when loading a txt file for an ML model

I am trying to run the Python code below: ...

transformer
python-3.x
pipelines
memory

anon

asked Jan 11, 2023 at 16:02

2 votes

1 answer

935 views

Creating new features as linear combination of others as part of a scikit-learn pipeline?

I have a number of raw features that go into a scikit-learn model. I've already got a number of preprocessing steps (such as PolynomialFeatures) that creates additional features as part of my pipeline....

python
scikit-learn
feature-engineering
pipelines

gammapoint

181

asked Dec 20, 2022 at 19:55

0 votes

2 answers

72 views

Optimization of the entire model development process

I want to perform a global optimization of the entire model development pipeline. I have several stages of development, each of which can be performed automatically: preprocessing, removal of outliers/...

machine-learning
hyperparameter-tuning
pipelines

Andrew

406

asked Dec 11, 2022 at 19:25

15 30 50 per page

1

2 3 4 5

…

The Overflow Blog
AI is a crystal ball into your codebase
Tell us what you really, really… do not want to spend time working on
Featured on Meta
AI Assist is now available on Stack Overflow
Native Ads coming soon to Stack Overflow and Stack Exchange

Hot Network Questions

What is an ambidextrous word™?
What are the reasons not to install an oil catch can (OCC)?
Pascual Jordan's paper on the "Fermi-Dirac" statistics
Where should the bridges be built to minimize the length of the path between two towns?
Gradients in TikZ
Past event horizon of a Rindler observer and causal accessibility
What does a verbal recommendation mean for postdoc and faculty applications?
Why does my ceiling fan spark and require power cycling to work again?
B⠀⠀⠀E⠀⠀⠀G⠀⠀⠀I⠀⠀⠀N
How to Recover After Unprofessional Behavior
What are the chances a Guardian Stalker drops an ancient core?
Fuse Rating - AC or DC Specifications
Why is there need of "out," such as 'sold out"?
When did Christians first claim a connection between Jesus and the angel of the Lord?
Is it fine if I bake chicken thighs whole, then dice them once cooked, when making stir fry?
Just now in reported speech
Function in Python that encrypts and decrypts, taking string to list and the other way around
Do wooden cutting boards have antimicrobial properties?
Non-atmospheric heat transfer mechanisms on a tidally locked planet
Are there any obvious holes in my homebrew rule for identifying curses in magic items?
Why would STM32 have a diode in series on the NRST line if it's supposed to get signals from STLINK?
Black and White effect
Blazed diffraction gratings
Physical origins of two timescales in the overdamped oscillator

more hot questions

Newest pipelines questions feed

Subscribe to RSS

Newest pipelines questions feed

To subscribe to this RSS feed, copy and paste this URL into your RSS reader.

Data Science

Tour
Help
Chat
Contact
Feedback

Company

Stack Overflow
Stack Internal
Stack Data Licensing
Stack Ads
About
Press
Legal
Privacy Policy
Terms of Service
Cookie Policy

Stack Exchange Network

Technology
Culture & recreation
Life & arts
Science
Professional
Business
API
Data

Blog
Facebook
Twitter
LinkedIn
Instagram

Site design / logo © 2025 Stack Exchange Inc; user contributions licensed under CC BY-SA . rev 2025.12.10.37894