Skip to main content

Questions tagged [computer-vision]

Questions related to image representation, segmentation, visual object categorization and image processing algorithms in general.

Filter by
Sorted by
Tagged with
0 votes
0 answers
40 views

I have a point cloud and the point cloud consists of room-like blocks where one wall is open of this rectangular box to make this as an open as entrance. The point cloud is extracted from the CAD ...
Encipher's user avatar
  • 185
0 votes
0 answers
27 views

I’m working on a video classification task with a long-tailed dataset where a few classes have many samples while most classes have very few. More specifically, my dataset has around 9k samples and 3....
Olivia's user avatar
  • 191
1 vote
0 answers
18 views

I'm developing a computer vision solution to count boxes in fractional palettes. Problem: inconsistent detections. I don't know if it's due to a lack of data, annotations, architecture, or ...
Lucas Anael's user avatar
1 vote
0 answers
40 views

I was curious if anyone happens to know why data augmentations (like color jitter, random cropping, etc) appear to not be always used when training autoencoder or neural-based compressors for images ...
thisIsAUsername's user avatar
0 votes
0 answers
51 views

I am currently working on the problem of creating a virtual library of toys. The preferred flow- user uploads a short video/series of photos and then, from the next session, the model can certainly ...
Soham Bhaumik's user avatar
0 votes
0 answers
51 views

I have run canny edge detector on an image that contains doors, beds, etc, in whose perimeters I am interested in. So in the Canny outputs I have, I can clearly see the edges of these objects clearly. ...
Sameer Kulkarni's user avatar
1 vote
0 answers
33 views

I am fairly new to machine learning and I've been tinkering with transformers for a short while now. I have written a transformer architecture that should in my opinion be able to understand why this ...
Soham Bhaumik's user avatar
1 vote
0 answers
37 views

Lets say that I want to train a network where the input is an image of a small part of an object. For eg: image of a building with corners and some part of exterior walls and some part of roof. I want ...
user146290's user avatar
0 votes
0 answers
35 views

I have a dataset of N images, k questions per image, Y/N answer for each question. I want to compare the accuracy of m VLMs (Vision Language Models) over this dataset: these are models, similar to the ...
DeltaIV's user avatar
  • 18.6k
1 vote
0 answers
104 views

I recently started working on a project that solely uses the semantic knowledge of image embedding that is encoded from a CLIP-based model (e.g., SigLIP) to reconstruct a semantically similar image. ...
Beothuk's user avatar
  • 11
0 votes
0 answers
61 views

We are conducting a study to compare the accuracy of two computer vision models: Model A: Trained on a non-augmented dataset of 11,200 real-world images. Model B: Trained on an augmented dataset ...
markcalendario's user avatar
0 votes
0 answers
65 views

I have a problem of data imbalance (1:10 ratio) for image classification tasks. To cope with it, I tried different imbalance training strategies, including weighted loss function, different loss ...
Yuju Ahn's user avatar
4 votes
1 answer
272 views

I am working on a predictive model for solar power production based on image sequences captured at 10-minute intervals. A single example my model receives as input consists of a sequence of images. My ...
schefflaa's user avatar
0 votes
0 answers
100 views

I'm looking for a captioning model that would be able to describe a group of images in a single sentence. Alternatively, I need a way to conceptually average a group of images before feeding that &...
Seedmanc's user avatar
  • 101
1 vote
0 answers
41 views

I have a general idea of Gaussian Mixture Models. My understanding: GMM is a way of clustering data points which, unlike K means clustering, soft assigns them under different distributions by ...
DeadAsDuck's user avatar

15 30 50 per page
1
2 3 4 5
32