Questions tagged [computer-vision]

The goal of computer vision is the automatic extraction of information from images. It includes methods for acquiring, processing, analyzing, and understanding images and high-dimensional data from the real world in order to produce numerical or symbolic information, e.g., in the form of decisions. The image data can take many forms, such as video sequences, views from multiple cameras, or multi-dimensional data from a medical scanner.


247 questions
8
votes
2 answers

Floor Segmentation to Determine Navigable Paths

In my application, my robot has the following physical setup: differential drive mechanics with feedback (wheel encoders); a commercially available webcam mounted with a known transform to the base of the robot (RGB, no depth). The robot will be…
mjcarroll
  • 1,761
  • 1
  • 11
  • 14
8
votes
1 answer

Public training data for vehicle detectors in computer vision?

This question is to anyone familiar with object (specifically vehicle) detection research. I'm new to computer vision and am confused about training object detection classifiers. Specifically, the objective is vehicle detection. I've been reading…
user961627
  • 181
  • 2
6
votes
2 answers

computer vision using the FREAK local features descriptor - why overlapping fields?

I am currently studying the FREAK descriptor and I read the article published by its designers. It states that the aim was to mimic the retinal topology, and one of the advantages that could be gained is the fact that retinal receptive fields…
S.E.K.
  • 191
  • 7
5
votes
1 answer

How can I recognize animals in a video stream or static images with OpenCV or another library/software?

I'm a software developer not experienced in AI or machine learning, but I'm now interested in developing this kind of software. I want to develop software that recognizes some specific objects, specifically, animals from a video stream (or a…
K. Weber
  • 161
  • 1
  • 6
4
votes
2 answers

Is there a way to combine and sync two 2K cameras @ 90fps with ICs?

I am searching for a way to minimize the size of a stereo vision module and cannot find any ICs that will combine and sync two MIPI CSI-2 (4 lane) data streams without an FPGA and too much code. There was one online (MAX7366A 3D Video…
Will
  • 41
  • 2
3
votes
1 answer

How is the Homography formula derived?

In the Wikipedia article about Homography, the projective transformation between picture A to picture B is defined as: $H_{ab} = R - \frac{t n^T}{d}$ However, the formula lacks the derivation. How is it derived?
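As a small numerical illustration of the formula quoted above, the sketch below checks that $R - \frac{t n^T}{d}$ maps points on a plane from one camera frame to the other. The conventions are assumptions, not taken from the question: the plane satisfies $n^T X_a = d$ in frame a, and the motion is $X_b = R X_a - t$. Under those assumptions $n^T X_a / d = 1$ for every point on the plane, so $X_b = R X_a - t \, (n^T X_a)/d = (R - t n^T/d) X_a$. Other sources use different sign conventions, which changes the sign of the second term. All numeric values and helper names here are hypothetical.

```python
import math

def matvec(M, v):
    """Multiply a 3x3 matrix (list of rows) by a 3-vector."""
    return [sum(M[i][j] * v[j] for j in range(3)) for i in range(3)]

def plane_homography(R, t, n, d):
    """Build H = R - t n^T / d (assumed convention: n^T X_a = d, X_b = R X_a - t)."""
    return [[R[i][j] - t[i] * n[j] / d for j in range(3)] for i in range(3)]

# Hypothetical motion: 10 degree rotation about the y axis plus a small translation.
theta = math.radians(10.0)
R = [[math.cos(theta), 0.0, math.sin(theta)],
     [0.0, 1.0, 0.0],
     [-math.sin(theta), 0.0, math.cos(theta)]]
t = [0.2, -0.1, 0.05]

# Hypothetical plane: unit normal along z, at distance d = 2 from the origin of frame a.
n = [0.0, 0.0, 1.0]
d = 2.0

H = plane_homography(R, t, n, d)

# A point on the plane, since n . X_a = 2.0 = d.
X_a = [0.3, -0.4, 2.0]

# Direct rigid motion versus the homography applied to the same point.
X_b = [r - ti for r, ti in zip(matvec(R, X_a), t)]
X_b_from_H = matvec(H, X_a)

max_error = max(abs(p - q) for p, q in zip(X_b, X_b_from_H))
print(max_error)
```

The two results agree only for points that lie on the plane; for a point off the plane, $n^T X_a / d \neq 1$ and the substitution no longer holds.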
3
votes
3 answers

How can I receive Genicam packets from a device in a docker container?

I installed a command line tool which accepts genicam packets from a device as input into a docker image. If I use the option --network host in the run command for the container the packets are received. If I use -p 3956:3956/udp to expose the…
thinwybk
  • 131
  • 5
3
votes
1 answer

Why is a computer vision system similar to human vision?

Does computer vision intentionally mimic human vision, or is it just coincidental that a good computer vision system (with convolutional neural networks, for example) resembles some properties of the human visual apparatus?
Christian
  • 454
  • 1
  • 4
  • 6
2
votes
0 answers

Choice of camera sensor to be used with LiDAR

I am doing research on autonomous cars and am looking for a sensor to be used along with a LiDAR laser scanner. Ladybug could be a very good option, but it is too expensive. Could you please suggest options for camera sensors with good FOV and…
CHIANG
  • 21
  • 1
2
votes
1 answer

"Ambiguous up to scale": explanation required

I am reading "Computer Vision: Models, Learning, and Inference", in which the author writes at several points (e.g., on pages 428-429) that although matrix A seems to have 'n' degrees of freedom, it is ambiguous up to scale and so has only 'n-1'…
Ameer Hamza
  • 123
  • 5
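A minimal sketch of what "ambiguous up to scale" can mean, assuming the matrix in question acts on homogeneous coordinates the way a 3x3 homography does (the excerpt does not say which matrix A is). Multiplying every entry by the same nonzero constant scales all three output coordinates equally, and the final division by the third coordinate cancels that constant. The matrix values and the helper name `apply_homography` are hypothetical.

```python
def apply_homography(H, x, y):
    """Map the 2D point (x, y) through a 3x3 matrix using homogeneous coordinates."""
    u = H[0][0] * x + H[0][1] * y + H[0][2]
    v = H[1][0] * x + H[1][1] * y + H[1][2]
    w = H[2][0] * x + H[2][1] * y + H[2][2]
    # Dividing by w is the step that removes any overall scale factor.
    return (u / w, v / w)

# Hypothetical homography and a copy with every entry multiplied by 5.
H = [[1.0, 0.2, 3.0],
     [0.1, 0.9, -1.0],
     [0.001, 0.002, 1.0]]
H_scaled = [[5.0 * h for h in row] for row in H]

p = apply_homography(H, 4.0, 2.0)
q = apply_homography(H_scaled, 4.0, 2.0)
print(p, q)
```

Both matrices describe the same mapping, so one of the nine entries carries no information. That is why a 3x3 matrix of this kind is usually said to have eight degrees of freedom rather than nine.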
2
votes
2 answers

Image processing in bright lights

We are working on a project that requires us to detect and hit a ball. We are trying to accomplish the task by detecting the position of the ball by processing the input from a camera. The problem is that we are required to do this in very bright…
lakshayg
  • 121
  • 3
2
votes
2 answers

Image size vs image resolution

I read somewhere that in Photoshop, for example, size refers to the number of pixels an image contains, whereas resolution involves the pixel's size. I don't know whether this definition holds in other fields. In computer vision,…
S.E.K.
  • 191
  • 7
2
votes
1 answer

Machine Vision vs Computer Vision?

I'm trying to understand the core differences between the two topics. Is one simply a newer term? Connotations of automobile vs automation? Something with a screen vs without? I've only ever heard the term computer vision (tagged).
tarabyte
  • 123
  • 5
1
vote
1 answer

Compound vision system or Megapixel camera reduction

Are any compound vision sensors commercially available? Not a simple 8-sensor system using photodiodes, but a genuine sensor that can provide a >32x32 compound matrix. Would some form of reduction in the granularity of a megapixel camera…
Neomada
  • 61
  • 1
  • 1
  • 8
1
vote
2 answers

What is the purpose of pre-trained weights for the YOLO object detector?

Dear Research Community, I am new to object detection research and am currently working on a project that will detect people using YOLO from a top-down camera image. While looking through tutorials for YOLO, I see that all tutorials ask us to download…