Watch: From Raw Video to Deep Learning in 68 seconds

How do you go from raw video, to training machine learning models

Traditionally, video data in machine learning is treated as separate images, sampled from the video (sometimes at very low frame rates, one or two images per second), and then processed as images. This can seem counter-intuitive, since you’d want as much training data as possible. For example, a minute of video can have 1800 meaningful images with a wealth of data, why would you use only 60 images and discard of 96.6% of your potential training data? Why would you ignore the temporal data in the video?

Read below to learn about a straightforward treatment of video to get training data, using all the frames in the video. The training data is immediately available and can be directly used in your machine learning code, eg through a Jupyter notebook. You can also watch a webinar we recently hosted demonstrating the same approach. Continue reading “Watch: From Raw Video to Deep Learning in 68 seconds”

Why annotating data directly on videos is way superior to annotating images

If you’re still building classifiers based on annotated isolated images instead of annotated videos, you’re losing valuable information, getting lower quality annotations, spending more money, and walking away from much more training data. Read on to learn why and how you can transition to scalable, fast, and cost-effective video annotation.

Continue reading “Why annotating data directly on videos is way superior to annotating images”

How to get crowd annotations with expert’s quality

The Legend of The Data Scientist’s Fairy tale goes like this:

Once upon a time, in a village not far from here, lived a young girl who enjoyed looking at data. One day, she got a hold on a unicorn-sized trove of magical data, and after a quick look found inside it a huge diamond. The END!

A data scientist and her treasure trove of data. Source: http://wallpaperswide.com/girl_and_chest_treasure-wallpapers.html

Continue reading “How to get crowd annotations with expert’s quality”

“Machine Learning is 99% Manual Labor” — and what I’ve been up to

In the past year, I’ve been working on building a solution to a problem I frequently encountered while working as a data scientist. To train the machine learning models I was building, I had to use clean, labeled data. I often ended up manually reviewing and labeling data myself — a process that was tedious, lengthy, boring, error-prone, and probably not the best use of my time. As a fellow data scientist eloquently put it, “Machine Learning is 99% Manual Labor.” Continue reading ““Machine Learning is 99% Manual Labor” — and what I’ve been up to”