Mastering OpenCV with Practical Computer Vision Projects

Ebook, 658 pages, 5 hours

Name: Mastering OpenCV with Practical Computer Vision Projects
Author: Shervin Emami
ISBN: 9781849517836

By Shervin Emami, Khvedchenia Levgen, Naureen Mahmood and

About this ebook

In Detail

Computer Vision is fast becoming an important technology and is used in everything from Mars robots, national security systems, automated factories, driverless cars, and medical image analysis to new forms of human-computer interaction. OpenCV is the most common library for computer vision, providing hundreds of complex and fast algorithms, but it has a steep learning curve and few in-depth tutorials.

Mastering OpenCV with Practical Computer Vision Projects is the perfect book for developers with just basic OpenCV skills who want to try practical computer vision projects, as well as for seasoned OpenCV experts who want to add more Computer Vision topics to their skill set or gain more experience with OpenCV's new C++ interface before migrating from the C API to the C++ API.

Each chapter is a separate project including the necessary background knowledge, so try them all one-by-one or jump straight to the projects you're most interested in.

Create working prototypes from this book, including real-time mobile apps, Augmented Reality, 3D shape from video, face and eye tracking, a fluid wall using Kinect, number plate recognition, and so on.

Mastering OpenCV with Practical Computer Vision Projects gives you rapid training in nine computer vision areas with useful projects.

Approach

Each chapter in the book is an individual project, and each project is built from step-by-step instructions, clearly explained code, and the necessary screenshots.

Who this book is for

You should have basic OpenCV and C/C++ programming experience before reading this book, as it is aimed at Computer Science graduates, researchers, and computer vision experts widening their expertise.

Language: English

Publisher: Packt Publishing

Release date: Dec 3, 2012

Author

Shervin Emami

Shervin Emami (born in Iran) taught himself electronics and hobby robotics during his early teens in Australia. While building his first robot at the age of 15, he learned how RAM and CPUs work. He was so amazed by the concept that he soon designed and built a whole Z80 motherboard to control his robot, and wrote all the software purely in binary machine code using two push buttons for 0s and 1s. After learning that computers can be programmed in much easier ways, such as assembly language and even high-level compilers, Shervin became hooked on computer programming and has been programming desktops, robots, and smartphones nearly every day since then. During his late teens he created Draw3D (http://draw3d.shervinemami.info/), a 3D modeler with 30,000 lines of optimized C and assembly code that rendered 3D graphics faster than all the commercial alternatives of the time; but he lost interest in graphics programming when 3D hardware acceleration became available. At university, Shervin took a subject on computer vision and became highly interested in it; so for his first thesis in 2003 he created a real-time face detection program based on Eigenfaces, using OpenCV (beta 3) for camera input. For his master's thesis in 2005 he created a visual navigation system for several mobile robots using OpenCV (v0.96).
From 2008, he worked as a freelance Computer Vision Developer in Abu Dhabi and the Philippines, using OpenCV for a large number of short-term commercial projects that included: detecting faces using Haar or Eigenfaces; recognizing faces using Neural Networks, EHMM, or Eigenfaces; detecting the 3D position and orientation of a face from a single photo using AAM and POSIT; rotating a face in 3D using only a single photo; face preprocessing and artificial lighting using any 3D direction from a single photo; gender recognition; facial expression recognition; skin detection; iris detection; pupil detection; eye-gaze tracking; visual-saliency tracking; histogram matching; body-size detection; shirt and bikini detection; money recognition; video stabilization; face recognition on iPhone; food recognition on iPhone; and marker-based augmented reality on iPhone (the second-fastest iPhone augmented reality app at the time). OpenCV was putting food on the table for Shervin's family, so he began giving back to OpenCV through regular advice on the forums and by posting free OpenCV tutorials on his website (http://www.shervinemami.info/openCV.html). In 2011, he contacted the owners of other free OpenCV websites to write this book. He also began working on computer vision optimization for mobile devices at NVIDIA, working closely with the official OpenCV developers to produce an optimized version of OpenCV for Android. In 2012, he also joined the Khronos OpenVL committee for standardizing the hardware acceleration of computer vision for mobile devices, on which OpenCV will be based in the future.

Podcast episode
ADU 1315: What is the best mapping software for volumetric measurements requirements?: Mapping softwares available in today's markets and their limitations and capabilities. When should pilots choose softwares like DroneDeploy, Pix4D and cloud solutions available today Today's episode is brought to you by Drone U In-person Mapping Bootc...
byAsk Drone U
0 ratings
0% found this document useful
Advice for Programmers Graduating College with Dr. Saylani Ph.D.
Podcast episode
Advice for Programmers Graduating College with Dr. Saylani Ph.D.
byGame TeaTime Podcast
0 ratings
0% found this document useful

Skip carousel



Mastering OpenCV with Practical Computer Vision Projects - Shervin Emami

Mastering OpenCV with Practical Computer Vision Projects

Credits

About the Authors

About the Reviewers

www.PacktPub.com

Support files, eBooks, discount offers and more

Why Subscribe?

Free Access for Packt account holders

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Reader feedback

Customer support

Downloading the example code

Errata

Piracy

Questions

1. Cartoonifier and Skin Changer for Android

Accessing the webcam

Main camera processing loop for a desktop app

Generating a black-and-white sketch

Generating a color painting and a cartoon

Generating an evil mode using edge filters

Generating an alien mode using skin detection

Skin-detection algorithm

Showing the user where to put their face

Implementation of the skin-color changer

Porting from desktop to Android

Setting up an Android project that uses OpenCV

Color formats used for image processing on Android

Input color format from the camera

Output color format for display

Adding the cartoonifier code to the Android NDK app

Reviewing the Android app

Cartoonifying the image when the user taps the screen

Saving the image to a file and to the Android picture gallery

Showing an Android notification message about a saved image

Changing cartoon modes through the Android menu bar

Reducing the random pepper noise from the sketch image

Showing the FPS of the app

Using a different camera resolution

Customizing the app

Summary

2. Marker-based Augmented Reality on iPhone or iPad

Creating an iOS project that uses OpenCV

Adding OpenCV framework

Including OpenCV headers

Application architecture

Accessing the camera

Marker detection

Marker identification

Grayscale conversion

Image binarization

Contours detection

Candidates search

Marker code recognition

Reading marker code

Marker location refinement

Placing a marker in 3D

Camera calibration

Marker pose estimation

Rendering the 3D virtual object

Creating the OpenGL rendering layer

Rendering an AR scene

Summary

References

3. Marker-less Augmented Reality

Marker-based versus marker-less AR

Using feature descriptors to find an arbitrary image on video

Feature extraction

Definition of a pattern object

Matching of feature points

PatternDetector.cpp

Outlier removal

Cross-match filter

Ratio test

PatternDetector.cpp

Homography estimation

PatternDetector.cpp

Homography refinement

PatternDetector.cpp

Putting it all together

Pattern pose estimation

PatternDetector.cpp

Obtaining the camera-intrinsic matrix

Pattern.cpp

Application infrastructure

ARPipeline.hpp

ARPipeline.cpp

Enabling support for 3D visualization in OpenCV

Creating OpenGL windows using OpenCV

Video capture using OpenCV

Rendering augmented reality

ARDrawingContext.hpp

ARDrawingContext.cpp

Demonstration

main.cpp

Summary

References

4. Exploring Structure from Motion Using OpenCV

Structure from Motion concepts

Estimating the camera motion from a pair of images

Point matching using rich feature descriptors

Point matching using optical flow

Finding camera matrices

Reconstructing the scene

Reconstruction from many views

Refinement of the reconstruction

Visualizing 3D point clouds with PCL

Using the example code

Summary

References

5. Number Plate Recognition Using SVM and Neural Networks

Introduction to ANPR

ANPR algorithm

Plate detection

Segmentation

Classification

Plate recognition

OCR segmentation

Feature extraction

OCR classification

Evaluation

Summary

6. Non-rigid Face Tracking

Overview

Utilities

Object-oriented design

Data collection: Image and video annotation

Training data types

Annotation tool

Pre-annotated data (The MUCT dataset)

Geometrical constraints

Procrustes analysis

Linear shape models

A combined local-global representation

Training and visualization

Facial feature detectors

Correlation-based patch models

Learning discriminative patch models

Generative versus discriminative patch models

Accounting for global geometric transformations

Training and visualization

Face detection and initialization

Face tracking

Face tracker implementation

Training and visualization

Generic versus person-specific models

Summary

References

7. 3D Head Pose Estimation Using AAM and POSIT

Active Appearance Models overview

Active Shape Models

Getting the feel of PCA

Triangulation

Triangle texture warping

Model Instantiation – playing with the Active Appearance Model

AAM search and fitting

POSIT

Diving into POSIT

POSIT and head model

Tracking from webcam or video file

Summary

References

8. Face Recognition using Eigenfaces or Fisherfaces

Introduction to face recognition and face detection

Step 1: Face detection

Implementing face detection using OpenCV

Loading a Haar or LBP detector for object or face detection

Accessing the webcam

Detecting an object using the Haar or LBP Classifier

Grayscale color conversion

Shrinking the camera image

Histogram equalization

Detecting the face

Step 2: Face preprocessing

Eye detection

Eye search regions

Geometrical transformation

Separate histogram equalization for left and right sides

Smoothing

Elliptical mask

Step 3: Collecting faces and learning from them

Collecting preprocessed faces for training

Training the face recognition system from collected faces

Viewing the learned knowledge

Average face

Eigenvalues, Eigenfaces, and Fisherfaces

Step 4: Face recognition

Face identification: Recognizing people from their face

Face verification: Validating that it is the claimed person

Finishing touches: Saving and loading files

Finishing touches: Making a nice and interactive GUI

Drawing the GUI elements

Startup mode

Detection mode

Collection mode

Training mode

Recognition mode

Checking and handling mouse clicks

Summary

References

Index

Mastering OpenCV with Practical Computer Vision Projects

All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the authors, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book.

Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

First published: November 2012

Production Reference: 1161112

Published by Packt Publishing Ltd.

Livery Place

35 Livery Street

Birmingham B3 2PB, UK.

ISBN 978-1-84951-782-9

www.packtpub.com

Cover Image by Neha Rajappan (<neha.rajappan1@gmail.com>)

Credits

Authors

Daniel Lélis Baggio

Shervin Emami

David Millán Escrivá

Khvedchenia Ievgen

Naureen Mahmood

Jason Saragih

Roy Shilkrot

Reviewers

Kirill Kornyakov

Luis Díaz Más

Sebastian Montabone

Acquisition Editor

Usha Iyer

Lead Technical Editor

Ankita Shashi

Technical Editors

Sharvari Baet

Prashant Salvi

Copy Editors

Brandt D'Mello

Aditya Nair

Alfida Paiva

Project Coordinator

Priya Sharma

Proofreaders

Chris Brown

Martin Diver

Indexer

Hemangini Bari

Tejal Soni

Rekha Nair

Graphics

Valentina D'silva

Aditi Gajjar

Production Coordinator

Arvindkumar Gupta

Cover Work

Arvindkumar Gupta

About the Authors

Daniel Lélis Baggio started his work in computer vision through medical image processing at InCor (Instituto do Coração – Heart Institute) in São Paulo, where he worked with intra-vascular ultrasound image segmentation. Since then, he has focused on GPGPU and ported the segmentation algorithm to work with NVIDIA's CUDA. He has also dived into six-degrees-of-freedom head tracking with a natural user interface group through a project called ehci (http://code.google.com/p/ehci/). He now works for the Brazilian Air Force.

I'd like to thank God for the opportunity of working with computer vision. I try to understand the wonderful algorithms He has created for us to see. I also thank my family, and especially my wife, for all their support throughout the development of the book. I'd like to dedicate this book to my son Stefano.

At university, Shervin Emami took a subject on computer vision and became highly interested in it, so for his first thesis in 2003 he created a real-time face detection program based on Eigenfaces, using OpenCV (beta 3) for camera input. For his master's thesis in 2005 he created a visual navigation system for several mobile robots using OpenCV (v0.96). From 2008, he worked as a freelance Computer Vision Developer in Abu Dhabi and the Philippines, using OpenCV for a large number of short-term commercial projects that included:

Detecting faces using Haar or Eigenfaces

Recognizing faces using Neural Networks, EHMM, or Eigenfaces

Detecting the 3D position and orientation of a face from a single photo using AAM and POSIT

Rotating a face in 3D using only a single photo

Face preprocessing and artificial lighting using any 3D direction from a single photo

Gender recognition

Facial expression recognition

Skin detection

Iris detection

Pupil detection

Eye-gaze tracking

Visual-saliency tracking

Histogram matching

Body-size detection

Shirt and bikini detection

Money recognition

Video stabilization

Face recognition on iPhone

Food recognition on iPhone

Marker-based augmented reality on iPhone (the second-fastest iPhone augmented reality app at the time).

OpenCV was putting food on the table for Shervin's family, so he began giving back to OpenCV through regular advice on the forums and by posting free OpenCV tutorials on his website (http://www.shervinemami.info/openCV.html). In 2011, he contacted the owners of other free OpenCV websites to write this book. He also began working on computer vision optimization for mobile devices at NVIDIA, working closely with the official OpenCV developers to produce an optimized version of OpenCV for Android. In 2012, he also joined the Khronos OpenVL committee for standardizing the hardware acceleration of computer vision for mobile devices, on which OpenCV will be based in the future.

I thank my wife Gay and my baby Luna for enduring the stress while I juggled my time between this book, working fulltime, and raising a family. I also thank the developers of OpenCV, who worked hard for many years to provide a high-quality product for free.

David Millán Escrivá was eight years old when he wrote his first program on an 8086 PC in BASIC, which enabled the 2D plotting of basic equations. In 2005, he finished his studies in IT at the Universitat Politécnica de Valencia with honors in human-computer interaction supported by computer vision with OpenCV (v0.96). His final project was based on this subject, and he published it at the Spanish HCI congress. He participated in Blender, an open source 3D software project, and worked on his first commercial movie, Plumiferos - Aventuras voladoras, as a Computer Graphics Software Developer.

David now has more than 10 years of experience in IT, with experience in computer vision, computer graphics, and pattern recognition, working on different projects and startups, applying his knowledge of computer vision, optical character recognition, and augmented reality. He is the author of the DamilesBlog (http://blog.damiles.com), where he publishes research articles and tutorials about OpenCV, computer vision in general, and Optical Character Recognition algorithms.

David reviewed the book gnuPlot Cookbook by Lee Phillips, published by Packt Publishing.

Thanks to Izaskun and my daughter Eider for their patience and support. Os quiero, pequeñas (I love you, little ones).

I also thank Shervin for giving me this opportunity, the OpenCV team for their work, the support of Artres, and the useful help provided by Augmate.

Khvedchenia Ievgen is a computer vision expert from Ukraine. He started his career with research and development of a camera-based driver assistance system for Harman International. He then began working as a Computer Vision Consultant for ESG. Nowadays, he is a self-employed developer focusing on the development of augmented reality applications. Ievgen is the author of the Computer Vision Talks blog (http://computer-vision-talks.com), where he publishes research articles and tutorials pertaining to computer vision and augmented reality.

I would like to say thanks to my father, who inspired me to learn programming when I was 14. His help can't be overstated. And thanks to my mom, who always supported me in all my undertakings. You always gave me the freedom to choose my own way in this life. Thanks, parents!

Thanks to Kate, a woman who totally changed my life and made it extremely full. I'm happy we're together. Love you.

Naureen Mahmood is a recent graduate from the Visualization department at Texas A&M University. She has experience working in various programming environments, animation software, and microcontroller electronics. Her work involves creating interactive applications using sensor-based electronics and software engineering. She has also worked on creating physics-based simulations and their use in special effects for animation.

I wanted to especially mention the efforts of another student from Texas A&M, whose name you will undoubtedly come across in the code included for this book. Fluid Wall was developed as part of a student project by Austin Hines and myself. Major credit for the project goes to Austin, as he was the creative mind behind it. He was also responsible for the arduous job of implementing the fluid simulation code into our application. However, he wasn't able to participate in writing this book due to a number of work- and study-related preoccupations.

Jason Saragih received his B.Eng degree in mechatronics (with honors) and Ph.D. in computer science from the Australian National University, Canberra, Australia, in 2004 and 2008, respectively. From 2008 to 2010 he was a Postdoctoral fellow at the Robotics Institute of Carnegie Mellon University, Pittsburgh, PA. From 2010 to 2012 he worked at the Commonwealth Scientific and Industrial Research Organization (CSIRO) as a Research Scientist. He is currently a Senior Research Scientist at Visual Features, an Australian tech startup company.

Dr. Saragih has made a number of contributions to the field of computer vision, specifically on the topic of deformable model registration and modeling. He is the author of two non-profit open source libraries that are widely used in the scientific community: DeMoLib and FaceTracker, both of which make use of generic computer vision libraries including OpenCV.

Roy Shilkrot is a researcher and professional in the area of computer vision and computer graphics. He obtained a B.Sc. in Computer Science from Tel-Aviv-Yaffo Academic College, and an M.Sc. from Tel-Aviv University. He is currently a PhD candidate at the Media Laboratory of the Massachusetts Institute of Technology (MIT) in Cambridge.

Roy has over seven years of experience as a Software Engineer in start-up companies and enterprises. Before joining the MIT Media Lab as a Research Assistant, he worked as a Technology Strategist in the Innovation Laboratory of Comverse, a telecom solutions provider. He also dabbled in consultancy, and worked as an intern for Microsoft Research in Redmond.

Thanks go to my wife for her limitless support and patience, my past and present advisors in both academia and industry for their wisdom, and my friends and colleagues for their challenging thoughts.

About the Reviewers

Kirill Kornyakov is a Project Manager at Itseez, where he leads the development of the OpenCV library for Android mobile devices. He manages activities for the mobile operating system's support and computer vision applications development, including performance optimization for NVIDIA's Tegra platform. Earlier he worked at Itseez on real-time computer vision systems for open source and commercial products, chief among them being stereo vision on GPU and face detection in complex environments. Kirill has a B.Sc. and an M.Sc. from Nizhniy Novgorod State University, Russia.

I would like to thank my family for their support, my colleagues from Itseez, and Nizhniy Novgorod State University for productive discussions.

Luis Díaz Más considers himself a computer vision researcher and is passionate about open source and open-hardware communities. He has been working with image processing and computer vision algorithms since 2008 and is currently finishing his PhD on 3D reconstructions and action recognition. He currently works at CATEC (http://www.catec.com.es/en), a research center for advanced aerospace technologies, where he mainly deals with the sensorial systems of UAVs. He has participated in several national and international projects where he has proven his skills in C/C++ programming, application development for embedded systems with Qt libraries, and GNU/Linux distribution configuration for embedded systems. Lately, he has been focusing on ARM and CUDA development.

Sebastian Montabone is a Computer Engineer with a Master of Science degree in computer vision. He is the author of scientific articles pertaining to image processing and has also authored a book, Beginning Digital Image Processing: Using Free Tools for Photographers.

Embedded systems have also been of interest to him, especially mobile phones. He created and taught a course about the development of applications for mobile phones, and has been recognized as a Nokia developer champion.

Currently he is a Software Consultant and Entrepreneur. You can visit his blog at www.samontab.com, where he shares his current projects with the world.

www.PacktPub.com

Support files, eBooks, discount offers and more

You might want to visit www.PacktPub.com for support files and downloads related to your book.

Did you know that Packt offers eBook versions of every book published, with PDF and ePub files available? You can upgrade to the eBook version at www.PacktPub.com and as a print book customer, you are entitled to a discount on the eBook copy. Get in touch with us at for more details.

At www.PacktPub.com, you can also read a collection of free technical articles, sign up for a range of free newsletters and receive exclusive discounts and offers on Packt books and eBooks.

http://PacktLib.PacktPub.com

Do you need instant solutions to your IT questions? PacktLib is Packt's online digital book library. Here, you can access, read and search across Packt's entire library of books.

Why Subscribe?

Fully searchable across every book published by Packt

Copy and paste, print and bookmark content

On demand and accessible via web browser

Free Access for Packt account holders

If you have an account with Packt at www.PacktPub.com, you can use this to access PacktLib today and view nine entirely free books. Simply use your login credentials for immediate access.

Preface

Mastering OpenCV with Practical Computer Vision Projects contains nine chapters, each a tutorial for an entire project from start to finish, based on OpenCV's C++ interface and including full source code. The author of each chapter was chosen for their well-regarded online contributions to the OpenCV community on that topic, and the book was reviewed by one of the main OpenCV developers. Rather than explaining the basics of OpenCV functions, this is the first book that shows how to apply OpenCV to solve whole problems, including several 3D camera projects (augmented reality, 3D Structure from Motion, Kinect interaction) and several facial analysis projects (such as skin detection, simple face and eye detection, complex facial feature tracking, 3D head orientation estimation, and face recognition). It therefore makes a great companion to existing OpenCV books.

What this book covers

Chapter 1, Cartoonifier and Skin Changer for Android, contains a complete tutorial and source code for both a desktop application and an Android app that automatically generates a cartoon or painting from a real camera image, with several possible types of cartoons including a skin color changer.

Chapter 2, Marker-based Augmented Reality on iPhone or iPad, contains a complete tutorial on how to build a marker-based augmented reality (AR) application for iPad and iPhone devices with an explanation of each step and source code.

Chapter 3, Marker-less Augmented Reality, contains a complete tutorial on how to develop a marker-less augmented reality desktop application with an explanation of what marker-less AR is and source code.

Chapter 4, Exploring Structure from Motion Using OpenCV, contains an introduction to Structure from Motion (SfM) via an implementation of SfM concepts in OpenCV. The reader will learn how to reconstruct 3D geometry from multiple 2D images and estimate camera positions.

Chapter 5, Number Plate Recognition Using SVM and Neural Networks, contains a complete tutorial and source code to build an automatic number plate recognition application using pattern recognition algorithms based on a support vector machine and Artificial Neural Networks. The reader will learn how to train pattern-recognition algorithms and use them to predict whether an image is a number plate or not, and how to classify a set of features into a character.

Chapter 6, Non-rigid Face Tracking, contains a complete tutorial and source code to build a dynamic face tracking system that can model and track the many complex parts of a person's face.

Chapter 7, 3D Head Pose Estimation Using AAM and POSIT, contains all the background required to understand what Active Appearance Models (AAMs) are and how to create them with OpenCV using a set of face frames with different facial expressions. The chapter also explains how to match a given frame using the fitting capabilities offered by AAMs. Then, by applying the POSIT algorithm, one can find the 3D head pose.

Chapter 8, Face Recognition using Eigenfaces or Fisherfaces, contains a complete tutorial and source code for a real-time face-recognition application that includes basic face and eye detection to handle the rotation of faces and varying lighting conditions in the images.

Chapter 9, Developing Fluid Wall Using the Microsoft Kinect, covers the complete development of an interactive fluid simulation called the Fluid Wall, which uses the Kinect sensor. The chapter will explain how to use Kinect data with OpenCV's optical flow methods and integrate it into a fluid solver.

You can download this chapter from: http://www.packtpub.com/sites/default/files/downloads/7829OS_Chapter9_Developing_Fluid_Wall_Using_the_Microsoft_Kinect.pdf.

What you need for this book

You don't need special knowledge of computer vision to read this book, but you should have good C/C++ programming skills and basic experience with OpenCV. Readers without experience in OpenCV may wish to read the book Learning OpenCV for an introduction to the OpenCV features, or read OpenCV 2 Cookbook for examples on how to use OpenCV with recommended C/C++ patterns, because Mastering OpenCV with Practical Computer Vision Projects will show you how to solve real problems, assuming you are already familiar with the basics of OpenCV and C/C++ development.

In addition to C/C++ and OpenCV experience, you will also need a computer and an IDE of your choice (such as Visual Studio, Xcode, Eclipse, or QtCreator, running on Windows, Mac, or Linux). Some chapters have further requirements, in particular:

To develop the Android app, you will need an Android device, Android development tools, and basic Android development experience.

To develop the iOS app, you will need an iPhone, iPad, or iPod Touch device, iOS development tools (including an Apple computer, the Xcode IDE, and an Apple Developer Certificate), and basic iOS and Objective-C development experience.

Several desktop projects require a webcam connected to your computer. Any common USB webcam should suffice, but a webcam of at least 1 megapixel may be desirable.

CMake is used in some projects, including OpenCV itself, to build across operating systems and compilers. A basic understanding of build systems is required, and knowledge of cross-platform building is recommended.

An understanding of linear algebra is expected, such as basic vector and matrix operations and eigen decomposition.
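As a quick refresher on the level of linear algebra assumed above, the following is a minimal sketch of the eigendecomposition of a symmetric 2x2 matrix, written in plain C++ without OpenCV. The struct and function names (SymEigen2x2, eigenSymmetric2x2) are hypothetical and chosen only for illustration; they are not taken from the book or from any library. It uses the closed-form solution that exists for the 2x2 symmetric case, so it is not a general-purpose solver.

```cpp
#include <cmath>

// Illustrative only: these names are assumptions, not part of the book
// or of OpenCV.
//
// Holds the eigendecomposition of the symmetric 2x2 matrix
//     [ a  b ]
//     [ b  c ]
struct SymEigen2x2 {
    double lambda1, lambda2;  // eigenvalues, with lambda1 >= lambda2
    double v1[2], v2[2];      // unit-length eigenvectors for lambda1, lambda2
};

SymEigen2x2 eigenSymmetric2x2(double a, double b, double c)
{
    SymEigen2x2 r;

    // The eigenvalues are the roots of the characteristic polynomial:
    //   lambda = (a + c) / 2  +/-  sqrt(((a - c) / 2)^2 + b^2)
    const double mean = 0.5 * (a + c);
    const double halfDiff = 0.5 * (a - c);
    const double radius = std::sqrt(halfDiff * halfDiff + b * b);
    r.lambda1 = mean + radius;
    r.lambda2 = mean - radius;

    // The eigenvector of the larger eigenvalue points along the angle
    // theta that satisfies tan(2 * theta) = 2b / (a - c).
    const double theta = 0.5 * std::atan2(2.0 * b, a - c);
    r.v1[0] = std::cos(theta);
    r.v1[1] = std::sin(theta);

    // For a symmetric matrix the second eigenvector is perpendicular.
    r.v2[0] = -std::sin(theta);
    r.v2[1] = std::cos(theta);
    return r;
}
```

For instance, calling eigenSymmetric2x2(2.0, 1.0, 2.0) describes the matrix with rows (2, 1) and (1, 2), whose eigenvalues are 3 and 1 and whose first eigenvector lies along the diagonal direction (1, 1). For larger or non-symmetric matrices a library routine would normally be used instead of a hand-written formula like this one.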

Who this book is for

Mastering OpenCV with Practical Computer Vision Projects is the perfect book for developers with basic OpenCV knowledge to create practical computer vision projects, as well as for seasoned OpenCV experts who want to add more computer vision topics to their skill set. It is aimed at senior computer science university students, graduates, researchers, and computer vision experts who wish to solve real problems using the OpenCV C++ interface, through practical step-by-step tutorials.

Conventions

In this book, you will find a number of styles of text that distinguish between different kinds of information. Here are some examples of these styles, and an explanation of their meaning.

Code words in text are shown as follows: You should put most of the code of this chapter into the cartoonifyImage() function.

A block of code is set as follows:

int cameraNumber = 0;
if (argc > 1)
    cameraNumber = atoi(argv[1]);
// Get access to the camera.
cv::VideoCapture capture;

When we wish to draw your attention to a particular part of a code block, the relevant lines or items are set in bold:

// Get access to the camera.
cv::VideoCapture capture;
capture.open(cameraNumber);
if (!capture.isOpened()) {
    std::cerr << "ERROR: Could not access the camera or video!" <<

New terms and important words are shown in bold. Words that you see on the screen, in menus or dialog boxes for example, appear in the text like this: clicking the Next button moves you to the next screen.

Note

Warnings or important notes appear in a box like this.

Tip

Tips and tricks appear like this.

Reader feedback

Feedback from our readers is always welcome. Let us know what you think about this book—what you liked or may have disliked. Reader feedback is important for us to develop titles that you really get the most out of.

To send us general feedback, simply send an e-mail to <feedback@packtpub.com>, and mention the book title in the subject of your message.

If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, see our author guide on www.packtpub.com/authors.

Customer support

Now that you are the proud owner of a Packt book, we have a number of things to help you to get the most from your purchase.

Downloading the example code

You can download the example code files for all Packt books you have purchased from your account at http://www.PacktPub.com. If you purchased this book elsewhere, you can visit http://www.PacktPub.com/support and register to have the files e-mailed directly to you.

Errata

Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you find a mistake in one of our books—maybe a mistake in the text or the code—we would be grateful if you would report this to us. By doing so, you can save other readers from frustration and help us improve subsequent versions of this book. If you find any errata, please report them by visiting http://www.packtpub.com/support, selecting your book, clicking on the errata submission form link, and entering the details of your errata. Once your errata are verified, your submission will be accepted and the errata will be uploaded on our website, or added to any list of existing errata, under the Errata section of that title. Any existing errata can be viewed by selecting your title from http://www.packtpub.com/support.

Piracy

Piracy of copyright material on the Internet is an ongoing problem across all media. At Packt, we take the protection of our copyright and licenses very seriously. If you come across any illegal copies of our works, in any form, on the Internet,

Table of Contents

Mastering OpenCV with Practical Computer Vision Projects

Credits

About the Authors

About the Reviewers

Support files, eBooks, discount offers and more

Why Subscribe?

Preface

What this book covers

What you need for this book

Who this book is for

Conventions

Note

Tip

Reader feedback

Customer support

Downloading the example code

Errata

Piracy