Fundamentals of Multimedia

About this ebook

This textbook introduces the “Fundamentals of Multimedia”, addressing real issues commonly faced in the workplace. The essential concepts are explained in a practical way to enable students to apply their existing skills to address problems in multimedia. Fully revised and updated, this new edition now includes coverage of such topics as 3D TV, social networks, high-efficiency video compression and conferencing, wireless and mobile networks, and their attendant technologies.
Features: presents an overview of the key concepts in multimedia, including color science; reviews lossless and lossy compression methods for image, video, and audio data; examines the demands placed by multimedia communications on wired and wireless networks; discusses the impact of social media and cloud computing on information sharing and on multimedia content search and retrieval; includes study exercises at the end of each chapter; provides supplementary resources for both students and instructors at an associated website.
Language: English
Publisher: Springer
Release date: Apr 9, 2014
ISBN: 9783319052908

    Fundamentals of Multimedia - Ze-Nian Li

    Part 1

    Introduction and Multimedia Data Representations

    Ze-Nian Li, Mark S. Drew, and Jiangchuan Liu, Fundamentals of Multimedia, 2nd ed., Texts in Computer Science (Springer, 2014). DOI 10.1007/978-3-319-05290-8. © Springer International Publishing Switzerland 2014

    As an introduction to multimedia, in Chap. 1 we consider the question of just what multimedia is. The components of multimedia are first introduced, and then current multimedia research topics and projects are discussed, to put the field into perspective by showing what is actually at play at the leading edge of work in this field.

    Since multimedia is indeed a practical field, Chap. 1 also supplies an overview of multimedia software tools, such as video editors and digital audio programs.

    A Taste of Multimedia

    As a taste of multimedia, in Chap. 2, we introduce a set of tasks and concerns that are considered in studying multimedia. Then issues in multimedia production and presentation are discussed, followed by a further taste: how to produce sprite animation and how to build your own video transitions.

    We then go on to review the current and future state of multimedia sharing and distribution, outlining later discussions of Social Media, Video Sharing, and new forms of TV.

    Finally, the details of some popular multimedia tools are set out for a quick start into the field.

    Multimedia Data Representations

    As in many fields, the issue of how best to represent the data is of crucial importance in the study of multimedia, and Chaps. 3–6 consider how this is addressed in this field. These chapters set out the most important data representations for use in multimedia applications. Since the main areas of concern are images, video, and audio, we begin investigating these in Chap. 3, Graphics and Image Data Representations. Before going on to look at Fundamental Concepts in Video in Chap. 5, we take a side-trip in Chap. 4 to explore several issues in the use of color, since color is vitally important in multimedia programs.

    Audio data has special properties and Chap.​ 6, Basics of Digital Audio, introduces methods to compress sound information, beginning with a discussion of digitization of audio, and linear and nonlinear quantization, including companding. MIDI is explicated, as an enabling technology to capture, store, and play back musical notes. Quantization and transmission of audio is discussed, including the notion of subtraction of signals from predicted values, yielding numbers that are easier to compress. Differential Pulse Code Modulation (DPCM) and Adaptive DPCM are introduced, and we take a look at encoder/decoder schema.
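
    As a concrete taste of the DPCM idea just described, here is a minimal sketch in Python (our own illustration, not code from the book): each sample is predicted by the previously reconstructed sample, and only the quantized prediction error is coded.

    def dpcm_encode(samples, step=4):
        """Code each sample as a quantized difference from the prediction."""
        prev, codes = 0, []
        for s in samples:
            q = round((s - prev) / step)   # quantized prediction error
            codes.append(q)
            prev += q * step               # track the decoder's reconstruction
        return codes

    def dpcm_decode(codes, step=4):
        """Rebuild the (approximate) signal from the difference codes."""
        prev, out = 0, []
        for q in codes:
            prev += q * step
            out.append(prev)
        return out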


    1. Introduction to Multimedia

    Ze-Nian Li¹  , Mark S. Drew¹   and Jiangchuan Liu¹  

    (1)

    Simon Fraser University, Vancouver, BC, Canada

    Ze-Nian Li (Corresponding author)

    Email: li@cs.sfu.ca

    Mark S. Drew

    Email: mark@cs.sfu.ca

    Jiangchuan Liu

    Email: jcliu@cs.sfu.ca

    Abstract

    In this chapter, we discuss the uses of the term multimedia, since people may have quite different, even opposing, viewpoints on what this means. This textbook is aimed at computer science or engineering students, and consequently a more application-oriented view of what multimedia consists of is emphasized. The convergence going on in this field, with computers, smartphones, games, digital TV including 3D, multimedia-based search, and so on converging in technology, means that multimedia is a field that is essentially mandatory for such students to study. Moreover, with the pervasive penetration of wireless mobile networks, the development of mobile applications for smartphones and tablets, and the advent of social media, the contents of a multimedia course arguably form the basis for much of the further studies many students will engage in. The components of multimedia are first introduced, and then current multimedia research topics and projects are discussed, to put the field into perspective by showing what is actually at play at the leading edge of work in this field. For a fuller perspective, the remarkably short history of multimedia is synopsized, from the development of the World Wide Web up to current pervasive social media and anytime/anywhere access. Since multimedia is indeed a practical field, Chapter 1 also supplies an overview of multimedia software tools, such as video editors and digital audio programs, that are typically used to produce multimedia products such as those produced in a course in this subject.

    1.1 What is Multimedia?

    People who use the term multimedia may have quite different, even opposing, viewpoints. A consumer entertainment vendor, say a phone company, may think of multimedia as interactive TV with hundreds of digital channels, or a cable-TV-like service delivered over a high-speed Internet connection. A hardware vendor might, on the other hand, like us to think of multimedia as a laptop that has good sound capability and perhaps the superiority of multimedia-enabled microprocessors that understand additional multimedia instructions.

    A computer science or engineering student reading this book likely has a more application-oriented view of what multimedia consists of: applications that use multiple modalities to their advantage, including text, images, drawings, graphics, animation, video, sound (including speech), and, most likely, interactivity of some kind. This contrasts with media that use only rudimentary computer displays such as text-only or traditional forms of printed or hand-produced material.

    The popular notion of convergence is one that inhabits the college campus as it does the culture at large. In this scenario, computers, smartphones, games, digital TV, multimedia-based search, and so on are converging in technology, presumably to arrive in the near future at a final and fully functional all-round, multimedia-enabled product. While hardware may indeed strive for such all-round devices, the present is already exciting—multimedia is part of some of the most interesting projects underway in computer science, with the keynote being interactivity. The convergence going on in this field is in fact a convergence of areas that have in the past been separated but are now finding much to share in this new application area. Graphics, visualization, HCI, computer vision, data compression, graph theory, networking, database systems—all have important contributions to make in multimedia at the present time.

    1.1.1 Components of Multimedia

    The multiple modalities of text, audio, images, drawings, animation, video, and interactivity in multimedia are put to use in ways as diverse as

    Geographically based, real-time augmented-reality, massively multiplayer online video games, making use of any portable device, such as smartphones, laptops, or tablets, which function as GPS-aware mobile game consoles. An example is a game in which players reinforce and link friendly portals and attack enemy ones, played on GPS-enabled devices: players must physically move to the portals (which are overlaid on real sites such as public art, interesting buildings, or parks) in order to interact with them.

    Shapeshifting TV, where viewers vote on the plot path by phone text-messages, which are parsed to direct plot changes in real-time.

    A camera that suggests what would be the best type of next shot so as to adhere to good technique guidelines for developing storyboards.

    A Web-based video editor that lets anyone create a new video by editing, annotating, and remixing professional videos on the cloud.

    Cooperative education environments that allow schoolchildren to share a single educational game using two mice at once that pass control back and forth.

    Searching (very) large video and image databases for target visual objects, using semantics of objects.

    Compositing of artificial and natural video into hybrid scenes, placing real-appearing computer graphics and video objects into scenes so as to take the physics of objects and lights (e.g., shadows) into account.

    Visual cues of video-conference participants, taking into account gaze direction and attention of participants.

    Making multimedia components editable—allowing the user side to decide what components, video, graphics, and so on are actually viewed and allowing the client to move components around or delete them—making components distributed.

    Building inverse-Hollywood applications that can recreate the process by which a video was made, allowing storyboard pruning and concise video summarization.

    From a computer science student’s point of view, what makes multimedia interesting is that so much of the material covered in traditional computer science areas bears on the multimedia enterprise. In today’s digital world, multimedia content is recorded and played, displayed, or accessed by digital information content processing devices, ranging from smartphones, tablets, laptops, personal computers, smart TVs, and game consoles, to servers and datacenters, over such distribution media as tapes, hard drives, and disks, or more popularly nowadays, wired and wireless networks. This leads to a wide variety of research topics:

    Multimedia processing and coding. This includes audio/image/video processing, compression algorithms, multimedia content analysis, content-based multimedia retrieval, multimedia security, and so on.

    Multimedia system support and networking. People look at such topics as network protocols, Internet and wireless networks, operating systems, servers and clients, and databases.

    Multimedia tools, end systems, and applications. These include hypermedia systems, user interfaces, authoring systems, multimodal interaction, and integration: ubiquity—Web-everywhere devices, multimedia education, including computer supported collaborative learning and design, and applications of virtual environments.

    Multimedia research touches almost every branch of computer science. For example, data mining is an important current research area, and a large database of multimedia data objects is a good example of just what big data we may be interested in mining; telemedicine applications, such as telemedical patient consultative encounters, are multimedia applications that place a heavy burden on network architectures. Multimedia research is also highly interdisciplinary, involving such other research fields as electrical engineering, physics, and psychology; signal processing for audio/video signals is an essential topic in electrical engineering; color in image and video has a long history and solid foundation in physics; more importantly, all multimedia data are to be perceived by human beings, which is certainly related to medical and psychological research.

    1.2 Multimedia: Past and Present

    To place multimedia in its proper context, in this section we briefly scan the history of multimedia, a relatively recent part of which is the connection between multimedia and hypermedia. We also show the rapid evolution and revolution of multimedia in the new millennium with the new generation of computing and communication platforms.

    1.2.1 Early History of Multimedia

    A brief history of the use of multimedia to communicate ideas might begin with newspapers, which were perhaps the first mass communication medium, using text, graphics, and images. Before the still-image camera was invented, these graphics and images were generally hand-drawn.

    Joseph Nicéphore Niépce captured the first natural image from his window in 1826 using a sliding wooden box camera [1, 2]. It was made using an 8-h exposure on pewter coated with bitumen. Later, Alphonse Giroux built the first commercial camera with a double-box design. It had an outer box fitted with a landscape lens, and an inner box holding a ground glass focusing screen and image plate. Sliding the inner box allowed objects at different distances to be brought into focus. Similar cameras were used for exposing wet silver-surfaced copper plates, commercially introduced in 1839. In the 1870s, wet plates were replaced by the more convenient dry plates. Figure 1.1 (image from author’s own collection) shows an example of a nineteenth-century dry-plate camera, with bellows for focusing. By the end of the nineteenth century, film-based cameras were introduced, and they soon became dominant, until replaced by digital cameras.

    Fig. 1.1 A vintage dry-plate camera. E&H T Anthony model Champion, circa 1890

    Thomas Alva Edison’s phonograph, invented in 1877, was the first device able to record and reproduce sound. It originally recorded sound onto a tinfoil sheet phonograph cylinder [3]. Figure 1.2 shows an example of an Edison phonograph (Edison GEM, 1905; image from author’s own collection).

    Fig. 1.2 An Edison phonograph, model GEM. Note the patent plate in the bottom picture, which suggests that the importance of patents had long been realized and also how serious Edison was in protecting his inventions. Despite the warnings in the plate, this particular phonograph was modified by the original owner, a good DIYer 100 years ago, to include a more powerful spring motor from an Edison Standard model and a large flower horn from the Tea Tray Company

    Fig. 1.3 Evolution of audio storage media. Left to right: an Edison cylinder record, a flat vinyl record, a reel-to-reel magnetic tape, a cassette tape, and a CD

    The phonograph was later improved by Alexander Graham Bell. The most notable improvements included wax-coated cardboard cylinders and a cutting stylus that moved from side to side in a zigzag pattern across the record. Emile Berliner further transformed the phonograph cylinder into the gramophone record. Each side of such a flat disk has a spiral groove running from the periphery to near the center, which can be conveniently played by a turntable with a tonearm and a stylus. These components were improved over time in the twentieth century, eventually enabling sound reproduction very close to the original. The gramophone record was one of the dominant audio recording formats throughout much of the twentieth century. From the mid-1980s, phonograph use declined sharply because of the rise of audio tapes, and later the Compact Disc (CD) and other digital recording formats [4]. Figure 1.3 shows the evolution of audio storage media, from the Edison cylinder record, to the flat vinyl record, to magnetic tapes (reel-to-reel and cassette), and the modern digital CD.

    Motion pictures were originally conceived of in the 1830s to observe motion too rapid for perception by the human eye. Edison again commissioned the invention of a motion picture camera in 1887 [5]. Silent feature films appeared from 1910 to 1927; the silent era effectively ended with the release of The Jazz Singer in 1927.

    In 1895, Guglielmo Marconi conducted the first wireless radio transmission at Pontecchio, Italy, and a few years later (1901), he detected radio waves beamed across the Atlantic [6]. Initially invented for telegraphy, radio is now a major medium for audio broadcasting. In 1909, Marconi shared the Nobel Prize for Physics.¹

    Television, or TV for short, was the new medium for the twentieth century [7]. In 1884, Paul Gottlieb Nipkow, a 23-year-old university student in Germany, patented the first electromechanical television system which employed a spinning disk with a series of holes spiraling toward the center. The holes were spaced at equal angular intervals such that, in a single rotation, the disk would allow light to pass through each hole and onto a light-sensitive selenium sensor which produced the electrical pulses. As an image was focused on the rotating disk, each hole captured a horizontal slice of the whole image. Nipkow’s design would not be practical until advances in amplifier tube technology, in particular, the cathode ray tube (CRT), became available in 1907. Commercially available since the late 1920s, CRT-based TV established video as a commonly available medium and has since changed the world of mass communication.

    All the media mentioned above are in analog format, for which the time-varying feature (variable) of the signal is a continuous representation of the input, i.e., analogous to the input audio, image, or video signal. The connection between computers and digital media, i.e., media data represented in a discrete, binary format, actually emerged over only a short period:

    1967 Nicholas Negroponte formed the Architecture Machine Group at MIT.

    1969 Nelson and van Dam at Brown University created an early hypertext editor called FRESS [8]. The present-day Intermedia project by the Institute for Research in Information and Scholarship (IRIS) at Brown is the descendant of that early system.

    1976 The MIT Architecture Machine Group proposed a project entitled Multiple Media. This resulted in the Aspen Movie Map, the first videodisk, in 1978.

    1982 The Compact Disc (CD) was made commercially available by Philips and Sony; it soon became the standard and most popular medium for digital audio data, replacing analog magnetic tape.

    1985 Negroponte and Wiesner co-founded the MIT Media Lab, a leading research institution investigating digital video and multimedia.

    1990 Kristina Hooper Woolsey headed the Apple Multimedia Lab, with a staff of 100. Education was a chief goal.

    1991 MPEG-1 was approved as an international standard for digital video. Its further development led to newer standards, MPEG-2, MPEG-4, and further MPEGs, in the 1990s.

    1991 The introduction of PDAs in 1991 began a new period in the use of computers in general and multimedia in particular. This development continued in 1996 with the marketing of the first PDA with no keyboard.

    1992 JPEG was accepted as the international standard for digital image compression, which remains widely used today (say, by virtually every digital camera).

    1992 The first audio multicast on the multicast backbone (MBone) was made.

    1995 The Java language was created for platform-independent application development, and it was widely used for developing multimedia applications.

    1996 DVD video was introduced; high-quality, full-length movies were distributed on a single disk. The DVD format promised to transform the music, gaming, and computer industries.

    1998 Handheld MP3 audio players were introduced to the consumer market, initially with 32 MB of flash memory.

    1.2.2 Hypermedia, WWW, and Internet

    The early studies laid a solid foundation for the capturing, representation, compression, and storage of each type of media. Multimedia, however, is not simply about putting different media together; rather, it focuses more on their integration, so as to enable rich interaction among them, and also between media and human beings.

    In 1945, as part of MIT’s postwar deliberations on what to do with all those scientists employed on the war effort, Vannevar Bush wrote a landmark article [9] describing what amounts to a hypermedia system, called Memex. Memex was meant to be a universally useful and personalized memory device that even included the concept of associative links—it really is the forerunner of the World Wide Web. After World War II, 6,000 scientists who had been hard at work on the war effort suddenly found themselves with time to consider other issues, and the Memex idea was one fruit of that new freedom.

    In the 1960s, Ted Nelson started the Xanadu project and coined the term hypertext. Xanadu was the first attempt at a hypertext system—Nelson called it a magic place of literary memory.

    We may think of a book as a linear medium, basically meant to be read from beginning to end. In contrast, a hypertext system is meant to be read nonlinearly, by following links that point to other parts of the document, or indeed to other documents. Figure 1.4 illustrates this familiar idea.

    Fig. 1.4 Hypertext is nonlinear

    Douglas Engelbart, greatly influenced by Vannevar Bush’s As We May Think, demonstrated the On-Line System (NLS), another early hypertext program in 1968. Engelbart’s group at Stanford Research Institute aimed at augmentation, not automation, to enhance human abilities through computer technology. NLS consisted of such critical ideas as an outline editor for idea development, hypertext links, teleconferencing, word processing, and email, and made use of the mouse pointing device, windowing software, and help systems [10].

    Hypermedia, again first introduced by Ted Nelson, goes beyond text-only media. It includes a wide array of media, such as graphics, images, and especially the continuous media (sound and video), and links them together. The World Wide Web (WWW, or simply the Web) is the best, and largest, example of a hypermedia application.

    Amazingly, this most predominant of networked multimedia applications has its roots in nuclear physics! In 1990, Tim Berners-Lee proposed the World Wide Web to CERN (the European Center for Nuclear Research) as a means for organizing and sharing their work and experimental results. With approval from CERN, he started developing a hypertext server, browser, and editor on a NeXTStep workstation. His team invented the Hypertext Markup Language (HTML) and the Hypertext Transfer Protocol (HTTP) for this purpose, too.

    HyperText Markup Language (HTML)

    It is recognized that documents need to have formats that are human-readable and that identify structure and elements. Charles Goldfarb, Edward Mosher, and Raymond Lorie developed the Generalized Markup Language (GML) for IBM. In 1986, the ISO released a final version of the Standard Generalized Markup Language (SGML), mostly based on the earlier GML.

    HTML is a language for publishing hypermedia on the Web [11]. It is defined using SGML and derives elements that describe generic document structure and formatting. Since it uses ASCII, it is portable to all different (even binary-incompatible) computer hardware, which allows for global exchange of information. The current version of HTML is 4.01, and a newer version, HTML5, is still under development.

    HTML uses tags to describe document elements. The tags are in the format <token params> to define the start of a document element and </token> to define the end of the element. Some elements have only inline parameters and do not require ending tags. HTML divides the document into a HEAD and a BODY part as follows:

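    The original listing appears only as an image in the source; the following is a minimal reconstruction of the skeleton being described (placeholder content, not the book’s exact listing):

    <html>
      <head>
        <title>Page title and document definitions go here</title>
      </head>
      <body>
        Document structure and content go here.
      </body>
    </html>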

    The HEAD describes document definitions, which are parsed before any document rendering is done. These include page title, resource links, and meta-information the author decides to specify. The BODY part describes the document structure and content. Common structure elements are paragraphs, tables, forms, links, item lists, and buttons.

    A very simple HTML page is as follows:

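    Again, the original listing survives only as an image in the source; a plausible minimal page (title and contents are our own placeholders) is:

    <html>
      <head>
        <title>A sample webpage</title>
      </head>
      <body>
        <p>Hello, multimedia!</p>
        <a href="http://www.w3.org/">A link to the W3C</a>
      </body>
    </html>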

    Naturally, HTML has more complex structures and can be mixed with other standards. The standard has evolved to allow integration with script languages, dynamic manipulation of almost all elements and properties after display on the client side (dynamic HTML), and modular customization of all rendering parameters using a markup language called Cascading Style Sheets (CSS). Nonetheless, HTML has rigid, nondescriptive structure elements, and modularity is hard to achieve.
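
    As a brief illustration of the CSS idea (our own example, not from the book), rendering rules live apart from document structure, so the same HTML can be restyled by editing only the stylesheet:

    h1 { text-align: center; font-family: sans-serif; }
    p.note { color: gray; font-size: 90%; }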

    Extensible Markup Language (XML)

    There was also a need for a markup language for the Web that has modularity of data, structure, and view. That is, we would like a user or an application to be able to define the tags (structure) allowed in a document and their relationship to each other, in one place, then define data using these tags in another place (the XML file), and finally, define in yet another document how to render the tags.

    Suppose we wanted to have stock information retrieved from a database according to a user query. Using XML, we would use a global Document Type Definition (DTD) we have already defined for stock data. A server-side script would abide by the DTD rules to generate an XML document according to the query, using data from the database. Finally, we would send users an XML Style Sheet (XSL), depending on the type of device they use to display the information, so that the document looks best both on a computer with a 27-in. LED display and on a small-screen cellphone.
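
    A minimal sketch of this arrangement, with hypothetical element names (the book does not give the actual DTD or data), might look as follows.

    stock.dtd, the shared Document Type Definition:

    <!ELEMENT portfolio (stock+)>
    <!ELEMENT stock (symbol, price)>
    <!ELEMENT symbol (#PCDATA)>
    <!ELEMENT price (#PCDATA)>

    stocks.xml, as generated by the server-side script:

    <?xml version="1.0"?>
    <!DOCTYPE portfolio SYSTEM "stock.dtd">
    <portfolio>
      <stock><symbol>ABC</symbol><price>42.50</price></stock>
      <stock><symbol>XYZ</symbol><price>17.25</price></stock>
    </portfolio>

    An XSL stylesheet would then map these elements to a rendering appropriate for each device.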

    The original XML version was XML 1.0, approved by the W3C in February 1998 and currently in its fifth edition, as of 2008. The original version is still recommended. The second version, XML 1.1, was introduced in 2004 and is currently in its second edition, as of 2006. XML syntax looks like HTML syntax, although it is much stricter. All tags are lowercase, and a tag that has only inline data has to terminate itself, for example, <token params />. XML also uses namespaces, so that multiple DTDs declaring different elements but with similar tag names can have their elements distinguished. DTDs can be imported from URIs as well. As an example of an XML document structure, here is the definition for a small XHTML document:

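    The original figure survives only as an image; a typical minimal XHTML 1.0 document consistent with the surrounding description is:

    <?xml version="1.0" encoding="UTF-8"?>
    <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"
        "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
    <html xmlns="http://www.w3.org/1999/xhtml">
      <head>
        <title>A small XHTML document</title>
      </head>
      <body>
        <p>Hello!</p>
      </body>
    </html>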

    All XML documents start with an XML declaration, such as <?xml version="1.0"?>. The <!DOCTYPE> tag is a special tag used for importing DTDs. Since it is a DTD definition, it does not adhere to XML rules. The xmlns attribute defines a unique XML namespace for the document elements. In this case, the namespace is the XHTML specifications website.

    In addition to XML specifications, the following XML-related specifications are standardized:

    XML Protocol. Used to exchange XML information between processes. It is meant to supersede HTTP and extend it as well as to allow interprocess communications across networks.

    XML Schema. A more structured and powerful language for defining XML data types (tags). Unlike a DTD, XML Schema uses XML tags for type definitions; a small example appears after this list.

    XSL. This is basically CSS for XML. However, XSL is much more complex, having three parts: XSL Transformations (XSLT), XML Path Language (XPath), and XSL Formatting Objects.
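
    As promised above, here is a minimal XML Schema fragment (our own illustration) declaring one element and its data type, itself written as XML:

    <xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
      <xs:element name="price" type="xs:decimal"/>
    </xs:schema>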

    The WWW quickly gained popularity, due to the amount of information available from web servers, the capacity to post such information, and the ease of navigating such information with a web browser, particularly after Marc Andreessen’s introduction of the Mosaic browser in 1993 (which later became Netscape).

    Today, the Web technology is maintained and developed by the World Wide Web Consortium (W3C), together with the Internet Engineering Task Force (IETF) to standardize the technologies. The W3C has listed the following three goals for the WWW: universal access of web resources (by everyone everywhere), effectiveness of navigating available information, and responsible use of posted material.

    It is worth mentioning that the Internet serves as the underlying vehicle for the WWW and the multimedia content shared over it. Starting from the Advanced Research Projects Agency Network (ARPANET) with only two nodes in 1969, the Internet gradually became the dominant global network, interconnecting numerous computer networks and their billions of users with the standard Internet protocol suite (TCP/IP). It evolved together with digital multimedia. On one hand, the Internet carries much of the multimedia content. It has largely displaced optical disks as the storage and distribution medium in the movie industry, and it is reshaping the TV broadcast industry at an ever-accelerating pace. On the other hand, the Internet was not initially designed for multimedia data and was not particularly friendly to multimedia traffic. Multimedia data, now occupying almost 90 % of the Internet bandwidth, is the key driving force toward enhancing the existing Internet and developing the next generation of the Internet, as we will see in Chaps. 15 and 16.

    1.2.3 Multimedia in the New Millennium

    Entering the new millennium, we have witnessed fast evolution toward a new generation of social, mobile, and cloud computing for multimedia processing and sharing. Today, the role of the Internet itself has evolved from its original use as a communication tool toward providing easier and faster sharing of an infinite supply of information, and multimedia content itself has also been greatly enriched. High-definition and even 3D/multiview videos can be readily captured and browsed by personal computing devices, and conveniently stored and processed with remote cloud resources. More importantly, users are now actively engaged as part of a social ecosystem, rather than passively receiving media content. The revolution is being driven further by the deep penetration of 3G/4G wireless networks and smart mobile devices. Offering highly intuitive interfaces and exceptionally rich multimedia functionality, these devices have been seamlessly integrated with online social networking for instant media content generation and sharing.

    Below, we list some important milestones in the development of multimedia in the new millennium. We believe that most of the readers of this textbook are familiar with them, as we are all in this Internet age, witnessing its dramatic changes; many readers, particularly the younger generation, would be even more familiar with the use of such multimedia services as YouTube, Facebook, and Twitter than the authors.

    2000 WWW size was estimated at over one billion pages. Sony unveiled the first Blu-ray Disc prototypes in October 2000, and the first prototype player was released in April 2003 in Japan.

    2001 The first peer-to-peer file sharing (mostly MP3 music) system, Napster, was shut down by court order, but many new peer-to-peer file sharing systems, e.g., Gnutella, eMule, and BitTorrent, were launched in the following years. Coolstreaming was the first large-scale peer-to-peer streaming system deployed in the Internet, attracting over one million users in 2004. Later years saw the booming of many commercial peer-to-peer TV systems, e.g., PPLive, PPStream, and UUSee, particularly in East Asia. NTT DoCoMo in Japan launched the first commercial 3G wireless network on October 1. 3G then started to be deployed worldwide, promising broadband wireless mobile data transfer for multimedia data.

    2003 Skype was released for free peer-to-peer voice over the Internet.

    2004 Web 2.0 was recognized as a new way in which software developers and end-users use the Web (it is not a technical specification for a new Web). The idea is to promote user collaboration and interaction so as to generate content in a virtual community, as opposed to simply passively viewing content. Examples include social networking, blogs, wikis, etc. Facebook, the most popular online social network, was founded by Mark Zuckerberg. Flickr, a popular photo hosting and sharing site, was created by Ludicorp, a Vancouver-based company founded by Stewart Butterfield and Caterina Fake.

    2005 YouTube was created, providing an easy portal for video sharing; it was purchased by Google in late 2006. Google launched its online map service, with satellite imagery, real-time traffic, and Street View added later.

    2006 Twitter was created, and rapidly gained worldwide popularity, reaching 500 million registered users in 2012, who posted 340 million tweets per day. Amazon launched its cloud computing platform, Amazon Web Services (AWS); the most central and well-known of these services are Amazon EC2 and Amazon S3. Nintendo introduced the Wii home video game console, whose remote controller can detect movement in three dimensions.

    2007 Apple launched the first-generation iPhone, running the iOS mobile operating system. Its touch screen enabled very intuitive operations, and the associated App Store offered numerous mobile applications. Google unveiled the Android mobile operating system, along with the founding of the Open Handset Alliance: a consortium of hardware, software, and telecommunication companies devoted to advancing open standards for mobile devices. The first Android-powered phone was sold in October 2008, and Google Play, Android’s primary app store, was soon launched. In the following years, tablet computers using iOS, Android, and Windows, with larger touch screens, joined the ecosystem, too.

    2009 The first LTE (Long Term Evolution) network was set up in Oslo, Norway, and Stockholm, Sweden, an important step toward 4G wireless networking. James Cameron’s film Avatar created a surge of interest in 3D video.

    2010 Netflix, which used to be a DVD rental service provider, migrated its infrastructure to the Amazon AWS cloud computing platform and became a major online streaming video provider. Master copies of digital films from movie studios are stored on Amazon S3, and each film is encoded into over 50 different versions based on video resolution and audio quality, using machines on the cloud. In total, Netflix has over 1 petabyte of data stored on Amazon’s cloud. Microsoft introduced Kinect, a horizontal bar with full-body 3D motion capture, facial recognition, and voice recognition capabilities, for its game console Xbox 360.

    2012 HTML5 subsumes the previous version, HTML4, which was standardized in 1997. HTML5 is a W3C Candidate Recommendation. It is meant to provide support for the latest multimedia formats while maintaining consistency for current web browsers and devices, along with the ability to run on low-powered devices such as smartphones and tablets.

    2013 Twitter offered Vine, a mobile app that enables its users to create and post short video clips of up to 6 s. Sony released its PlayStation 4, a video game console integrated with Gaikai, a cloud-based gaming service that offers streaming video game content. 4K-resolution TV started to become available in the consumer market.

    1.3 Multimedia Software Tools: A Quick Scan

    For a concrete appreciation of the current state of multimedia software tools available for carrying out tasks in multimedia, we now include a quick overview of software categories and products.

    These tools are really only the beginning; a fully functional multimedia project can call for stand-alone programming as well as the use of predefined tools, to fully exercise the capabilities of machines and the Internet.²

    In courses we teach using this text, students are encouraged to try these tools, producing full-blown and creative multimedia productions. Yet this textbook is not a how-to book about using these tools; it is about understanding the fundamental design principles behind them! With a clear understanding of the key multimedia data structures, algorithms, and protocols, a student can make smarter and more advanced use of such tools, so as to fully unleash their potential, and even improve the tools themselves or develop new ones.

    The categories of software tools we examine here are

    Music sequencing and notation

    Digital audio

    Graphics and image editing

    Video editing

    Animation

    Multimedia authoring.

    1.3.1 Music Sequencing and Notation

    Cakewalk Pro Audio

    Cakewalk Pro Audio is a very straightforward music-notation program for sequencing. The term sequencer comes from older devices that stored sequences of notes in the MIDI music language (events, in MIDI; see Sect. 6.​2).

    Finale, Sibelius

    Finale and Sibelius are two composer-level notation systems; these programs likely set the bar for excellence, but their learning curve is fairly steep.

    1.3.2 Digital Audio

    Digital Audio tools deal with accessing and editing the actual sampled sounds that make up audio.

    Adobe Audition

    Adobe Audition (formerly Cool Edit) is a powerful, popular digital audio toolkit with capabilities (for PC users, at least) that emulate a professional audio studio, including multitrack productions and sound file editing, along with digital signal processing effects.

    Sound Forge

    Like Audition, Sound Forge is a sophisticated PC-based program for editing WAV files. Sound can be captured through the sound card, and then mixed and edited. It also permits adding complex special effects.

    Pro Tools

    Pro Tools is a high-end integrated audio production and editing environment that runs on Macintosh computers as well as Windows. Pro Tools offers easy MIDI creation and manipulation as well as powerful audio mixing, recording, and editing software. Full effects depend on purchasing a dongle.

    1.3.3 Graphics and Image Editing

    Adobe Illustrator

    Illustrator is a powerful publishing tool for creating and editing vector graphics, which can easily be exported to use on the Web.

    Adobe Photoshop

    Photoshop is the standard tool for graphics, image processing, and image manipulation. Layers of images, graphics, and text can be separately manipulated for maximum flexibility, and its set of filters permits creation of sophisticated lighting effects.

    Adobe Fireworks

    Fireworks is software for making graphics specifically for the Web. It includes a bitmap editor, a vector graphics editor, and a JavaScript generator for buttons and rollovers.

    Adobe Freehand

    Freehand is a text and web graphics editing tool that supports many bitmap formats, such as GIF, PNG, and JPEG. These are pixel-based formats, in that each pixel is specified. It also supports vector-based formats, in which endpoints of lines are specified instead of the pixels themselves, such as SWF (Adobe Flash). It can also read Photoshop format.

    1.3.4 Video Editing

    Adobe Premiere

    Premiere is a simple, intuitive video editing tool for nonlinear editing—putting video clips into any order. Video and audio are arranged in tracks, like a musical score. It provides a large number of video and audio tracks, superimpositions, and virtual clips. A large library of built-in transitions, filters, and motions for clips allows easy creation of effective multimedia productions.

    CyberLink PowerDirector

    PowerDirector, produced by CyberLink Corp., is by far the most popular nonlinear video editing software. It provides a rich selection of audio and video features and special effects and is easy to use. It supports all modern video formats, including AVCHD 2.0, 4K Ultra HD, and 3D video. It supports 64-bit video processing, graphics card acceleration, and multiple CPUs. Its processing and preview are much faster than Premiere’s. However, it is not as programmable as Premiere.

    Adobe After Effects

    After Effects is a powerful video editing tool that enables users to add and change existing movies with effects such as lighting, shadows, and motion blurring. It also allows layers, as in Photoshop, to permit manipulating objects independently.

    Final Cut Pro

    Final Cut Pro is a video editing tool offered by Apple for the Macintosh platform. It allows the input of video and audio from numerous sources, and provides a complete environment, from editing and color correction to the final output of a video file.

    1.3.5 Animation

    Multimedia APIs

    Java3D is an API used by Java to construct and render 3D graphics, similar to the way Java Media Framework handles media files. It provides a basic set of object primitives (cube, splines, etc.) upon which the developer can build scenes. It is an abstraction layer built on top of OpenGL or DirectX (the user can select which), so the graphics are accelerated.

    DirectX, a Windows API that supports video, images, audio, and 3D animation, is a common API used to develop multimedia Windows applications such as computer games.

    OpenGL was created in 1992 and is still a popular 3D API today. OpenGL is highly portable and will run on all popular modern operating systems, such as UNIX, Linux, Windows, and Macintosh.

    Animation Software

    Autodesk 3ds Max (formerly 3D Studio Max) includes a number of high-end professional tools for character animation, game development, and visual effects production. Models produced using this tool can be seen in several consumer games, such as those for the Sony PlayStation.

    Autodesk Softimage (previously called Softimage XSI) is a powerful modeling, animation, and rendering package for animation and special effects in films and games.

    Autodesk Maya, a competing product to Softimage, is a complete modeling package. It features a wide variety of modeling and animation tools, such as to create realistic clothes and fur. Autodesk Maya runs on Windows, Mac OS, and Linux.

    GIF Animation Packages

    For a much simpler approach to animation that also allows quick development of effective small animations for the Web, many shareware and other programs permit creating animated GIF images. GIFs can contain several images, and looping through them creates a simple animation.
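
    As a sketch of how such a tool works (our own example; it assumes the Python Pillow library rather than any specific package named here):

    from PIL import Image, ImageDraw

    # Build ten frames, each with a dot shifted slightly to the right.
    frames = []
    for i in range(10):
        frame = Image.new("L", (64, 64), 0)            # grayscale canvas
        draw = ImageDraw.Draw(frame)
        draw.ellipse((i * 4, 24, i * 4 + 16, 40), fill=255)
        frames.append(frame)

    # Looping through the frames yields a simple animation.
    frames[0].save("dot.gif", save_all=True, append_images=frames[1:],
                   duration=100, loop=0)               # 100 ms/frame, repeat forever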

    Linux also provides some simple animation tools, such as animate.

    1.3.6 Multimedia Authoring

    Tools that provide the capability for creating a complete multimedia presentation, including interactive user control, are called authoring programs.

    Adobe Flash

    Flash allows users to create interactive movies by using the score metaphor—a timeline arranged in parallel event sequences, much like a musical score consisting of musical notes. Elements in the movie are called symbols in Flash. Symbols are added to a central repository, called a library, and can be added to the movie’s timeline. Once the symbols are present at a specific time, they appear on the Stage, which represents what the movie looks like at a certain time, and can be manipulated and moved by the tools built into Flash. Finished Flash movies are commonly used to show movies or games on the Web.

    Adobe Director

    Director uses a movie metaphor to create interactive presentations. This powerful program includes a built-in scripting language, Lingo, that allows creation of complex interactive movies.³ The cast of characters in Director includes bitmapped sprites, scripts, music, sounds, and palettes. Director can read many bitmapped file formats. The program itself allows a good deal of interactivity, and Lingo, with its own debugger, allows more control, including control over external devices.

    Dreamweaver

    Dreamweaver is a webpage authoring tool that allows users to produce multimedia presentations without learning any HTML.

    1.4 Multimedia in the Future

    This textbook emphasizes the fundamentals of multimedia, focusing on the basic and mature techniques that collectively form the foundation of today’s multimedia systems. It is, however, worth noting that multimedia research remains young and is vigorously growing. It brings many exciting topics together, and we will certainly see great innovations that dramatically change our lives in the near future [12].

    For example, researchers are interested in camera-based object tracking technology. But while face detection is ubiquitous, with camera software doing a reasonable job of identifying faces in images and video, face detection and object tracking are by no means solved problems today (although for face tracking, combining multiple poses may be a promising direction [13]). As a matter of fact, interest in these topics is somewhat flagging, with a need for some new breakthrough. Instead, the current emphasis is on event detection, e.g., for security applications such as a person leaving a bag unattended in an airport.

    While shot detection—finding where scene changes exist in video—and video classification have for some time been of interest, new challenges have now arisen in these old subjects due to the abundance of online video that is not professionally edited.

    Extending conventional 2D video, today’s 3D capture technology is fast enough to allow acquiring dynamic characteristics of human facial expression during speech, so as to synthesize highly realistic facial animation from speech for low-bandwidth applications. Beyond this, multiple views from several cameras, or from a single camera under differing lighting, can accurately acquire data that gives both the shape and surface properties of materials, thus automatically generating synthetic graphics models. This allows photo-realistic (video-quality) synthesis of virtual actors. Multimedia applications aimed at handicapped persons, particularly those with poor vision, and at the elderly are a rich field of endeavor in current research, too. Another related example is Google Glass, which, equipped with an optical head-mounted display, enables interactive, smartphone-like information display for its users. Wirelessly connected to the Internet, it can also communicate using natural-language voice commands. All these make a good step toward wearable computing, which has great potential.

    Online social media, such as YouTube, Facebook, and Twitter, appeared only in the past decade, but are rapidly changing the way information is generated and shared, and even our daily lives. Research on social media is likely one of the most important areas under scrutiny, with some 100,000 academic articles produced per year in this area. This leads to a series of interesting new research topics:

    Crowdsourcing for multimedia. This concept, in which the input of a large number of human contributors is made use of in multimedia projects, has experienced a large growth in attention. One example is having people provide tags to aid in understanding the visual content of images and video, using services such as Amazon’s Mechanical Turk to outsource such time-consuming tasks as semantic video annotation to a large number of workers who are willing to work for a small reward or just for fun. A straightforward use of such large populations is to analyze sentiment, such as the popularity of a particular brand name as evidenced by reading several thousand tweets on the subject. Another example is digital fashion, which aims to develop smart clothing that can communicate with other such enhanced clothing using wireless communication, so as to artificially enhance human interaction in a social setting. The vision here is to use technology to allow individuals to have certain thoughts and feelings broadcast automatically, for exchange with others equipped with similar technology.

    Executable academic papers. In science and engineering, one traditional way to communicate findings is by publication of papers in academic journals. A new idea that exploits the completely digital pathway for broadcast of information is called executable papers. The idea here is that results discussed in a published paper are often difficult to reproduce, because the datasets being used and the programming code working on that data are typically not supplied as part of the publication. The executable-papers approach allows the reader to interact with, and interactively manipulate, the data and code, to further understand the findings being presented. Moreover, the concept includes allowing the reader to rerun the code, change parameters, or upload different data.

    Animated lifelike virtual agents, e.g., virtual educators, in particular as social partners for special-needs children, and various other roles that are designed to demonstrate emotion and personality, with a variety of embodiments. The objective is flexibility, as opposed to a fixed script.

    Behavioral science models can be brought into play to model interaction between people, which can then be extended to enable natural interaction by virtual characters. Such augmented interaction applications can be used to develop interfaces between real and virtual humans for tasks such as augmented storytelling.

    Each of these application areas pushes the development of computer science generally, stimulates new applications, and fascinates practitioners. The chief leaders of multimedia research have generated several overarching grand challenge problems, which serve as a snapshot of the state of the art in multimedia research. At present, some of these consist of the following:

    Social Event Detection for Social Multimedia: discovering social events planned and attended by people, as indicated by collections of multimedia content that was captured by people and uploaded to social-media sites.

    Search and Hyperlinking of Television Content: finding relevant video segments for a particular subject and generating useful hyperlinks for each of these segments. The underlying idea is that instead of people performing a search and following hyperlinks, this could all be automated intelligently.

    Geo-coordinate Prediction for Social Multimedia: estimating the GPS coordinates of images and videos, using all the data available including tags, audio, and users.

    Violent Scenes Detection in Film: automatically detecting portions of movies depicting violence. Again, all aspects available such as text and audio could be brought into play.

    Preserving Privacy in Surveillance Videos: methods for obscuring private information (such as faces on Google Earth), so as to render privacy-sensitive elements of video unrecognizable, while still allowing the video to be viewable by people and also allowing computer vision tasks such as object tracking.

    Spoken Term Web Search: searching for audio content within audio content by using an audio query.

    Question Answering for the Spoken Web: a variant on the above, specifically for matching spoken questions with a collection of spoken answers.

    Soundtrack Selection for Commercials: choosing the most suitable music soundtrack from a list of candidates. The objective here is to use extra features (meta-data) such as text, descriptive features calculated for audio and for video, webpages, and social tags to help in the task.

    Solutions to these challenges can be difficult, but the impact can be enormous, not only to the IT industry, but also to everyone, as we all live in a digital multimedia world. We want this textbook to bring valuable knowledge about multimedia to you, and hope you enjoy it and perhaps even contribute to this promising field (maybe for some of the topics listed above, or beyond) in your future career!

    1.5 Exercises

    1.

    Using your own words, describe what multimedia is. Is multimedia simply a collection of different types of media?

    2.

    Identify three novel multimedia applications. Discuss why you think these are novel and their potential impact.

    3.

    Discuss the relation between multimedia and hypermedia.

    4.

    Briefly explain, in your own words, Memex and its role regarding hypertext. Could we carry out the Memex task today? How do you use Memex ideas in your own work?

    5.

    Discover a current media input, storage, or playback device that is analog. Is it necessary to convert it to digital? What are the pros and cons of analog versus digital?

    6.

    Your task is to think about the transmission of smell over the Internet. Suppose we have a smell sensor at one location and wish to transmit the Aroma Vector (say) to a receiver to reproduce the same sensation. You are asked to design such a system. List three key issues to consider and two applications of such a delivery system. Hint: Think about medical applications.

    7.

    Tracking objects or people can be done by both sight and sound. While vision systems are precise, they are relatively expensive; on the other hand, a pair of microphones can detect a person’s bearing inaccurately but cheaply. Sensor fusion of sound and vision is thus useful. Surf the Web to find out who is developing tools for video conferencing using this kind of multimedia idea.

    8.

    Non-photorealistic graphics means computer graphics that do well enough without attempting to make images that look like camera images. An example is conferencing. For example, if we track lip movements, we can generate the right animation to fit our face. If we do not much like our own face, we can substitute another one—facial-feature modeling can map correct lip movements onto another model. See if you can find out who is carrying out research on generating avatars to represent conference participants’ bodies.

    9.

    Watermarking is a means of embedding a hidden message in data. This could have important legal implications: Is this image copied? Is this image doctored? Who took it? Where? Think of messages that could be sensed while capturing an image and secretly embedded in the image, so as to answer these questions. (A similar question derives from the use of cell phones. What could we use to determine who is putting this phone to use, and where, and when? This could eliminate the need for passwords or others using the phone you lost.)

    References

    1.

    B. Newhall, The History of Photography: From 1839 to the Present, The Museum of Modern Art (1982)

    2.

    T. Gustavson, G. Eastman House, Camera: A History of Photography from Daguerreotype to Digital (Sterling Signature, New York, 2012)

    3.

    A. Koenigsberg, The Patent History of the Phonograph, 1877–1912 (APM Press, Englewood, 1991)

    4.

    L.M. David Jr., Sound Recording: The Life Story of a Technology (Johns Hopkins University Press, Baltimore, 2006)

    5.

    Q.D. Bowers, K. Fuller-Seeley, One Thousand Nights at the Movies: An Illustrated History of Motion Pictures, 1895–1915 (Whitman Publishing, Atlanta, 2012)

    6.

    T.K. Sarkar, R. Mailloux, A.O. Arthur, M. Salazar-Palma, D.L. Sengupta, History of Wireless (Wiley-IEEE Press, Hoboken, 2006)

    7.

    M. Hilmes, J. Jacobs, The Television History Book (Television, Media and Cultural Studies) (British Film Institute, London, 2008)

    8.

    N. Yankelovitch, N. Meyrowitz, A. van Dam, Reading and writing the electronic book, in Hypermedia and Literary Studies, ed. by P. Delany, G.P. Landow (MIT Press, Cambridge, 1991)

    9.

    V. Bush, As We May Think. The Atlantic Monthly (1945)

    10.

    D. Engelbart, H. Lehtman, Working Together. BYTE Magazine, pp. 245–252 (1988)

    11.

    J. Duckett, HTML and CSS: Design and Build Websites (Wiley, Hoboken, 2011)

    12.

    K. Nahrstedt, R. Lienhart, M. Slaney, Special issue on the 20th anniversary of ACM SIGMM. ACM Trans. Multimedia Comput. Commun. Appl. (TOMCCAP), (2013)

    13.

    A.D. Bagdanov, A.D. Bimbo, F. Dini, G. Lisanti, I. Masi, Posterity logging of face imagery for video surveillance. IEEE Multimedia 19(4), 48–59 (2012)

    Footnotes

1. Reginald A. Fessenden, of Quebec, beat Marconi to human voice transmission by several years, but not all inventors receive due credit. Nevertheless, Fessenden was paid $2.5 million in 1928 for his purloined patents.

2. See the accompanying website for several interesting uses of software tools. In a typical computer science course in multimedia, the tools described here might be used to create a small multimedia production as a first assignment. Some of the tools are powerful enough that they might also form part of a course project.

3. Therefore, Director is often a viable choice with students for creating a final project in multimedia courses: it provides the desired power without the inevitable pain of using a full-blown C++ program. The competing technology is likely ActionScript in Flash.


    © Springer International Publishing Switzerland 2014

    2. A Taste of Multimedia

    Ze-Nian Li¹  , Mark S. Drew¹   and Jiangchuan Liu¹  

(1) Simon Fraser University, Vancouver, BC, Canada

    Ze-Nian Li (Corresponding author)

    Email: li@cs.sfu.ca

    Mark S. Drew

    Email: mark@cs.sfu.ca

    Jiangchuan Liu

    Email: jcliu@cs.sfu.ca

    Abstract

In Chapter 2, we introduce a set of tasks and concerns that are considered in studying multimedia, from the point of view of a technically comfortable reader. When it comes to multimedia production and presentation, the issues of graphics styles and fonts are discussed, with some surprising conclusions. To provide a further taste of multimedia, we show how simple animations may proceed. To round out the discussion of such tasks, we consider a "build-your-own" video transition problem, where the intent is to generate one's own video transition. We then go on to review the current and future state of multimedia sharing and distribution, outlining later discussions of social media, video sharing, and new forms of TV. Finally, the details of some popular multimedia tools are set out for a quick start into the field.

    2.1 Multimedia Tasks and Concerns

Multimedia content is ubiquitous in software all around us, including in our phones, of course. We are interested in this subject from a computer science and engineering point of view, and we are also interested in making interactive applications (or "presentations"), using video editors such as Adobe Premiere or Cyberlink PowerDirector and still-image editors such as Adobe Photoshop in the first instance, but then combining the resulting resources into interactive programs by making use of "authoring" tools such as Flash and Director that can include sophisticated programming. Multimedia often generates problems and considerations that have a more general computer science flavor. For example, most cameras now are smart enough to find faces (with reasonable success), yet until recently such a task was firmly in the domain of Computer Vision, a branch of Artificial Intelligence concerned with trying to understand image content. Such basic concerns do impact multimedia as it now appears in products, and will tend to increasingly influence the field.

Continuing in the Computer Vision direction, a camera owner might be encouraged to think like a computer scientist and ask, "What is going on in an image?" A less high-level question is "Where has this image been taken?" (scene recognition), or "Does the image contain a particular object?" (object classification). A still quite difficult question is "Where is an object of interest?" (object detection). And a lower level question might be "Which object does each pixel belong to?" (image segmentation). Thus it does not take long before we find ourselves fully engaged in a classic Computer Vision hierarchy, from high-level to detailed description of an image, with scene recognition at the top and image segmentation at the bottom.

    In this text, we take a moderate approach to difficulty level, and do not presume to answer such sophisticated questions as those posed above. Nonetheless, studying the fundamentals of the multimedia problem is indeed a fruitful concern and our aim in the book is to give readers the tools they would need to eventually tackle such difficult questions, for example in a work situation.

    2.2 Multimedia Presentation

    In this section, we briefly outline some effects to keep in mind for presenting multimedia content as well as some useful guidelines for content design [1, 2].

    Graphics Styles

Careful thought has gone into combinations of color schemes and how lettering is perceived in a presentation. Many presentations are meant for projection onto large business displays, rather than for viewing on a screen close to the eye. Human visual dynamics must therefore be considered in how such presentations are constructed. Most of the observations here are drawn from Vetter et al. [3], as is Fig. 2.1.

    Color Principles and Guidelines

    Some color schemes and art styles are best combined with a certain theme or style. Color schemes could be, for example, natural and floral for outdoor scenes and solid colors for indoor scenes. Examples of art styles are oil paints, watercolors, colored pencils, and pastels.

    A general hint is to not use too many colors, as this can be distracting. It helps to be consistent with the use of color—then color can be used to signal changes in theme.

    Fonts

For effective visual communication, large fonts (18 to 36 points) are best, with no more than six to eight lines per screen. As shown in Fig. 2.1, sans serif fonts work better than serif fonts (serif fonts are those with short lines stemming from, and at an angle to, the upper and lower ends of a letter's strokes). Figure 2.1 shows a comparison of two screen projections (Figs. 2 and 3 from Vetter et al. [3]).

Fig. 2.1 Colors and fonts. Courtesy of Ron Vetter

The top image in Fig. 2.1 shows good use of color and fonts: it has a consistent color scheme and uses large, all sans serif (Arial) fonts. The bottom image is poor: too many colors are used, and they are inconsistent. The red adjacent to the blue is hard to focus on, because the human retina cannot focus on these colors simultaneously. The serif (Times New Roman) font is said to be hard to read in a darkened projection setting. Finally, the lower right panel does not have enough contrast; pretty pastel colors are often usable only if their background is sufficiently different.

    A Color Contrast Program

Seeing the results of Vetter et al.'s research, we constructed a small Visual Basic program¹ to investigate how the readability of text depends on the text color and the color of the background.

    The simplest approach to making readable colors on a screen is to use the principal complementary color as the background for text. For color values in the range 0–1 (or, effectively, 0–255), if the text color is some triple (Red, Green, Blue), or (R, G, B) for short, a legible color for the background is likely given by that color subtracted from the maximum:

$$(R,G,B) \;\Rightarrow \; (1-R, 1-G, 1-B) \qquad (2.1)$$

    That is, not only is the color opposite in some sense (not the same sense as artists use), but if the text is bright, the background is dark, and vice versa.
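To make Eq. (2.1) concrete, here is a minimal Python sketch of the same computation (our own illustration, not the book's Visual Basic program; the function name and the 8-bit channel convention are assumptions):

    def complementary(rgb):
        # Principal complementary color per Eq. (2.1), with channels
        # in 0-255 so that 255 plays the role of the maximum 1.
        r, g, b = rgb
        return (255 - r, 255 - g, 255 - b)

    # A bright yellow background yields dark blue text:
    print(complementary((255, 255, 0)))  # -> (0, 0, 255)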

Fig. 2.2 Program to investigate colors and readability

Fig. 2.3 Color wheel

In the Visual Basic program given, sliders can be used to change the background color. As the background changes, the text changes to equal the principal complementary color. Clicking on the background brings up a color picker as an alternative to the sliders.

    If you feel you can choose a better color combination, click on the text. This brings up a color picker not tied to the background color, so you can experiment. (The text itself can also be edited.) A little experimentation shows that some color combinations are more pleasing than others—for example, a pink background and forest green foreground, or a green background and mauve foreground. Figure 2.2 shows this small program in operation.

Figure 2.3 shows a color wheel, with opposite colors equal to $$(1-R, 1-G, 1-B)$$. An artist's color wheel will not look the same, as it is based on feel rather than on an algorithm. In the traditional artist's wheel, for example, yellow is opposite magenta instead of opposite blue as in Fig. 2.3, and blue is instead opposite orange.

    Sprite Animation

Sprites are often used in animation. For example, in Adobe Director (formerly Macromedia Director), the notion of a sprite is expanded to an instantiation of any resource. However, the basic idea of sprite animation is simple. Suppose we have produced an animation figure, as in Fig. 2.4a. Then it is a simple matter to create a 1-bit mask $$M$$, as in Fig. 2.4b, black on white, and the accompanying sprite $$S$$, as in Fig. 2.4c.

Fig. 2.4 Sprite creation: a original; b mask image $$M$$; and c sprite $$S$$. Duke figure courtesy of Sun Microsystems

Now we can overlay the sprite on a colored background $$B$$, as in Fig. 2.5a, by first ANDing $$B$$ and $$M$$, then ORing the result with $$S$$, with the final result as in Fig. 2.5e. Operations are available to carry out these simple compositing manipulations at frame rate, and so produce a simple 2D animation that moves the sprite around the frame but does not change the way it looks.
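In code, this compositing is a one-liner. The following NumPy sketch is our own illustration, assuming the mask convention of Fig. 2.4 (uint8 images, with $$M$$ equal to 0 inside the sprite silhouette and 255 outside, and $$S$$ black outside the silhouette):

    import numpy as np

    def overlay_sprite(B, M, S):
        # ANDing B with M punches a black hole in the background where
        # the sprite will go; ORing with S then drops the sprite in.
        return (B & M) | S

Shifting where $$M$$ and $$S$$ are placed in each successive frame then moves the sprite around the background.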

Fig. 2.5 Sprite animation: a background $$B$$; b mask $$M$$; c $$B$$ AND $$M$$; d sprite $$S$$; e ($$B$$ AND $$M$$) OR $$S$$

    Video Transitions

Video transitions can be an effective way to indicate a change to the next section. Video transitions are syntactic means to signal scene changes and often carry semantic meaning. Many different types of transitions exist; the main types are cuts, wipes, dissolves, fade-ins, and fade-outs.
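As a first taste of building such a transition oneself, a dissolve can be computed as a per-pixel blend of the outgoing and incoming frames. The sketch below is a minimal Python/NumPy illustration of this idea (the function name is ours), assuming two aligned 8-bit frames of the same size:

    import numpy as np

    def dissolve(frame_a, frame_b, t):
        # t ramps from 0.0 to 1.0 over the transition, so the output
        # moves smoothly from frame_a to frame_b.
        a = frame_a.astype(np.float32)
        b = frame_b.astype(np.float32)
        return ((1.0 - t) * a + t * b).astype(np.uint8)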
