
Big Data Analytics for Cyber-Physical Systems: Machine Learning for the Internet of Things

Ebook · 727 pages · 6 hours

About this ebook

Big Data Analytics in Cyber-Physical Systems: Machine Learning for the Internet of Things examines sensor signal processing, IoT gateways, optimization and decision-making, intelligent mobility, and implementation of machine learning algorithms in embedded systems. This book focuses on the interaction between IoT technology and the mathematical tools used to evaluate the extracted data of those systems. Each chapter provides the reader with a broad list of data analytics and machine learning methods for multiple IoT applications. Additionally, this volume addresses the educational transfer needed to incorporate these technologies into our society by examining new platforms for IoT in schools, new courses and concepts for universities and adult education on IoT and data science.
  • Bridges the gap between IoT, CPS, and mathematical modelling
  • Features numerous use cases that discuss how concepts are applied in different domains and applications
  • Provides "best practices", "winning stories" and "real-world examples" to complement innovation
  • Includes highlights of mathematical foundations of signal processing and machine learning in CPS and IoT
Language: English
Release date: Jul 15, 2019
ISBN: 9780128166468


    Book preview

    Big Data Analytics for Cyber-Physical Systems - Guido Dartmann


    Introduction

    Cyber-physical systems (CPS) and the Internet of things (IoT) are developing rapidly, and this technology is now transforming our economy and society. The key features of this disruptive technological revolution are smart algorithms based on data science. In the last decade, progress was driven mainly by network concepts, embedded systems, and cloud technology. Now we are entering a new era that exploits the availability of artificial intelligence (AI) as a key technology for CPS. AI enables systems to make decisions based on measured data and to transform data into new business ideas. A key to success in the design of new AI-based ideas is the interplay of novel applications and mathematical methods. This book addresses technological advances in machine learning, data science, and optimization in combination with applications in IoT and CPS, for example, mobility, industry, environmental systems, and medicine. This includes fundamentals of (sensor) signal processing together with data analytics and machine learning (e.g., smart sensors and IoT gateways), optimization and decision-making in smart systems (e.g., intelligent mobility), and the implementation of new machine learning algorithms in embedded systems.

    These skills will become a central tool for the qualification of future engineers. In this book, we first introduce the fundamentals of data analytics and machine learning. Then, we present hardware platform aspects and applications in IoT, and finally, we discuss future demands in education for big data analytics in CPS.

    To introduce basic concepts of data science, Chapters 1 and 2 of the book present fundamentals of data analytics, statistics, and processing platforms in CPS. Chapter 3 investigates an application where clustering techniques are used for object detection in smart cities.

    To integrate smart IoT into our industry, knowledge of machine learning and data science needs to be combined with expertise in networks and embedded systems. In particular, IoT requires secure regional network platforms to provide a variety of IoT services; these are presented in Chapter 4.

    For the pervasive establishment of smart CPS in our economy, new algorithms have to be developed in combination with hardware components. Chapter 5 presents inference techniques for IoT applied in a complete IoT software and hardware framework embedded in a smart city infrastructure. Furthermore, efficient, real-time capable hardware is essential for applying new machine learning techniques in autonomous driving cars. Therefore, Chapter 6 presents new aspects of the design of heterogeneous hardware platforms for autonomous driving. Finally, Chapter 7 presents an overview of AI-based sensor platforms for smart cities and gives a broad view of how the different aspects (sensors, gateways, cloud, communication standards, actuators, and algorithms) work together to establish a smart IoT system. It further gives an overview of different IEEE standards and the need for future standardization for IoT.

    The next part of the book shows how data analytics and machine learning can solve different challenges such as energy saving, autonomous driving, air quality, and public health. In Chapter 8, machine learning tools are used to predict the energy consumption in buildings.

    Chapter 9 presents concepts of reinforcement learning for autonomous driving and gives an overview of a possible simulation framework in which AI algorithms for autonomous driving can be evaluated. In IoT, the localization of sensors and agents is an important aspect. Chapter 10 presents an evolutionary algorithm for the localization of sensory agents for infrastructure monitoring. AI can also be used to warn people in smart cities of dangerous gases or to monitor air quality. Progress in the design of gas sensors allows a low-cost artificial nose, which is presented in Chapter 11 in combination with machine learning techniques to classify different gases.

    Besides the environment, traffic, and autonomous driving, machine learning can revolutionize the future health system. In this book, Chapter 12 presents how basic algorithms based on continuous-time Markov chains can be used to classify different types of patients and diseases.

    AI and machine learning also pose new risks: as these algorithms become more and more powerful, they can easily find patterns in user data. This poses a considerable risk to people's privacy. Therefore, Chapter 13 presents the societal aspects of citizens in future urban environments, and Chapter 14 presents the theoretical foundations of metrics to quantify privacy in communication systems.

    Despite these technological advances, new concepts of education, especially in the fields of machine learning and data science, are required as well. This book therefore also addresses new concepts of education to transfer the described technology to our society. In particular, small and midsize companies need qualified employees to create new business models with IoT applications. People with knowledge of data analytics and machine learning together with practical experience in IoT and CPS are rare, and those who have this knowledge may prefer to apply to big players rather than to classical companies in mechanical engineering or other domains with interfaces to novel IoT technology. Therefore, Chapters 15 and 16 present concepts and blueprints for how this technology can be successfully integrated into our education systems.

    Chapter 1

    Data analytics and processing platforms in CPS

    Claudia Chitu⁎; Houbing Song†    ⁎ Faculty of Automatic Control and Computer Science, University Politehnica of Bucharest, Bucharest, Romania

    † Department of Electrical, Computer, Software and Systems Engineering, Embry-Riddle Aeronautical University, Daytona Beach, FL, United States

    Abstract

    The speed of new developments in IoT and CPS poses new challenges for data scientists and business owners seeking smarter insights, demanding real-time dashboards of information extracted from data in motion. Businesses are built on top of Big Data analytics, and event prediction returns a high percentage of revenue. Since many business areas can benefit from it, Big Data analytics as a research topic faces multiple challenges: a fundamental understanding of models, architectures, security, and privacy, but also the data science skills and mentality to accommodate big data-driven decisions. These challenges arise from issues such as inference from multiple data sources, observation measuring, missing events, surrogate variables, and incomplete information. This chapter therefore aims to present a broad overview of the most common methods and techniques, including data processing platforms, used for analytics applied to large volumes of unstructured, semistructured, and structured data arriving with high velocity. Moreover, the tutorial character of this material helps the reader develop capabilities in analyzing small data sets, to be followed up with massive amounts of data.

    Keywords

    Analytics; Dashboard; Machine Learning; Processing

    1 Open source versus proprietary software

    Cyber-physical systems (CPSs) are taking advantage of and growing with improvements in smart manufacturing industries and intelligent services, in which a key role is played by the evolution of Big Data. This evolution brings challenges and new trends in analyzing data (Yin and Kaynak, 2015). Historical data is examined with analytics tools and modeled for prediction, while actionable intelligence is extracted from information systems. Because of its high importance in business, there are many players on the market that automatically deliver data collection, cleansing, and analysis in near real time, and even predictions. Big Data ingested from thousands of robots, machines, customers, and combined information systems is turned into rewarding outcomes with predictive analytics. In CPS, analytics is a fundamental component and carries high weight in the decision and control process: from prediction to the incorporation of dynamics, analytics serves as a framework for identifying capability gaps and as a roadmap for opportunities to improve quality. Popular techniques such as the support vector machine (SVM) have been rewritten to take advantage of parallel computation and server farms that can grow organically. In this way, the infrastructure supports larger volumes of data coming from a very dynamic system. Proper data analytics based on near real-time big data streams makes it possible for a digital twin to optimize the working conditions, mode of operation, consumption, and maintenance of physical systems and manufacturing. As stated in Khaitan and McCalley (2015), CPSs have broad applications in many domains: vehicular systems and transportation, medical and health-care systems, smart homes and buildings (Shih et al., 2016), scheduling, thermal management, power grid systems, industrial process control, aerospace and air traffic management, etc.
Thus, the broad area of CPS applications demands extensive expertise in statistics and data analytics. This chapter presents a comprehensive list of the most widely used data analytics tools, with some hands-on examples. The knowledge transmitted within this chapter is designed to create a good picture of the fundamentals of statistics, developing the capability to understand software results and implement algorithms, to be followed up with larger and more complex data sets. One of the most challenging issues in CPS is data reliability; it is crucial to understand the correctness of collected observations as well as data validity. For instance, in Liu et al. (2017), based on big data analytics of the spatial distribution characteristics of location data loss events, the authors proposed novel data-driven methodologies to increase data validity in transportation systems for smart cities, and there are more similar applications for smart and interconnected communities (Kambatla et al., 2014; Rathore et al., 2016; Sun et al., 2016). As data complexity grows and the available techniques and software have to process more data, produce more insights, and generate decisions, we present in the first table open source and freeware software versus proprietary tools used to analyze and predict data.

    Table 1 presents only a few of the tools used in industry and academia to analyze and predict outcomes from billions of intelligent devices. When applied to near real-time industrial internet data streams, big data analytics eases the detection of critical failures, supports the exploration of challenging anomalies, and provides predictive maintenance alerts. More details about some of the previously mentioned tools can be found in Research (2018). Besides demanding special processing techniques, large data sets also need a special infrastructure, usually relying on cloud storage and resources. Kiran et al. (2015) present a general architecture for online analysis applied to big data sets to unlock previously unknown behaviors, using a data-handling back-end on Amazon EC2. Certainly, Big Data is an immense source of knowledge and information about systems, situations, and opportunities. Moreover, a complex computing stack architecture is required to enable processing of data at such a scale (Fox et al., 2015). Smart cities can benefit from real-time data collection, data processing, and visualization in a cloud-based data analysis service for information intelligence and to support decision making (Khan et al., 2013).

    Table 1

    2 Data types

    A first step in the journey of understanding data is to distinguish the data types. This is important when implementing machine learning (ML) algorithms, in order to use an implementation properly. Variables can be of categorical or numerical type. Numerical data are measurements or counts and can be of two kinds: continuous (any value, e.g., temperature, height, etc.) and discrete (integer values, e.g., the number of car accidents in a city in 1 year, the number of failures of air conditioning equipment, etc.). The complementary data type is the categorical one, which is divided into two subcategories: ordinal (with an obvious order: A, B, C, etc.) and nominal (no meaningful order, e.g., gender, color, etc.).
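    As a toy illustration, this four-way distinction can be sketched in Python (an illustrative sketch, not from the chapter; the helper function and its rule-of-thumb classification are our own assumptions):

```python
# Illustrative sketch (not from the chapter): tagging example CPS variables
# with the data types described above. The helper and its rules are our own.

def variable_kind(values, ordered=False):
    """Classify a variable as the chapter does: numerical (continuous or
    discrete) versus categorical (ordinal or nominal)."""
    if all(isinstance(v, (int, float)) and not isinstance(v, bool) for v in values):
        # Numerical: all-integer counts are discrete, measurements continuous
        return "discrete" if all(isinstance(v, int) for v in values) else "continuous"
    # Categorical: an explicit order makes it ordinal, otherwise nominal
    return "ordinal" if ordered else "nominal"

print(variable_kind([21.5, 22.1, 19.8]))             # temperature -> continuous
print(variable_kind([0, 2, 1, 3]))                   # accident counts -> discrete
print(variable_kind(["A", "B", "C"], ordered=True))  # grades -> ordinal
print(variable_kind(["red", "blue"]))                # colors -> nominal
```
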

    Considering these variable data types, there are a few categories of data analysis:

    1. Qualitative analysis (examination of nonquantifiable data, widely used in environmental chemistry or the oil industry, for instance)

    2. Quantitative analysis (statistical, mathematical, or numerical analysis applied to objective measurements)

    3. Spatial analysis (analysis of the location of objects or phenomena being observed, e.g., analyzing data on a map)

    4. Hierarchical analysis (data with parent-child relationships)

    5. Graph analysis (analysis of relations between objects)

    6. Textual data analysis (finding patterns and word usage in documents and text-based data sources)

    3 Easy data visualization using code

    As an introduction to data analytics, an image of how the data looks can be very helpful, especially if the volume of data is high. For example, using an open data set from ITU (2015) on the percentage of individuals using the internet, selecting only the values collected for 2016, and adding the country codes and coordinates of countries from Tamosauskas (2018), we created a map with bubble plots to see how the data is distributed across the world, and where the values are higher, using different gradients of blue (different gradients of gray in print versions; Fig. 1).

    Fig. 1 Distribution of data for internet usage percentage around the globe in 2016.

    The R code used to create the previous figure is shown in the following code snippet:

    library(ggplot2)
    library(dplyr)
    dataset <- read.csv('C:/Users/Claudia/Desktop/individuals_using_internet.csv',
                        header = TRUE, sep = ',')
    ggplot(data = dataset) +
      borders(database = 'world', colour = 'grey60', fill = 'grey90') +
      geom_point(aes(y = lat_avg, x = long_Avg, size = percent, color = percent)) +
      scale_size_area(max_size = 1) +
      ggtitle('Percentage of individuals using internet in 2016') +
      xlab('') + ylab('') +
      labs(color = 'percent of individuals') +
      theme(panel.background = element_blank(),
            axis.title.x = element_blank(), axis.text.x = element_blank(),
            axis.ticks.x = element_blank(),
            axis.title.y = element_blank(), axis.text.y = element_blank(),
            axis.ticks.y = element_blank())

    Similar graphics can be created in Python using the dedicated geopandas package, or plotly for creating choropleth maps (Plotly, 2018). For plotting geographical maps, R implements the dplyr and ggplot2 packages with dedicated functions for geographical representation. We refer to the R and Python programming languages because they are very popular among developers and data scientists, and several exploration and data visualization tools, such as VisIt, CDAT, and VisTrails, have been built using Python (Anderson et al., 2010). The importance of the Python programming language in solving problems that involve large data sets in different formats and computational systems has been highlighted since 2010 (Perez et al., 2011). From the same category of data visualization techniques comes the choropleth map, a thematic map in which areas are colored according to population density or the measurement of the statistical variable shown on the map. Although this type of map has some limitations, such as the difficulty of making comparisons or ranking countries by looking at the map alone, it is very popular because it is easily understood by the audience. Choropleth maps are indicated when the data set has a continuous statistical surface (the phenomenon can be measured, and data collected, anywhere on the map, even for an entire country) and the data is standardized to show percentages or ratios and illustrate relative differences. This type of representation is becoming more and more frequent in discussions at a global data scale as society is transformed by interconnected technologies, devices, and machines.
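    Independent of the plotting library, the data-preparation step described above (joining usage percentages with country coordinates by country code) can be sketched in plain Python. This is illustrative only; the country codes, sample values, and column names are hypothetical:

```python
# Illustrative sketch of the join described above: combining internet-usage
# percentages with country coordinates by country code before plotting.
# Sample values and column names are made up for illustration.

usage_2016 = {"USA": 76.2, "DEU": 84.4, "ROU": 59.5}   # percent using internet
coords = {
    "USA": (39.8, -98.6),   # (lat_avg, long_avg)
    "DEU": (51.2, 10.4),
    "ROU": (45.9, 24.9),
}

# Inner join on country code, keeping only countries present in both tables
rows = [
    {"code": c, "percent": p, "lat_avg": coords[c][0], "long_avg": coords[c][1]}
    for c, p in usage_2016.items()
    if c in coords
]

# The joined rows are what a plotting call (bubble or choropleth) consumes
for r in sorted(rows, key=lambda r: r["percent"], reverse=True):
    print(r["code"], r["percent"])
```
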

    4 Statistical measurements in CPS data

    Analyzing your data starts with a profile: briefly summarizing, or even charting, the data to better see patterns and outliers. The mean and standard deviation fit well for continuous, normally distributed data, while the median (middle value) and interquartile range (IQR) are more suitable for skewed data sets. A big difference between the mean and the median is an indicator of skewed data. Using RStudio, these statistics can be obtained in a few ways: the summary command gives a fuller picture, and the functions mean, median, and sd (standard deviation) can be used individually. We explore them in the following exercise. First, load a data set on which to apply these functions and inspect the results. Download a free data set to your desktop from Arel-Bundock (2018). For this example, the amis data set is used: a data set of 8437 rows and 4 columns called Car Speeding and Warning Signs. The data frame contains data from a study conducted by the Cambridgeshire County Council on locations chosen to account for factors such as traffic volume and type of road. The study observes the effect warning signs have on speeding patterns. Speed measurements were taken before the erection of a sign, shortly after sign placement, and a third time after the sign had been in place for a while. These measurements correspond to the column called period, which takes the values 1, 2, and 3, respectively, for each of the speed measurements. The other columns are: speed of cars (miles/h), warning (1 or 2, indicating whether a warning sign was present), and pair (from 1 to 14, corresponding to the 14 locations). To load the data, follow the next step; then use the commands to find the summary shown in Fig. 2.

    Fig. 2 Summary results in R studio for car speeding and warning signs data set.

    The median is commonly used to measure the properties of a data set and can be more informative than the mean (average), giving a better idea of a typical value. The mean can be skewed by a small number of extremely high values, whereas the median better suggests what a typical value is. It is therefore often better to use the median as a measure of central tendency, since it is not much affected by extreme values. The mean value is expressed as

       x̄ = (1/N) ∑_{i=1}^{N} x_i    (1)

    Standard deviation is a measure quantifying the amount of variation in a data set. A low value indicates that the data points tend to be close to the mean, and it is used to measure confidence in statistical conclusions.

       s = √( (1/(N − 1)) ∑_{i=1}^{N} (x_i − x̄)² )    (2)

    with N being the number of observations and x̄ their mean. This is called the sample standard deviation, and it is useful for showing how the results obtained in a study generalize, in contrast with the population standard deviation, which, as its name suggests, is applied to a whole population.
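    These three statistics can be computed directly with Python's standard library (an illustrative sketch, not from the chapter; the speed values are made up):

```python
# Illustrative sketch (not from the chapter): mean, median, and sample
# standard deviation, showing how one extreme value skews the mean while
# barely moving the median. Speeds are made-up values in miles/h.
import statistics

speeds = [28, 30, 31, 29, 30, 32, 30, 95]  # one outlier at 95 mph

mean = statistics.mean(speeds)
median = statistics.median(speeds)
s = statistics.stdev(speeds)  # sample standard deviation, divides by N - 1

print(f"mean={mean:.3f} median={median:.1f} sample sd={s:.2f}")
# The outlier pulls the mean well above the median, flagging skewed data.
```
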

    5 Statistical methods, models, and techniques: Brief introduction

    Regardless of where the border between statistics and ML lies, very powerful methods in the field are: linear regression for predicting a target variable, classification techniques for assigning categories to collections of data, subset selection for identifying a small number of features that explain the target response, and dimension reduction for data sets with more than about 10 dimensions, usually applied a priori, before the KNN algorithm. A classical technique for dimension reduction is principal component analysis (PCA), often applied to sparse data (Franke et al., 2016).
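    As a minimal illustration of PCA (not from the chapter; the toy two-feature data set is our own construction), the principal components can be obtained from the eigendecomposition of the data's covariance matrix:

```python
# Illustrative PCA sketch (not from the chapter): the principal directions
# are the eigenvectors of the covariance matrix. The toy data set is made
# up; real use would start from measured CPS features.
import numpy as np

rng = np.random.default_rng(0)
# 200 samples of 2 strongly correlated features (e.g., two related sensors)
x = rng.normal(size=200)
data = np.column_stack([x, 2.0 * x + 0.1 * rng.normal(size=200)])

centered = data - data.mean(axis=0)             # PCA works on centered data
cov = np.cov(centered, rowvar=False)
eigvals, eigvecs = np.linalg.eigh(cov)          # eigh: ascending eigenvalues
order = np.argsort(eigvals)[::-1]               # reorder descending
explained = eigvals[order] / eigvals.sum()      # variance explained per component

print("variance explained per component:", np.round(explained, 3))
# With this construction, the first component captures nearly all variance.
```
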

    Regression is a statistical method that describes a statistical relationship, modeled by a mathematical function, between a predictor variable and a response variable. Statistical modeling is the second step in a data analysis cycle, the first being exploratory analysis. Models for this stage of the analysis life cycle are built with supervised or unsupervised techniques, depending on the situation. The output of this stage is reporting and visualization; the goal is to transfer information to decision makers. A second option in this phase is a backward path, identifying new data to be fed into the system to complement the existing data. A more complete presentation of learning algorithms and stochastic techniques, in terms of accuracy, speed of learning, and risks of over-fitting, is given in Singh et al. (2016). Methods can be understood as systems of concepts and procedures that together realize certain insights, while techniques are the practical approaches used to implement these methods.

    Methods, techniques, and models are used to develop and implement solutions in a large spectrum of domains, combining statistical knowledge with learning algorithms. Moreover, the most complicated situations challenge experts to apply this set of tools to solve critical situations in real-time systems, CPS, and the systems derived from them, as illustrated in Wang et al. (2017) and Yavanoglu and Aydos (2017).

    6 Analytics and statistics versus ML techniques

    Deciding on appropriate statistical methods for research implies first defining the types of measurements (variables) and then the relationships between them (dependent vs independent variables). Analysis of scale or binary independent variables can be done with regressions. Before going into more detail on regressions, we present in Table 2 a list of statistical methods, models, and techniques, and a list of ML techniques suitable for exploitation in Big Data analysis from CPS.

    Table 2

    ML focuses intensively on prediction, learning (supervised and unsupervised), and computational methodology, while statistical analysis explores design, sampling, estimation, regression, and classification more deeply than ML does. The process of using many modeling methods from statistics and ML to best predict the probability of an outcome (failure of a robot, maintenance of a train/machine/car, etc.), with phases of implementing, training, testing, and validating a model, is called a predictive modeling process.
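    A minimal sketch of that predictive modeling loop follows (illustrative only; the made-up failure data and the trivial majority-class "model" stand in for a real learner):

```python
# Illustrative sketch (not from the chapter): the train/test phases of a
# predictive modeling process, with made-up failure labels and a trivial
# majority-class predictor standing in for a real ML model.
import random

random.seed(42)
# Toy labeled data: 1 = machine failure, 0 = normal operation
data = [(i, 1 if i % 5 == 0 else 0) for i in range(100)]

random.shuffle(data)
split = int(0.8 * len(data))
train, test = data[:split], data[split:]  # 80/20 holdout split

# "Train": predict the majority class observed in the training data
majority = round(sum(label for _, label in train) / len(train))

# "Test": measure accuracy on held-out data the model never saw
accuracy = sum(1 for _, label in test if label == majority) / len(test)
print(f"majority class={majority}, holdout accuracy={accuracy:.2f}")
```
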

    Good practices of statistical modeling and thinking are the fundamentals of Big Data projects; high-quality data, correct relationship modeling, and the right algorithms and strategies are the keys to success in the petabyte age. Many relationships in the world are, or tend to be, linear; so linear regression is a very powerful tool for building exploratory models and predicting relationships between variables. A proper example is presented in Flynn et al. (2009), where the authors use regression as a tool to perform a comparative analysis exploring the performance capability of measurement systems. The participants in a linear regression are called the predictor variable and the response variable, x and y, where x is an independent variable and y is a dependent one. The independent variable is the one manipulated during the experiment in order to observe the behavior of the outcome, y. They are also called the exploratory variable, independent variable, regressor/risk factor, or feature/attribute, and, respectively, the dependent variable or regressand. In computer science, x is very often referred to as a feature or
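    The simple linear regression just described can be sketched in a few lines of Python (an illustrative sketch using the standard closed-form least-squares estimates; the data points are made up):

```python
# Illustrative sketch (not from the chapter): fitting the simple linear
# regression y = a + b*x by ordinary least squares, using the closed-form
# estimates b = cov(x, y)/var(x) and a = mean(y) - b*mean(x).
def fit_line(xs, ys):
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    # Slope: sample covariance over sample variance of the predictor
    b = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum((x - mx) ** 2 for x in xs)
    a = my - b * mx  # intercept, so the line passes through (mx, my)
    return a, b

# Noise-free example: points on y = 1 + 2x are recovered exactly
a, b = fit_line([0, 1, 2, 3], [1, 3, 5, 7])
print(a, b)  # → 1.0 2.0
```
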
