Measuring the Data Universe: Data Integration Using Statistical Data and Metadata Exchange
By Reinhold Stahl and Patricia Staab
About this ebook
This richly illustrated book provides an easy-to-read introduction to the challenges of organizing and integrating modern data worlds, explaining the contribution of public statistics and the ISO standard SDMX (Statistical Data and Metadata Exchange). As such, it is a must for data experts as well as those aspiring to become one.
Today, exponentially growing data worlds increasingly determine our professional and private lives. The rapid increase in the amount of globally available data, fueled by search engines and social networks but also by new technical possibilities such as Big Data, offers great opportunities. But whatever the undertaking – driving the blockchain revolution or making smartphones even smarter – success will be determined by how well it is possible to integrate, i.e. to collect, link and evaluate, the required data. One crucial factor in this is the introduction of a cross-domain order system in combination with a standardization of the data structure.
Using everyday examples, the authors show how the concepts of statistics provide the basis for the universal and standardized presentation of any kind of information. They also introduce the international statistics standard SDMX, describing the profound changes it has made possible and the related order system for the international statistics community.
Book preview
Part I: Creating Comprehensive Data Worlds Using Standardisation
© Springer International Publishing AG, part of Springer Nature 2018
Reinhold Stahl and Patricia Staab, Measuring the Data Universe, https://doi.org/10.1007/978-3-319-76989-9_1
1. Where We Stand, Where We Want to Be, and How to Get There
Reinhold Stahl¹ and Patricia Staab²
(1) Dornburg, Germany
(2) Frankfurt, Germany
Reinhold Stahl (Corresponding author)
Patricia Staab
Abstract
The data available to us all over the world are multiplying rapidly. Our fixation on these data is increasing accordingly and drives the demand for the collection of more and more granular data.
Companies are increasingly aware that they are sitting on an underestimated treasure of data. But most of it is stored in separate data silos. Therefore, many organisations are making major efforts to integrate data, to link the treasures hidden in the silos and to create a high-quality data world.
This integration requires an order system, that is, a classification standard for data, to make things fit together. The international statistics community uses the data standard SDMX (Statistical Data and Metadata Exchange) intensively to define data structures for any kind of phenomena and, based on them, to develop data exchange processes, data collections and data analysis tools. We are convinced that SDMX can form the basis of a comprehensive, orderly and standardised data world in other areas as well.
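The core idea of such a data structure standard can be sketched in miniature. The following Python sketch is emphatically not SDMX itself (SDMX structures are defined in SDMX-ML and SDMX-JSON and maintained by agencies); the code lists and field names here are illustrative assumptions that merely mirror the principle of a Data Structure Definition: an observation is valid only if every dimension carries a value from an agreed code list.

```python
# Hypothetical code lists; real SDMX uses maintained, agency-owned code lists.
CODELISTS = {
    "FREQ": {"A", "Q", "M"},          # annual, quarterly, monthly
    "REF_AREA": {"DE", "FR", "US"},   # reporting area
}

# A minimal structure definition: required dimensions plus one measure.
DSD = {"dimensions": ["FREQ", "REF_AREA"], "measure": "OBS_VALUE"}

def validate(observation: dict) -> bool:
    """Check that an observation matches the structure and its code lists."""
    for dim in DSD["dimensions"]:
        if observation.get(dim) not in CODELISTS[dim]:
            return False
    return DSD["measure"] in observation

good = {"FREQ": "M", "REF_AREA": "DE", "OBS_VALUE": 1.7}
bad = {"FREQ": "M", "REF_AREA": "XX", "OBS_VALUE": 1.7}  # unknown area code
print(validate(good), validate(bad))
```

Because every participant validates against the same shared structure, data produced in one place can be read and combined everywhere else without bilateral negotiation.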
1.1 Exploding Data Worlds
The data available to us all over the world are constantly and rapidly multiplying. Because the technical possibilities have grown immensely, more and more granular information—the corresponding term would be micro data, or even nano data—is being automatically recorded (e.g. via sensors). Social networks and search engines act as prominent collectors of such micro data. Coincidentally, they also drive technological developments—for example, Big Data—to deal with the volume of data generated. At the same time, about 70% of the world’s population currently own a mobile phone and contribute every day to the growing mountain of data.
As more and more data become available, our fixation on them is increasing accordingly: post-game analyses of sports events have already turned into data-driven comparisons of space gain, one-on-one duel performance and percentage of ball possession. Alongside this, our need for higher granularity (meaning the fine-grained nature of the data material) increases, as if greater detail could also give us greater certainty. For instance, in the past, regional average daily temperatures were absolutely sufficient to monitor the weather; now, however, hourly values are being recorded for individual cities or even streets.
Numbers suggest objectivity and provide a feeling of safety, and that is good. Or would we trust a pilot who, when we ask about the speed at which the aircraft is currently flying, has no other answer than "No idea, but quite fast"? We fear obscurity and seek certainty; the more of it, the better. This is why we measure everything, everywhere and at any time. This is why we force the world around us—which is fluent, continuous and nuanced by nature—more and more into grids and digits.
Even when dealing with ourselves, we do not stop our "numbermania": we measure our consumed calories, our sleep duration, our pulse rate. Although, in the end, there might be only one result we really care about: Are we healthy? Did we lose weight? Of course, the business world is not spared by this trend: a growing number of large companies refer to themselves as data-driven companies—there is an increasing perception that they are sitting on a data treasure which, until now, has largely been left unused.
1.2 Gated Communities: The Data Silos
The tremendous data treasures of enterprises and institutions are mostly stored in so-called data silos. A data silo encapsulates the data, programs and processes as well as the information technology (IT) and professional expertise belonging to a specific field (see Fig. 1.1).
Fig. 1.1 Data silo seen from different perspectives
Data silos may be veritable treasure chests. But, just like grain silos, they seem impenetrable to the outside viewer. Like grain silos, they are also easily underestimated, especially from a bird’s-eye perspective. This is no surprise: from above, you see only the area covered by the base. Only once the viewer is standing on the ground in front of the silo can its considerable height and volume be appreciated.
Data silos are mostly structures that have been developed in accordance with the actual needs of a specific department and have, over many years, been tested again and again, and ultimately optimised for regular use. Being well maintained by trained experts and developers, they offer a very high level of practicality. In addition, they are functional, robust and self-reliant; they can, for example, be set up to default to a consistent state after a system or power failure on the basis of their own data backups. Given that data silos provide such enormous value, larger companies can be expected to own a considerable, and in some cases even increasing, number of silos.
However, silos only work perfectly in isolation; the information contained within them is hardly usable outside of the silo. They use internal identifiers (IDs) or codes for products, articles, accounts, customers, suppliers and process steps. They choose their own formats for time, date, location, textual and quantitative information. Proprietary categories are created for goods, customers and territories, which in turn do not match those of other silos. All in all, if the goal was to shield the information as strictly as possible, silos are doing a fantastic job. This is why many companies and organisations are now making great efforts to integrate their data: to bring the data treasures from the silos into a uniform, interconnected, high-quality data world. In general, the attempt is worthwhile: data integration promises high added value.
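The mismatch described above can be made concrete with a small, entirely hypothetical Python sketch (all identifiers, field names and formats below are invented): two silos hold the same customer under incompatible ID schemes and date conventions, so linking them requires a hand-maintained mapping table rather than a shared order system.

```python
# Two silos describing the same customer in incompatible ways.
sales_silo = {"cust_id": "C-0042", "joined": "03/11/2021"}      # US-style date
support_silo = {"kundennr": "42", "registriert": "2021-11-03"}  # ISO 8601 date

# Without a shared standard, linkage rests on a manually curated mapping.
id_mapping = {"C-0042": "42"}  # sales ID -> support ID

def linked(sales: dict, support: dict) -> bool:
    """True if the mapping identifies the two records as the same customer."""
    return id_mapping.get(sales["cust_id"]) == support["kundennr"]

print(linked(sales_silo, support_silo))
```

Every new silo multiplies the number of such mappings to maintain, which is exactly the cost a common classification standard is meant to eliminate.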
1.3 Data Linkage Is the Key
The eagerness to collect more and more granular data from more and more data silos leads to some challenges: the more fine-grained the collected material, the less valuable is the single sand grain—the piece of micro data per se. The micro piece of information is an integral part of the overall analysis and therefore needed at short notice, but ultimately it will remain only one value among many. The useful amount of information has therefore not grown nearly as fast as the usable data volume. After all, hidden in these data collections lies a mountain of data points that has to be searched through.
The evaluation of a micro data set consists of suitable aggregation, outlier detection, calculation of average, minimum or maximum values, following observations over time, and so on. However, the quantum leap in the creation of knowledge occurs when micro data sets of various data silos are brought together: by linking data from different data sources, one can transform the single players into a much more powerful ensemble, as given in the examples following.
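The evaluation steps named above can be sketched with nothing more than descriptive statistics. The readings and the two-standard-deviation outlier rule in this Python sketch are illustrative assumptions, not a method prescribed by the book:

```python
import statistics

# Hypothetical micro data set: hourly temperature readings for one street.
readings = [14.1, 14.3, 13.9, 14.0, 29.5, 14.2, 13.8]  # one obvious outlier

mean = statistics.mean(readings)
stdev = statistics.stdev(readings)

# Simple rule: flag values more than two standard deviations from the mean.
outliers = [x for x in readings if abs(x - mean) > 2 * stdev]

aggregate = {
    "min": min(readings),
    "max": max(readings),
    "mean": round(mean, 2),
    "outliers": outliers,
}
print(aggregate)
```

Within a single silo this is about as far as the analysis goes; the real gain, as the text argues next, comes from joining such sets across silos.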
The scanners used at supermarket checkouts collect a tremendous amount of information: products and their quantities, the times and places of sales, prices, reductions and much more. A lot of conclusions can be drawn from these figures. But, of course, the information value would be even higher if other data relating to the buyer could be linked to the scanner data: name, address, age, sex, occupation, income and so on.
Imagine how big, indeed gigantic, the information value would be if one could combine the customer’s supermarket data with their data from different sales points, such as pharmacies, furniture markets, petrol stations and car workshops. This is why large business corporations offer lucrative membership programmes where you collect points with each purchase and convert them into attractive reductions. In return, they collect our purchase data to create an incredibly fascinating data pool of our preferences for food, drugstore articles, prescription-free medicines, petrol and car repairs. All of this, of course, with the aim of optimally tailoring their offers to our pre-calculated needs, displaying them on request and giving us personal advertising recommendations.
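A minimal sketch of such a linkage, assuming a shared loyalty-card ID as the join key (all field names, IDs and values below are invented for illustration):

```python
# Anonymous checkout scans, each carrying only a loyalty-card ID.
scans = [
    {"card_id": "L-17", "product": "coffee", "price": 4.99},
    {"card_id": "L-17", "product": "petrol", "price": 60.00},
    {"card_id": "L-23", "product": "aspirin", "price": 2.49},
]

# Membership data held in a second system, keyed by the same card ID.
members = {
    "L-17": {"age": 34, "city": "Frankfurt"},
    "L-23": {"age": 58, "city": "Dornburg"},
}

# The join turns anonymous receipts into buyer profiles.
linked = [{**scan, **members[scan["card_id"]]} for scan in scans]

# One of many analyses the linked data now supports: spending by city.
spend_by_city = {}
for row in linked:
    spend_by_city[row["city"]] = spend_by_city.get(row["city"], 0) + row["price"]
print(spend_by_city)
```

The technical join is trivial once a common key exists; the hard part, and the subject of this book, is agreeing on such keys and classifications across systems in the first place.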
However, it is not only in the area of consumption that data integration represents a breakthrough in the generation of information and the development of knowledge. In the field of sciences, the linking of data from different disciplines also offers huge potential for intelligence gathering and problem solving.
Take, for example, the increasing incidence of resistant germs, which no longer react to antibiotics and have therefore become extremely dangerous. What causes the phenomenon and, more importantly, who is able to contain the threat?
Lack of hygiene in medical facilities or places hosting massive crowds of people, such as sports stadiums? This would concern these facilities.
Excessive or carefree administration of antibiotics for harmless diseases? This would relate to human medicine.
Excessive or carefree administration of antibiotics in livestock farming, even as feed supplements? Then veterinary medicine and agriculture would be responsible.
Use of expired products, potentially coming from illegal international trade? This might relate to a possible lack of working control mechanisms in this field.
Other reasons for the phenomenon?
Examples like this clearly demonstrate that the combination of data on different phenomena can be extremely helpful for the discovery and possible solution of problems. But the same examples also illustrate the shady aspects of data integration—because in a world in which such collections of data can be created for each and every one of us, maybe even without our consent, the individual is helplessly exposed to the evaluations performed on their data, the conclusions drawn from them and, most importantly, the actions derived from them. In general, history shows us that when dealing with potentially dangerous technical advancements, ignoring their possibilities or simply prohibiting their use is not an effective response. However, the development of legal and social protection mechanisms has to keep up to speed with technical progress in order to avoid the "big brother" scenarios we fear the new possibilities of data linkage could lead to.
1.4 Data Linkage Succeeds with an Order System
To enable this vision of knowledge gain and problem solving by means of data integration to become a reality, there is a universal requirement for any raw data material: a good description of the data, unique identifiers for key objects (e.g. locations, products, companies) and the consistent use of uniform concepts for classification criteria or attributes (see Fig. 1.2).
Fig. 1.2 Requirements for data to be evaluable
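These three requirements can be illustrated with a single standardized record. The field names and the company identifier scheme in this sketch are assumptions; only the uniform concepts used for the values are real standards (ISO 3166 country codes, ISO 8601 dates):

```python
from datetime import date

record = {
    "description": "monthly retail turnover",    # requirement 1: what the figure means
    "company_id": "DE-HRB-12345",                # requirement 2: unique identifier (hypothetical scheme)
    "location": "DE",                            # requirement 3: uniform concept, ISO 3166 code
    "period": date(2021, 11, 1).isoformat(),     # requirement 3: uniform concept, ISO 8601 date
    "value": 1250.0,
}

# A record satisfying all three requirements is self-describing and linkable.
REQUIRED = {"description", "company_id", "location", "period", "value"}
print(REQUIRED <= record.keys(), record["period"])
```

Any system receiving such a record can interpret and link it without consulting the silo that produced it, which is precisely what the order system discussed here is meant to guarantee.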
To assemble the various data collections, some kind of compass or map, an operating