The Fiesta Data Model: A novel approach to the representation of heterogeneous multimodal interaction data

About this ebook

This dissertation presents a data model for the representation of experiment-based data sets from the research field of multimodal communication. Evidence is provided for the existence of various problems and shortcomings in the work with multimodal data collections. This evidence results from (a) an analysis of existing multimodal corpora and (b) a survey in which researchers were asked about concrete problems in the work with their multimodal data collections. On this basis it is shown that, despite the existence of a multitude of data models and formalisms for the representation of classical text corpora, these are not suited to capture the characteristics specific to multimodal corpora. For this reason, a data model is developed that seeks to account for all those specific properties of multimodal corpora. This data model offers solutions especially for the work with one or more timelines and spatial coordinates, for the representation of complex annotation values, and for the transformation between various (hitherto incompatible) file formats of widespread annotation tools.
Language: English
Release date: Apr 5, 2016
ISBN: 9783741217920
Author

Peter Menke

Peter Menke (born 1982 in Herford) studied Naturwissenschaftliche Informatik (natural science informatics), literary studies, and linguistics at Bielefeld University. Since completing his master's degree, his research interests have been the administration, handling, and modelling of multimodal event data as they are widely used in linguistics and interaction research.



    Part I

    Introduction

    1

    Introduction

    Every step is a first step if it’s a step in the right direction.

    TERRY PRATCHETT: I Shall Wear Midnight

    1.1 Overview

    THIS THESIS INTRODUCES and describes FiESTA, a new data model and library that assists researchers in creating, managing, and analyzing multimodal data collections. In this introductory chapter, we clarify the motivation for this project and, in parallel, give a commented overview of how each chapter contributes to the big picture. The visual roadmap in Figure 1 on the following page accompanies and illustrates this outline.

    SECTION 1.2 describes our motivation and contains pointers to the respective chapters of the thesis.

    SECTION 1.3 connects this thesis to other publications and projects from the wider context of multimodal corpora and data sets.

    Figure 1: Visual roadmap for this thesis. Large, dashed boxes indicate parts, nested solid boxes stand for chapters. The narrative flow is shown as arrow connections between the chapters. Italic texts next to chapters outline the goals or accomplishments of their respective chapter(s). GEF (due to the restricted space in the diagram) stands for generic exchange format.

    SECTION 1.4 introduces some conventions used in this thesis, along with some remarks about mathematical notations.

    1.2 Motivation

    Multi-modal records allow us not only to approach old research problems in new ways, but also open up entirely new avenues of research.

    Wittenburg, Levinson, et al., 2002 : 176

    THIS STATEMENT DESCRIBES a central development in linguistics and its neighbouring disciplines over the last decades: The focus of research is no longer restricted to the purely linguistic component of communicative interaction. Instead, interaction is understood as a complex interplay between linguistic events (typically, spoken utterances) and events in other modalities, such as gesture, gaze, or facial expressions (cf. Kress and Leeuwen, 2001; Knapp, Hall, and Horgan, 2013).

    A couple of decades ago, technology could only provide limited support to this branch of research. Microlevel video analysis, for instance, originated in the last century: Back then, researchers used purpose-built film projectors that could play film reels "at a variety of speeds, from very slow to extremely fast, effectively achieving slow motion vision and sound" (Condon and Ogston, 1967 : 227). This served as the basis for detailed, yet hand-written, analyses of interaction on the level of single video frames.

    Since then, researchers have benefited from various developments and technological shifts, such as easily available computing facilities and the digitization of video and audio recordings: The fact that media recordings can be digitized means that copies no longer suffer any loss of quality. This is an improvement compared to situations where copies of analog media were often expensive while at the same time being lossy, which also limited the number of generations of copies that could be produced (cf. Draxler, 2010 : 11 f.). In addition, year by year, computational power, disk space, and storage devices (such as working memory or hard disks) become more affordable (Gray and Graefe, 1997).

    In addition, the advent of high-level programming languages (in general, and especially in the scientific context) and the ever-growing supply of modular, reusable programming libraries containing solutions to many problems enabled the community to create annotation tools. These are special pieces of software suited to the needs of researchers in the field of multimodal interaction, such as the EUDICO Linguistic Annotator ELAN (Wittenburg, Brugman, et al., 2006), or Anvil (Kipp, 2001). Both tools support the playback of and navigation within video and audio recordings, as a basis for the creation and temporal localisation of additional data. Similarly, for detailed phonological and phonetic analyses of sound files, the tool Praat (Boersma and Weenink, 2013, 2001) was developed.

    With these tools and their wide range of possible operations, scientists work on a diverse range of research questions, investigating phenomena such as

    –  the synchronicity and cross-modal construction of meaning in speech and gesture signals (Lücking et al., 2010; Bergmann and Kopp, 2012; Bergmann, 2012; Lücking et al., 2013),

    –  the use of speech-accompanying behaviour signalling emotion, and its possible differences in patients and healthy subjects (Jaecks et al., 2012),

    –  the interaction of speech and actions in object arrangement games, with a focus on the positioning of critical objects in a two-dimensional target space (Schaffranietz et al., 2007; Lücking, Abramov, et al., 2011; Menke, McCrae, and Cimiano, 2013),

    –  or the multimodal behaviour in negotiation processes concerning object configurations in miniature models of buildings or landscapes (Pitsch et al., 2010; Dierker, Pitsch, and Hermann, 2011; Schnier et al., 2011).

    IN ALL DIALOGICAL¹ situations investigated in these experiments, interlocutors produced several series of interaction signals over time – such as speech, gestures, facial expressions, or manipulations of objects located in the shared space between interlocutors. These streams of interactive patterns are sometimes independent of each other. Often, however, multiple streams are coupled within a single interlocutor (e. g., in speech-accompanying gestures), and in other cases, the streams of different interlocutors are (at least locally) coupled (e. g., in co-constructions of speech, where a fragmentary segment of a linguistic construction is continued or completed by another interlocutor).

    Figure 2: Schema of the data generation workflow in the research of multimodal interaction. Left: The different levels of data, and information about how subsequent layers are generated out of prior ones. Right: An example of primary and secondary annotations based on the segment of a recording (containing a speech transcription, an annotation of gesture, a syntactic analysis of the speech, and a secondary annotation expressing a hypothesis about how items from both the speech and the gesture layer form a joint semantic unit).

    A detailed and thorough analysis of such dialogues typically pursues the following course (cf. Figure 2; the following description indicates numbers in the diagram for easier reference):

    First, the original situation is documented in media recordings, such as video and audio files (1).

    To simplify work, further references to these recordings (and, indirectly, to the events of the original situation) refer to an abstraction in the shape of a so-called timeline (2). Points and intervals on this timeline are the only link to the underlying media files, since (under the assumption that all media have been synchronised) every segment in the media files can unambiguously be referenced with such time stamp information.

    Then, researchers create primary annotations (3). This is done by identifying points or intervals on the timeline and associating them with a coded representation (typically, by using text) of the observed phenomenon.

    In addition, it can be necessary to generate annotations on an additional level, so-called secondary annotations (4). These do not refer to temporal information directly. Instead, they point to one or multiple annotations (cf. Brugman and Russel, 2004 : 2065). They typically assign a property or category to an annotation, or model a certain kind of relation between two or more annotations.
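    To make this layered structure concrete, the following minimal Python sketch models the three levels just described – timeline intervals, primary annotations, and secondary annotations. All class and attribute names are illustrative assumptions for this overview; the actual FiESTA data model is specified in Chapter 10.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Interval:
    """A span on the shared timeline, in seconds."""
    start: float
    end: float

@dataclass
class PrimaryAnnotation:
    """A coded observation anchored directly to the timeline."""
    interval: Interval
    layer: str     # e.g. "speech" or "gesture"
    value: str     # coded representation of the observed phenomenon

@dataclass
class SecondaryAnnotation:
    """An annotation that refers to other annotations instead of the timeline."""
    targets: list  # one or more annotations this annotation points to
    relation: str  # the property or relation being expressed

# A speech token and a co-occurring gesture stroke, both primary ...
speech = PrimaryAnnotation(Interval(12.40, 12.95), "speech", "this")
gesture = PrimaryAnnotation(Interval(12.35, 13.10), "gesture", "pointing stroke")

# ... linked by a secondary annotation stating that they form a semantic unit.
unit = SecondaryAnnotation([speech, gesture], "joint semantic unit")
```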

    DATA AND MODALITY are two terms which, although researchers have an intuitive understanding of them, often lack proper definitions. Therefore, we prepend to this thesis two chapters that attempt to clarify the exact definitions of terms from the two fields of data (Chapter 2) and of modalities (Chapter 3).

    While most investigations concerning multimodal interaction follow the basic schema described above, its concrete realisations can diverge substantially from project to project. This is mostly due to the fact that different research questions often require idiosyncratic data structures and different descriptive categories (as, for instance, for the description of non-linguistic behaviour).

    IN ORDER TO give a more detailed overview of how these data structures can be designed, and how they diverge against the background of varying research questions, descriptions of a sample of multimodal data collections, along with the underlying research questions, are presented in Chapter 4, starting on page →. This is accompanied by an introduction to the graphical user interfaces and the file formats of two annotation tools that were repeatedly used for creating the example data collection: Praat and ELAN (Chapter 5, starting on page →).

    FIRST AND FOREMOST, the annotation tools mentioned above provide a solid basis for the research of multimodal interaction. And yet, as will be shown in the following chapters, there are still areas and specific tasks where these general-purpose tools fail, and where creative, but ad-hoc solutions are implemented. Examples of such problematic tasks are

    –  the creation of a certain connection inside the annotation structure for which the developers of the tool did not provide a solution,

    –  the automated creation and seamless integration of an additional layer containing part-of-speech tags (a task that can almost effortlessly be performed when working with text-based corpora),

    –  a calculation of quantitative relations of interesting patterns in multiple layers,

    –  or a customizable visualisation of such patterns.

    Thus, as a starting point for our investigation we take the following claim that summarises these (and other) issues:

    CLAIM 1

    Investigators of multimodal interaction need better support in various areas for the collection, analysis, visualisation, exchange, storage, and machine comprehensibility of their corpora.

    An analysis of the example data collection reveals first evidence for this claim, which we complement with observations and results from the literature. This analysis is given in Chapter 6 (page →).

    HOWEVER, INFORMATION CONCERNING issues in data generation and analysis is sparse in scientific publications, and our analysis of the example data can only produce hypotheses about potential problems and issues. Therefore, a survey among creators and producers of multimodal data collections was conducted in addition. In particular, this survey examined what kinds of problems researchers faced and which of them impeded their work most, what tasks they needed to perform in order to answer specific research questions, and what features and solutions capable of assisting and supporting them in this should look like. This survey, its design considerations, its realisation, and its evaluation are presented in Chapter 7 (starting on page →).

    IN SEVERAL OTHER areas with competing data formats, a solution to such a set of problems was to develop a common exchange file format – one central format that can model the data structures and represent the information contained in all other formats. The advantage of such a central format is that, once data conversion routines between the common format and third-party formats are established, any subsequent task only needs to be implemented once – for the common format. Instead of n · (n − 1) pairwise converters between n formats, only 2n conversion routines to and from the central format are required. In the past, such exchange (or pivot) formats have successfully been created in different areas. Also, several exchange formats have been developed and (sometimes) standardised in the field of linguistics, such as the modular schemas for representing different sorts of texts by the Text Encoding Initiative (TEI; TEI Consortium, 2008), the Linguistic Annotation Framework (LAF; Ide, Romary, and Clergerie, 2003; Ide and Romary, 2006), the PAULA format (Potsdamer Austauschformat für linguistische Annotationen; Zeldes, Zipser, and Neumann, 2013), and many more.

    An obvious choice would be to identify and use an exchange format that is suited for representing complex multimodal data sets. In order to evaluate candidates for such a purpose, we analysed the collected evidence (both from the literature review and the survey), and transformed it into a catalogue of criteria for exchange formats. This catalogue will then be used to evaluate exchange format candidates. The advantages of generic exchange formats and the resulting catalogue of criteria are described in detail in Chapter 8 (starting on page →).

    AT FIRST GLANCE, many file formats that have successfully been used to represent textual data seem to be promising candidates also for the representation of multimodal interaction data. However, an evaluation of these formats revealed that multimodal corpora have conceptual and structural differences from text-based corpora that make it difficult to apply such a formalism.

    One of these problems is that many of the common formats presuppose the existence of one single, flat stream of primary data – typically, a text or a non-overlapping sequence of transcribed utterances. In these approaches, often either tokens are marked in the primary text which can then be referred to from annotations, or locations in the text are described using numeric character offsets. However, this approach cannot express multimodal interaction data in an adequate way. While there are numerous reasons, we present three of them that we consider especially important:

    In classical corpus linguistics, the primary data is already present in the shape of the finalised (thus, immutable) text to be annotated. In contrast, in multimodal interaction studies such a text does not exist a priori, but it has to be produced in the form of a transcription. Such a transcription must itself refer to a kind of axis more suited to the situation, that is, a timeline, and optionally, also to spatial coordinates. Approaches that use only character sequences as their primary axis therefore have no adequate way of representing these temporal and spatial coordinates.

    A textual primary axis, be it segmented on the word or character level, has a much lower resolution than a timeline. In addition, the axis distorts temporal relations and makes the comparison of durations and distances impossible, because character-based lengths and distances cannot be compared to temporal or spatial ones.

    In multimodal interaction, multiple streams of events often co-occur which cannot be flattened down into a single sequence without discarding large amounts of important information about their order.
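    The second and third points can be illustrated with a small, hypothetical example: time stamps allow durations and cross-stream overlaps to be computed directly, whereas character offsets into a flattened transcript support neither.

```python
# Two events from different streams, anchored on a shared timeline (seconds).
speech = (12.40, 12.95)   # the spoken word "this"
gesture = (12.35, 13.10)  # a co-occurring pointing gesture

duration = speech[1] - speech[0]  # 0.55 s: a genuine duration
overlap = min(speech[1], gesture[1]) - max(speech[0], gesture[0])
print(f"temporal overlap: {overlap:.2f} s")  # 0.55 s of co-occurrence

# The same events flattened into one character-based primary axis:
text = "this [pointing]"
# speech at offsets (0, 4), gesture at offsets (5, 15): the flattening
# imposes a linear order, the measured overlap is lost, and a length of
# four characters is not comparable to any duration or distance.
```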

    These and other problems have become evident in the literature review as well as the survey. In order to assess reliably to which degree existing solutions meet the requirements of multimodal interaction studies, we evaluated known data models, libraries, and file formats that have been proposed or used for the handling of linguistic (and, possibly, multimodal) corpora. The result of this evaluation shows that none of these data models meets a sufficient number of the criteria from the catalogue collected earlier. Chapter 9 (beginning on page →) contains this detailed evaluation of existing solutions.

    THE RESULTS FROM this evaluation underpin the second central claim of this work:

    CLAIM 2

    There is (to the best of our knowledge) no known solution (in the shape of a theoretical or implemented data model) that meets the important requirements that researchers have when investigating multimodal interaction.

    Since the evaluation of the solutions under examination did not identify a suitable candidate, the final part of this thesis describes the design and development of FiESTA, a novel data model intended to solve as many of the aforementioned issues as possible. Since it has been designed as a specific match to the criteria catalogue, this data model is expected to provide better solutions to the problems in multimodality research. We exemplify the usefulness of FiESTA by describing implemented and potential improvements to the workflow of scientists, along with an evaluation of the formalism against the criteria catalogue.

    The formal specification and documentation of the so-called Format for extensive spatio-temporal annotations (FiESTA) can be found in Chapter 10 (beginning on page →). Summaries of the XML-based file format and the pilot implementation are given in Chapter 11 (page →), and a conclusive evaluation in Chapter 12 (page →).

    SINCE A SINGLE thesis does not provide enough space for a thorough description and documentation of such an ambitious project, the conclusion evaluates on a meta-level what has been achieved in the thesis, and what further developments and improvements appear promising. This conclusion and the outlook are presented in Chapter 13 (page →). In addition, Appendix A (page →) presents the questions of the survey in detail.

    1.3 Relations to other works or publications

    This thesis project is closely related to and embedded within the endeavours of Project X1 Multimodal Alignment Corpora within the Collaborative Research Centre (CRC)² 673 Alignment in Communication³. X1 provided solutions for both low-level storage and sharing and high-level administration and analysis of a variety of data collections dealing with multimodal dialogues. The models and products of this thesis provided the theoretical and structural basis for several of these solutions. In some cases, early versions of models and implementations have already been used and integrated into services and applications (one of them being the Phoibos corpus manager, which is mentioned in Chapter 12).

    A DRAFT OF THE MEXICO MODEL for managing multimodal corpora (a sister project of FiESTA that aims at representing whole multimodal corpora; MExiCo stands for Multimodal Experiment Corpora) has been summarized and published in Menke and Cimiano (2013).

    THE GENERAL IDEA OF A CORPUS MANAGEMENT APPLICATION (and also of the underlying functionality that eventually resulted in the FiESTA and MExiCo libraries) has been outlined in Menke and Mehler (2010) and Menke and Mehler (2011), and plans were to integrate the X1 solutions with the eHumanities Desktop System (Gleim, Waltinger, Mehler, et al., 2009). However, due to personnel reorganisation issues within Project X1, this agenda was abandoned in favor of the development of the current approach.

    THE NOTION OF A GENERIC SCALE-BASED APPROACH to modelling multimodal annotations (as described in this thesis) is based on earlier work: on Menke (2007), where a general scale concept for an improved modelling of overlapping discourse segments, based on Stevens’ levels of measurement (Stevens, 1946), is described, and on Menke (2009), where metrics for the calculation of the synchronicity of interval-based annotations, especially for multimodal ensembles⁴ consisting of speech and gesture parts, are introduced.

    FIESTA, while being a central subject of this thesis, is present as a draft in various earlier publications (not necessarily bearing this name, but possibly under its previous working title ToE, short for time-oriented events), among them Menke and Mehler (2010), Menke and Mehler (2011), and Menke and Cimiano (2012).

    PRELIMINARY INVESTIGATIONS TOWARD A MACHINE-READABLE, RDF-BASED ONTOLOGY OF MULTIMODAL DATA UNITS AND PHENOMENA based on the works of Chapter 3 have been discussed at a workshop on multimodality in the context of the LREC 2012 conference, and have been published afterwards in Menke and Cimiano (2012).

    THE CHAT GAME CORPUS developed in Project B6 of the CRC 673 has been summarized and described in Menke, McCrae, and Cimiano (2013).

    THE ATTEMPT OF DEVELOPING A PROTOTYPE FOR A MULTIMODAL TOOL CHAIN using FiESTA as the central exchange file format is described in Menke, Freigang, et al. (2014).

    1.4 Conventions and declarations

    1.4.1 Mathematical conventions

    MEDIAN AND QUARTILES. We denote the median of a distribution as μ, and the interval between the lower and upper quartile of a distribution as Q1,3.

    SIGNIFICANCE. The results of a statistical significance test are called highly significant and marked with ** if p ≤ 0.01, and they are called significant and marked with * if 0.01 < p ≤ 0.05.
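    Restated as a small helper function (a hypothetical illustration of the marking rule above, not code from the thesis):

```python
def significance_marker(p: float) -> str:
    """Map a p-value to the significance markers used in this thesis."""
    if p <= 0.01:
        return "**"  # highly significant
    if p <= 0.05:
        return "*"   # significant
    return ""        # not significant at the 0.05 level
```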

    BOOLEAN VALUES. 𝔹 is the set of Boolean values true T and false F.

    ACCESS TO SEQUENCE ELEMENTS. The nth element of a sequence s is denoted by s[n].

    DOT NOTATION. For structures that have subordinate components, a dot notation inspired by the member access syntax of several programming languages is used. For instance, if b is a book, then b.TITLE retrieves its title, and b.CHAPTERS returns an enumeration of its chapters.

    We consider this notation more readable than multiple nested predicate expressions.
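    In executable form, the two conventions of sequence access and dot notation correspond to the following Python fragment (the book structure is, of course, a purely hypothetical illustration):

```python
from dataclasses import dataclass

@dataclass
class Book:
    title: str
    chapters: list

b = Book(title="The Fiesta Data Model", chapters=["Introduction", "Data"])

# Dot notation: b.TITLE in the text corresponds to attribute access here.
assert b.title == "The Fiesta Data Model"

# Sequence access: s[n] denotes the nth element of a sequence s
# (with zero-based indexing in Python).
s = b.chapters
assert s[1] == "Data"
```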

    1.4.2 Bibliographic conventions

    Since this thesis originates from a research project at a German university, we assume that there will not be any need for translations of German quotations. Translations from languages other than English or German were created by us, if not explicitly stated otherwise by the addition of a source of the translation. Highlighting and structure in quoted passages originate from the original authors, if not explicitly stated otherwise.

    Pages on the World Wide Web serve two different functions in this document: as a means for the mere localisation of a resource, and as evidence for an argument.

    If a page serves as the entry page or frontdoor page of a product, object, or other resource that the text refers to, then the link to the page is given in a footnote.

    If a page contains information that serves as evidence for arguments in the text, then it is inserted as an ordinary citation. Title information is taken from the TITLE element in the HTML header. Author information is obtained from the text body or from the HTML header. If no author could be detected, the citation displays a custom shorthand in the text and as the label in the bibliography, preceded with the ° character, such as °SyncWriter1.

    All web resources have been checked for functionality on 26 May 2015, unless we provide a different date in the reference.


    ¹ Throughout this thesis, dialogue and dialogical explicitly include communicative situations with more than two participants (for which sometimes the term multilogue is used).

    ² This official English translation of the German term Sonderforschungsbereich (SFB) is imprecise; a better rendering would be specialised research department. However, due to the official status of the first term, we will use it (or its abbreviation).

    ³ http://www.sfb673.org

    ⁴ Multimodal ensembles were introduced in Mehler and Lücking (2009) and Lücking (2013), see also below.

    Part II

    Background: What are multimodal data?

    2

    Data

    MERE ACCUMULATION OF OBSERVATIONAL EVIDENCE IS NOT PROOF.

    TERRY PRATCHETT: Hogfather

    2.1 Introduction

    WHEN TALKING ABOUT solutions for the management and analysis of multimodal data⁵, it is advisable to have at least a basic agreement on the terms used in this phrase. However, although most researchers from fields where recorded dialogues and communicative situations are analysed have an intuitive agreement on these terms (especially primary data and secondary data), it is not trivial to find definitions for them that really match their usage.

    As a consequence, this chapter clarifies what is to be understood under the superordinate term data, and then summarises the most prominent readings and definitions of primary and secondary data. It will become apparent that some of these are quite closely related, while others use the same terms although they have little in common with the other readings conceptually. These different levels of relation will be analysed in the conclusion of this chapter.

    2.2 An examination object

    FOR THE REST of this chapter we will refer to the example configuration of a multimodal experiment depicted in Figure 3. This figure contains a schematic overview of the data sets resulting from a typical (yet fictitious) experiment. The original situation and the representations derived from it are given numbers, while the data mapping operations between them are given letters for easier reference. This setup may seem artificial, yet it contains several relations and data types that will help in understanding the differences between the variations in data nomenclature. The representations are:

     THE ORIGINAL SITUATION. This is the sequence of communicative events in reality (which, as will later be shown, is volatile, and has to be recorded and documented).

     DIRECT VIDEO AND AUDIO RECORDINGS. These are conventional, immediate recordings of the visual impression and the sound that occurred in the original situation. They are stored using discrete frames or samples, which, when played, create an impression of moving pictures and continuous sound.

     A LIVE TRANSCRIPT. This is a log created by an observer who was present in the real communicative situation and wrote down immediately what was said in the dialogue, and by whom.

     A CONVERTED VIDEO FILE. This could be a video file that is reduced in file size or image resolution, or that has been compressed using a video codec.

     GESTURE ANNOTATION. Based on audio and video recordings, specially trained annotators create time-based markings and classifications of the gestures produced by interlocutors in the original situation.

     SPEECH TRANSCRIPT. Based mainly on the audio recordings, transcriptions of the speech (to be more exact, of utterances and words) of the interlocutors in the original situation are created. These transcriptions are created with special software that allows for the marking of the temporal position and duration of the utterances and words.

     PART-OF-SPEECH ANNOTATION, AUTOMATED. A piece of software takes speech transcripts as input and assigns part-of-speech information to the units on the word level, according to the implemented algorithm and the underlying data sets (these can be lexica, dictionaries, corpora, etc.); a minimal code sketch of such a step is given after this list.

     PART-OF-SPEECH ANNOTATION, MANUAL. An annotator investigates the units on the word level found in the speech transcripts, and assigns part-of-speech information to them, based on his grammatical competence and the material he has at his disposal (again, lexica, dictionaries, etc.).

    Table 1: Comparison of elements in definitions of the concept data.

     UNIFIED SPEECH TRANSCRIPT. Multiple speech transcripts are analysed and combined into a single resource, in order to reduce errors and disambiguate missing or disputable portions.
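    As an aside, the automated part-of-speech annotation sketched in the list above can nowadays be performed with a few lines of code. The following example uses the NLTK library as one possible off-the-shelf tagger; the choice of library (and the tag set it produces) is an assumption of this illustration, not part of the experiment described here.

```python
# A minimal sketch of automated part-of-speech annotation with NLTK.
# Requires: pip install nltk, plus one-time model downloads, e.g.
#   nltk.download("punkt"); nltk.download("averaged_perceptron_tagger")
import nltk

transcript = "the red ball rolls to the left"
tokens = nltk.word_tokenize(transcript)   # word-level units of the transcript
tagged = nltk.pos_tag(tokens)             # assign a POS tag to every token
print(tagged)
# e.g. [('the', 'DT'), ('red', 'JJ'), ('ball', 'NN'), ('rolls', 'VBZ'), ...]
```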

    2.3 Data

    THERE ARE SEVERAL definitions and interpretations of data. Although they are all related and refer to similar things, they still differ slightly, depending on the disciplines involved and the purposes pursued.

    According to Kertész and Rákosi (2012 : 169), "[a] datum is a statement with a positive plausibility value originating from some direct source." Christian Lehmann defines the term in a similar way. For him, "[a] datum is a representationᵢ of an aspect of the epistemic object of some empirical research whichᵢ is taken for granted" (Lehmann, 2005 : 182). In these two definitions, three relevant components can be identified (cf. Table 1):

    Data are representations of the objects of study. This also implies that a datum is tangible, that is, it is materially manifested and can be accessed and used as the basis for an analysis.

    The objects of study (entities, along with their properties and relations that are of interest to researchers) serve as a basis for the creation of data.

    People usually agree that these representations have a certain quality that makes them usable as a basis for scientific argumentation, proofs, and similar things (often this quality is called validity, cf. Menke, 2012 : 288 f.).

    There are, however, some minor issues with both definitions. First, we consider statement too narrow a concept to model several of the types of data multimodal research deals with, especially raw signal data in audio or video recordings. They are not statements, because they do not have any propositional content, at least not without an additional level of interpretation. Also, for an object to serve as a valid datum, it needs more than just any positive plausibility value. Plausibility should be high enough that one can rely on it in order to draw valid conclusions. There is, however, no absolute threshold; it depends on the situation. At least, the plausibility value should be significantly higher than a chance value or baseline (cf. Menke, 2012 : 304 f.).

    A working definition based on the two definitions cited above could be:

    DEFINITION 1

    A scientific datum is a valid and processable representation of an object of study, or of one or more of its aspects or properties.

    In this definition, "valid and processable" mean the following: The validity value must be high enough to minimize errors and doubts, and data must be in a state such
