Discover this podcast and so much more

Podcasts are free to enjoy without a subscription. We also offer ebooks, audiobooks, and so much more for just $11.99/month.

Allen Day: Google’s Mission to Provide Open Datasets for Public Blockchains

Allen Day: Google’s Mission to Provide Open Datasets for Public Blockchains

FromEpicenter - Learn about Crypto, Blockchain, Ethereum, Bitcoin and Distributed Technologies


Allen Day: Google’s Mission to Provide Open Datasets for Public Blockchains

FromEpicenter - Learn about Crypto, Blockchain, Ethereum, Bitcoin and Distributed Technologies

ratings:
Length:
68 minutes
Released:
Sep 26, 2018
Format:
Podcast episode

Description

Public blockchains produce enormous amounts of data. In theory, anyone can access the raw contents of transaction and blocks. In practice, however, querying blockchains can prove to be a daunting task.
The difficulty lies in the fact that blockchains are particular types of distributed databases and thus carry several limitations. Most, if not all, blockchains lack the most basic SQL querying capabilities supported by nearly every off-the-shelf database system.
Take Bitcoin as an example. Its API lacks even the most basic calls which would allow a user to query any address and receive the balance. In order to achieve this, block explorers and alike have developed sophisticated middleware infrastructure that parses the blockchain, normalizes the data, and stores it in a database, where it can be queried. In the best of cases, companies offer API calls for only a limited set of operations. Google hopes to change this by freeing blockchain datasets.
We’re joined by Allen Day, Science Advocate at Google’s Singapore office. Earlier this year, he and his team released the Bitcoin blockchain as a public dataset in Big Query, Google big data IaaS offering. In August, they added Ethereum to their list of freely available public datasets, which includes US census data, cannabis genomes, and the entirety of Reddit and Github. Anyone wishing to query the data can do so in SQL on the Big Query website or via an API. For instance, a relatively simple query would return the daily mean transaction fees since the Genesis Block in just a few seconds.
Coupled with Google’s AI and Machine Learning infrastructure and other open data sets, one can only imagine the potentially groundbreaking insights we could gain from this data.
Topics covered in this episode:

Allen’s background as a geneticist
The similarities between blockchains and evolution process in lifeforms
Google’s cloud platform and its various components
Big Query and its publicly available datasets
The Bitcoin and Ethereum datasets in Big Query
Why this data is useful to the public and for what it may be used
The particular challenges in implementing Ethereum as opposed to Bitcoin
Insights we may gain by crossing blockchain dataset with other data
How machine learning and AI could help us better understand specific transaction patterns

Episode links:

Bitcoin in BigQuery: blockchain analytics on public data
Bitcoin Blockchain Public Dataset
Ethereum in BigQuery: a Public Dataset for smart contract analytics
Ethereum in BigQuery: how we built this dataset
Ethereum Blockchain Public Dataset
Change Agent by Daniel Suarez
Real-time Ethereum Notifications for Everyone for Free
ethjs-abi library, compiled for use in Google BigQuery
Kaggle: Your Home for Data Science
The Strange Inevitability of Evolution - Issue 20: Creativity - Nautilus
Reddit_
Google Cloud

Thank you to our sponsors for their support:

The open, decentralized trading protocol for ERC20 tokens using the Dutch auction mechanism. More at epicenter.tv/dutchx.
Deploy enterprise-ready consortium blockchain networks that scale in just a few clicks. More at aka.ms/epicenter.

This episode is hosted by Sébastien Couture. Show notes and listening options: epicenter.tv/254
Released:
Sep 26, 2018
Format:
Podcast episode

Titles in the series (100)

Epicenter brings you in-depth conversations about the technical, economic and social implications of cryptocurrencies and blockchain technologies. Every week, we interview business leaders, engineers academics and entrepreneurs, and bring you a diverse spectrum of opinions and points of view. Epicenter is hosted by Sebastien Couture, Brian Fabian Crain, Meher Roy, Sunny Aggarwal, and Friederike Ernst. Since 2014, episodes have been downloaded over 4 million times.