Discover millions of ebooks, audiobooks, and so much more with a free trial

Only $11.99/month after trial. Cancel anytime.

Unavailable
Mining the Social Web: Data Mining Facebook, Twitter, LinkedIn, Instagram, GitHub, and More
Unavailable
Mining the Social Web: Data Mining Facebook, Twitter, LinkedIn, Instagram, GitHub, and More
Unavailable
Mining the Social Web: Data Mining Facebook, Twitter, LinkedIn, Instagram, GitHub, and More
Ebook425 pages7 hours

Mining the Social Web: Data Mining Facebook, Twitter, LinkedIn, Instagram, GitHub, and More

Rating: 4 out of 5 stars

4/5

()

Currently unavailable

Currently unavailable

About this ebook

Mine the rich data tucked away in popular social websites such as Twitter, Facebook, LinkedIn, and Instagram. With the third edition of this popular guide, data scientists, analysts, and programmers will learn how to glean insights from social media—including who’s connecting with whom, what they’re talking about, and where they’re located—using Python code examples, Jupyter notebooks, or Docker containers.

In part one, each standalone chapter focuses on one aspect of the social landscape, including each of the major social sites, as well as web pages, blogs and feeds, mailboxes, GitHub, and a newly added chapter covering Instagram. Part two provides a cookbook with two dozen bite-size recipes for solving particular issues with Twitter.

  • Get a straightforward synopsis of the social web landscape
  • Use Docker to easily run each chapter’s example code, packaged as a Jupyter notebook
  • Adapt and contribute to the code’s open source GitHub repository
  • Learn how to employ best-in-class Python 3 tools to slice and dice the data you collect
  • Apply advanced mining techniques such as TFIDF, cosine similarity, collocation analysis, clique detection, and image recognition
  • Build beautiful data visualizations with Python and JavaScript toolkits
LanguageEnglish
Release dateDec 4, 2018
ISBN9781491973523
Unavailable
Mining the Social Web: Data Mining Facebook, Twitter, LinkedIn, Instagram, GitHub, and More
Author

Matthew A. Russell

Matthew Russell (@ptwobrussell) is Chief Technology Officer at Built Technologies, where he leads a team of leaders on a mission to improve the way the world is built. Outside of work, he contemplates ultimate reality, practices rugged individualism, and trains for the possibilities of a zombie or robot apocalypse.

Related to Mining the Social Web

Related ebooks

Software Development & Engineering For You

View More

Related articles

Reviews for Mining the Social Web

Rating: 3.9 out of 5 stars
4/5

10 ratings4 reviews

What did you think?

Tap to rate

Review must be at least 10 words

  • Rating: 5 out of 5 stars
    5/5
    Mining the Social Web (2nd Edition) by Matthew A Russell is a book for a relatively small niche. Most users of the social web (Twitter, Facebook, etc) won't have the programming expertise to follow the technical details and most programmers won't be interested in the data which come from the social web. I'm still on the fringes of target audience set, with a long history of involvement in the social web and a decent grasp of Python. I have also recently been wanting to learn how to programmatically extract some data from Twitter; not full-scale mining but certainly a desire to dig below the surface.I haven't tried and tested every snippet of information in the book. However, for the material where I did dig deeper (Twitter) it provided the keys I needed to unlock some of the barriers that had previously obstructed me. It is worth skimming through the whole book to soak in the wider concepts but, if you are interested in one or more of the social web gateways discussed and want to start exploring it programmatically, it gets both thumbs up.
  • Rating: 4 out of 5 stars
    4/5
    The social web is a phenomenon of our times when the web started to reflect our interactions and communications.Who speaks to whom, who says what about what, how many people talk about what. Information that marketers want, the information underlying the altmetrics movement in academia, and it would appear, the various security agencies.Mapping out interactions is not new, the Republic of Letters project did much the same by analysing the correspondence of eighteenth century savants, but it is both the scale of the social web and the complexities of the analyses made possible by cheap processing power.This book covers the major social networks such as Twitter, LinkedIn, Facebook, and Google+, with an emphasis on Twitter. The author also discusses mailbox corpus creation and analysis, and the analysis of semantic web data, and also interestingly, GitHub as a social platform.This book is not a book for the dilettante. More than half the text consists of Python code and the reader really needs to work with the code examples to gain full value from the book. The book also provides a rapid introduction to OAuth, and ranges over topics as diverse as simple text analysis, cluster analysis, natural language processing, and the use of applications such as MongoDB.This is however a very good book for anyone seeking to work with the social web and would serve as a very useful primer or as a textbook for a module on data mining. The code examples are clear and nicely structured, making them easy to follow and work with.
  • Rating: 3 out of 5 stars
    3/5
    This book covers the major social networks. The code is written in Python, which isn't my thing, but it is clear enough that it's easy to figure out what is being done so that the concepts transfer to another language. I myself was only interested in the Twitter content, but it's nice to know that with this book I have a starting reference to other social network platforms if ever I need them.
  • Rating: 5 out of 5 stars
    5/5
    This book uses Python code to demonstrate the techniques of scraping popular social media sites, and then goes on to show what can be done with that data in bulk. There are additional notes about some principles of data mining and web technologies that add detail and value to the book. because it's an e-book, syntax coloring makes the sample code especially readable.