Mastering MongoDB 4.x - Second Edition: Expert techniques to run high-volume and fault-tolerant database solutions using MongoDB 4.x, 2nd Edition

Ebook796 pages4 hours

Mastering MongoDB 4.x - Second Edition: Expert techniques to run high-volume and fault-tolerant database solutions using MongoDB 4.x, 2nd Edition

Name: Mastering MongoDB 4.x - Second Edition: Expert techniques to run high-volume and fault-tolerant database solutions using MongoDB 4.x, 2nd Edition
Author: Alex Giamas
ISBN: 9781789611380

By Alex Giamas

Rating: 0 out of 5 stars

()

Read preview

About this ebook

Leverage the power of MongoDB 4.x to build and administer fault-tolerant database applications

Key Features

Master the new features and capabilities of MongoDB 4.x
Implement advanced data modeling, querying, and administration techniques in MongoDB
Includes rich case-studies and best practices followed by expert MongoDB developers

Book Description

MongoDB is the best platform for working with non-relational data and is considered to be the smartest tool for organizing data in line with business needs. The recently released MongoDB 4.x supports ACID transactions and makes the technology an asset for enterprises across the IT and fintech sectors.

This book provides expertise in advanced and niche areas of managing databases (such as modeling and querying databases) along with various administration techniques in MongoDB, thereby helping you become a successful MongoDB expert. The book helps you understand how the newly added capabilities function with the help of some interesting examples and large datasets. You will dive deeper into niche areas such as high-performance configurations, optimizing SQL statements, configuring large-scale sharded clusters, and many more. You will also master best practices in overcoming database failover, and master recovery and backup procedures for database security.

By the end of the book, you will have gained a practical understanding of administering database applications both on premises and on the cloud; you will also be able to scale database applications across all servers.

What you will learn

Perform advanced querying techniques such as indexing and expressions
Configure, monitor, and maintain a highly scalable MongoDB environment
Master replication and data sharding to optimize read/write performance
Administer MongoDB-based applications on premises or on the cloud
Integrate MongoDB with big data sources to process huge amounts of data
Deploy MongoDB on Kubernetes containers
Use MongoDB in IoT, mobile, and serverless environments

Who this book is for

This book is ideal for MongoDB developers and database administrators who wish to become successful MongoDB experts and build scalable and fault-tolerant applications using MongoDB. It will also be useful for database professionals who wish to become certified MongoDB professionals. Some understanding of MongoDB and basic database concepts is required to get the most out of this book.

Skip carousel

LanguageEnglish

PublisherPackt Publishing

Release dateMar 30, 2019

ISBN9781789611380

Author

Alex Giamas

Related authors

Skip carousel

Related to Mastering MongoDB 4.x - Second Edition

Related ebooks

Skip carousel

Node.JS Guidebook: Comprehensive guide to learn Node.js
Ebook
Node.JS Guidebook: Comprehensive guide to learn Node.js
byDhruti Shah
Rating: 0 out of 5 stars
0 ratings
MongoDB Complete Guide: Develop Strong Understanding of Administering MongoDB, CRUD Operations, MongoDB Commands, MongoDB Compass, MongoDB Server, MongoDB Replication and MongoDB Sharding (English Edition)
Ebook
MongoDB Complete Guide: Develop Strong Understanding of Administering MongoDB, CRUD Operations, MongoDB Commands, MongoDB Compass, MongoDB Server, MongoDB Replication and MongoDB Sharding (English Edition)
byManu Sharma
Rating: 0 out of 5 stars
0 ratings
Mongoose for Application Development
Ebook
Mongoose for Application Development
bySimon Holmes
Rating: 5 out of 5 stars
5/5
MongoDB Cookbook - Second Edition: Modern Database Management Made Easy
Ebook
MongoDB Cookbook - Second Edition: Modern Database Management Made Easy
byCyrus Dasadia
Rating: 0 out of 5 stars
0 ratings
Mastering JavaScript Single Page Application Development
Ebook
Mastering JavaScript Single Page Application Development
byPhilip Klauzinski
Rating: 0 out of 5 stars
0 ratings
Flask By Example: Unleash the full potential of the Flask web framework by creating simple yet powerful web applications
Ebook
Flask By Example: Unleash the full potential of the Flask web framework by creating simple yet powerful web applications
byGareth Dwyer
Rating: 4 out of 5 stars
4/5
Code with Java 21: A practical approach for building robust and efficient applications (English Edition)
Ebook
Code with Java 21: A practical approach for building robust and efficient applications (English Edition)
byAaron Ploetz
Rating: 0 out of 5 stars
0 ratings
Mastering AWS Serverless: Architecting, developing, and deploying serverless solutions on AWS (English Edition)
Ebook
Mastering AWS Serverless: Architecting, developing, and deploying serverless solutions on AWS (English Edition)
byMiguel A. Calles
Rating: 0 out of 5 stars
0 ratings
ElasticSearch Cookbook
Ebook
ElasticSearch Cookbook
byAlberto Paro
Rating: 5 out of 5 stars
5/5
Practical C++ Backend Programming: Crafting Databases, APIs, and Web Servers for High-Performance Backend
Ebook
Practical C++ Backend Programming: Crafting Databases, APIs, and Web Servers for High-Performance Backend
byJustin Barbara
Rating: 0 out of 5 stars
0 ratings
Full Stack iOS Development with Swift and Vapor: Full stack iOS development made easy (English Edition)
Ebook
Full Stack iOS Development with Swift and Vapor: Full stack iOS development made easy (English Edition)
byHem Dutt
Rating: 0 out of 5 stars
0 ratings
Instant MongoDB
Ebook
Instant MongoDB
byAmol Nayak
Rating: 0 out of 5 stars
0 ratings
Node.js Design Patterns - Second Edition
Ebook
Node.js Design Patterns - Second Edition
byMario Casciaro
Rating: 4 out of 5 stars
4/5
Getting Started with React
Ebook
Getting Started with React
byDoel Sengupta
Rating: 0 out of 5 stars
0 ratings
Cross-Platform Modern Apps with VS Code: Combine the power of EF Core, ASP.NET Core and Xamarin.Forms to build multi-platform applications on Visual Studio Code
Ebook
Cross-Platform Modern Apps with VS Code: Combine the power of EF Core, ASP.NET Core and Xamarin.Forms to build multi-platform applications on Visual Studio Code
byOckert J. du Preez
Rating: 0 out of 5 stars
0 ratings
Redis® Deep Dive: Explore Redis - Its Architecture, Data Structures and Modules like Search, JSON, AI, Graph, Timeseries (English Edition)
Ebook
Redis® Deep Dive: Explore Redis - Its Architecture, Data Structures and Modules like Search, JSON, AI, Graph, Timeseries (English Edition)
byChinmay Kulkarni
Rating: 0 out of 5 stars
0 ratings
Exploring Web Components: Build Reusable UI Web Components with Standard Technologies (English Edition)
Ebook
Exploring Web Components: Build Reusable UI Web Components with Standard Technologies (English Edition)
byAndrea Chiarelli
Rating: 0 out of 5 stars
0 ratings
JavaScript and JSON Essentials
Ebook
JavaScript and JSON Essentials
bySai Srinivas Sriparasa
Rating: 5 out of 5 stars
5/5
Node.js: Novice to Ninja
Ebook
Node.js: Novice to Ninja
byCraig Buckler
Rating: 0 out of 5 stars
0 ratings
Learning JavaScript Data Structures and Algorithms - Second Edition
Ebook
Learning JavaScript Data Structures and Algorithms - Second Edition
byLoiane Groner
Rating: 0 out of 5 stars
0 ratings
Object-Oriented JavaScript - Third Edition
Ebook
Object-Oriented JavaScript - Third Edition
byStoyan Stefanov
Rating: 4 out of 5 stars
4/5
Designing Microservices using Django: Structuring, Deploying and Managing the Microservices Architecture with Django
Ebook
Designing Microservices using Django: Structuring, Deploying and Managing the Microservices Architecture with Django
byShayank Jain
Rating: 0 out of 5 stars
0 ratings
Learning Angular for .NET Developers
Ebook
Learning Angular for .NET Developers
byGunasundaram Rajesh
Rating: 0 out of 5 stars
0 ratings
JavaScript for Gurus: Use JavaScript programming features, techniques and modules to solve everyday problems
Ebook
JavaScript for Gurus: Use JavaScript programming features, techniques and modules to solve everyday problems
byOckert J. du Preez
Rating: 0 out of 5 stars
0 ratings
ASP.NET Core 3 and React: Hands-On full stack web development using ASP.NET Core, React, and TypeScript 3
Ebook
ASP.NET Core 3 and React: Hands-On full stack web development using ASP.NET Core, React, and TypeScript 3
byCarl Rippon
Rating: 0 out of 5 stars
0 ratings
Building Server-side and Microservices with Go: Building Modern Backends and Microservices Using Go, Docker and Kubernetes
Ebook
Building Server-side and Microservices with Go: Building Modern Backends and Microservices Using Go, Docker and Kubernetes
byDušan Stojanović
Rating: 0 out of 5 stars
0 ratings
Learn AngularJS in 24 Hours
Ebook
Learn AngularJS in 24 Hours
byAlex Nordeen
Rating: 0 out of 5 stars
0 ratings
Mastering Reactive JavaScript
Ebook
Mastering Reactive JavaScript
byErich de Souza Oliveira
Rating: 0 out of 5 stars
0 ratings
Learning Single-page Web Application Development
Ebook
Learning Single-page Web Application Development
byFernando Monteiro
Rating: 0 out of 5 stars
0 ratings
The Joy of JavaScript
Ebook
The Joy of JavaScript
byLuis Atencio
Rating: 4 out of 5 stars
4/5

Enterprise Applications For You

Skip carousel

QuickBooks Online For Dummies, 2025 Edition
Ebook
QuickBooks Online For Dummies, 2025 Edition
byDavid H. Ringstrom
Rating: 5 out of 5 stars
5/5
Scrivener Superpowers
Ebook
Scrivener Superpowers
byM. G. Herron
Rating: 0 out of 5 stars
0 ratings
Excel All-in-One For Dummies
Ebook
Excel All-in-One For Dummies
byPaul McFedries
Rating: 0 out of 5 stars
0 ratings
QuickBooks 2023 All-in-One For Dummies
Ebook
QuickBooks 2023 All-in-One For Dummies
byStephen L. Nelson
Rating: 0 out of 5 stars
0 ratings
Excel Dashboards & Reports For Dummies
Ebook
Excel Dashboards & Reports For Dummies
byMichael Alexander
Rating: 4 out of 5 stars
4/5
Mastering ChatGPT: Create Highly Effective Prompts, Strategies, and Best Practices to Go From Novice to Expert
Ebook
Mastering ChatGPT: Create Highly Effective Prompts, Strategies, and Best Practices to Go From Novice to Expert
byTJ Books
Rating: 3 out of 5 stars
3/5
Some Future Day: How AI Is Going to Change Everything
Ebook
Some Future Day: How AI Is Going to Change Everything
byMarc Beckman
Rating: 0 out of 5 stars
0 ratings
Bitcoin For Dummies
Ebook
Bitcoin For Dummies
byPrypto
Rating: 4 out of 5 stars
4/5
Microsoft Excel 365 Bible
Ebook
Microsoft Excel 365 Bible
byMichael Alexander
Rating: 0 out of 5 stars
0 ratings
Excel VBA Programming For Dummies
Ebook
Excel VBA Programming For Dummies
byJohn Walkenbach
Rating: 4 out of 5 stars
4/5
QuickBooks 2024 All-in-One For Dummies
Ebook
QuickBooks 2024 All-in-One For Dummies
byStephen L. Nelson
Rating: 0 out of 5 stars
0 ratings
Microsoft Office 365 Bible: 10:1 Mastery | Excel in Your Profession, Enhance Time Management, and Foster Exceptional Collaboration [III EDITION]
Ebook
Microsoft Office 365 Bible: 10:1 Mastery | Excel in Your Profession, Enhance Time Management, and Foster Exceptional Collaboration [III EDITION]
byKevin Pitch
Rating: 5 out of 5 stars
5/5
CompTIA Project+ Study Guide: Exam PK0-005
Ebook
CompTIA Project+ Study Guide: Exam PK0-005
byKim Heldman
Rating: 0 out of 5 stars
0 ratings
QuickBooks 2021 For Dummies
Ebook
QuickBooks 2021 For Dummies
byStephen L. Nelson
Rating: 0 out of 5 stars
0 ratings
Dragon NaturallySpeaking For Dummies
Ebook
Dragon NaturallySpeaking For Dummies
byStephanie Diamond
Rating: 0 out of 5 stars
0 ratings
Excel 2021
Ebook
Excel 2021
byJIAYI SIMONDS
Rating: 4 out of 5 stars
4/5
Scrivener For Dummies
Ebook
Scrivener For Dummies
byGwen Hernandez
Rating: 4 out of 5 stars
4/5
QuickBooks Online For Dummies
Ebook
QuickBooks Online For Dummies
byDavid H. Ringstrom
Rating: 0 out of 5 stars
0 ratings
Salesforce.com For Dummies
Ebook
Salesforce.com For Dummies
byLiz Kao
Rating: 3 out of 5 stars
3/5
Microsoft Access Guide to Success: From Fundamentals to Mastery in Crafting Databases, Optimizing Tasks, & Making Unparalleled Impressions [III EDITION]
Ebook
Microsoft Access Guide to Success: From Fundamentals to Mastery in Crafting Databases, Optimizing Tasks, & Making Unparalleled Impressions [III EDITION]
byKevin Pitch
Rating: 5 out of 5 stars
5/5
Learning Microsoft Endpoint Manager: Unified Endpoint Management with Intune and the Enterprise Mobility + Security Suite
Ebook
Learning Microsoft Endpoint Manager: Unified Endpoint Management with Intune and the Enterprise Mobility + Security Suite
byScott Duffey
Rating: 0 out of 5 stars
0 ratings
Microsoft Copilot For Dummies
Ebook
Microsoft Copilot For Dummies
byChris Minnick
Rating: 0 out of 5 stars
0 ratings
Learning AWS
Ebook
Learning AWS
byAurobindo Sarkar
Rating: 4 out of 5 stars
4/5
Excel Macros For Dummies
Ebook
Excel Macros For Dummies
byMichael Alexander
Rating: 3 out of 5 stars
3/5
Enterprise AI For Dummies
Ebook
Enterprise AI For Dummies
byZachary Jarvinen
Rating: 3 out of 5 stars
3/5
Microsoft Office 365 Non-Tech Savvy Bible: [10 in 1]: From Beginner to Pro with This Fully Illustrated Guide– Word, Excel, PowerPoint, Access, Outlook, Skype, Teams, OneDrive, OneNote, and Publisher
Ebook
Microsoft Office 365 Non-Tech Savvy Bible: [10 in 1]: From Beginner to Pro with This Fully Illustrated Guide– Word, Excel, PowerPoint, Access, Outlook, Skype, Teams, OneDrive, OneNote, and Publisher
byAlan Norwood
Rating: 0 out of 5 stars
0 ratings
Microsoft 365 Excel For Dummies
Ebook
Microsoft 365 Excel For Dummies
byDavid H. Ringstrom
Rating: 0 out of 5 stars
0 ratings
Microsoft Power Platform A Deep Dive: Dig into Power Apps, Power Automate, Power BI, and Power Virtual Agents (English Edition)
Ebook
Microsoft Power Platform A Deep Dive: Dig into Power Apps, Power Automate, Power BI, and Power Virtual Agents (English Edition)
byBijay Kumar Sahoo
Rating: 0 out of 5 stars
0 ratings
SQL 101 Crash Course: Comprehensive Guide to SQL Fundamentals and Practical Applications
Ebook
SQL 101 Crash Course: Comprehensive Guide to SQL Fundamentals and Practical Applications
byEmrys Callahan
Rating: 5 out of 5 stars
5/5
Agile Project Management: Scrum for Beginners
Ebook
Agile Project Management: Scrum for Beginners
byMarkus Heimrath
Rating: 4 out of 5 stars
4/5

Related categories

Skip carousel

Rating: 0 out of 5 stars

0 ratings

0 ratings0 reviews

Book preview

Mastering MongoDB 4.x - Second Edition - Alex Giamas

Mastering MongoDB 4.0, Second Edition

Mastering MongoDB 4.x

Second Edition

Expert techniques to run high-volume and fault-tolerant database solutions using MongoDB 4.x

Alex Giamas

BIRMINGHAM - MUMBAI

Mastering MongoDB 4.x Second Edition

All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing or its dealers and distributors, will be held liable for any damages caused or alleged to have been caused directly or indirectly by this book.

Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

Commissioning Editor: Amey Varangaonkar

Acquisition Editor: Porous Godhaa

Content Development Editor: Ronnel Mathew

Technical Editor: Suwarna Patil

Copy Editor: Safis Editing

Project Coordinator: Namrata Swetta

Proofreader: Safis Editing

Indexer: Rekha Nair

Graphics: Tom Scaria

Production Coordinator: Deepika Naik

First published: November 2017

Second edition: March 2019

Production reference: 1290319

Published by Packt Publishing Ltd.

Livery Place

35 Livery Street

Birmingham

B3 2PB, UK.

ISBN 978-1-78961-787-0

www.packtpub.com

mapt.io

Mapt is an online digital library that gives you full access to over 5,000 books and videos, as well as industry leading tools to help you plan your personal development and advance your career. For more information, please visit our website.

Why subscribe?

Spend less time learning and more time coding with practical eBooks and Videos from over 4,000 industry professionals

Improve your learning with Skill Plans built especially for you

Get a free eBook or video every month

Mapt is fully searchable

Copy and paste, print, and bookmark content

Packt.com

Did you know that Packt offers eBook versions of every book published, with PDF and ePub files available? You can upgrade to the eBook version at www.packt.com and as a print book customer, you are entitled to a discount on the eBook copy. Get in touch with us at customercare@packtpub.com for more details.

At www.packt.com, you can also read a collection of free technical articles, sign up for a range of free newsletters, and receive exclusive discounts and offers on Packt books and eBooks.

Contributors

About the author

Alex Giamas is a consultant and a hands-on technical architect at the Department for International Trade in the UK Government. His experience spans software architecture and development using NoSQL and big data technologies. For more than 15 years, he has contributed to Fortune 15 companies and has worked as a start-up CTO.

He is the author of Mastering MongoDB 3.x, published by Packt Publishing, which is based on his use of MongoDB since 2009.

Alex has worked with a wide array of NoSQL and big data technologies, building scalable and highly available distributed software systems in Python, Java, and Ruby. He is a MongoDB Certified Developer, a Cloudera Certified Developer for Apache Hadoop and Data Science Essentials.

Writing a book is harder than I thought, but more rewarding than I could have ever imagined. I would like to thank my wife, Mary, for her love and constant support, day and night, and for being my editor and sounding board, my muse and my anchor in life.

Many thanks to my parents for all the support and the amazing chances they've given me over the years, without which I would never have been who I am today.

About the reviewers

Doug Bierer wrote his first program on a Digital Equipment Corporation PDP-8 in 1971. Since that time, he's written buckets of code in lots of different programming languages, including BASIC, C, Assembler, PL/I, FORTRAN, Prolog, FORTH, Java, PHP, Perl, and Python. He did networking for 10 years. The largest network he worked on was based in Brussels and had 35,000 nodes. Doug is certified in PHP 5.1, 5.3, 5.5, and 7.1, and Zend Framework 1 and 2. He has authored a bunch of books and videos for O'Reilly/Packt Publishing on PHP, security, and MongoDB. His most current authoring project is a book called Learn MongoDB 4.0, scheduled to be published by Packt Publishing in September 2019. He founded unlikelysource(dot)com in April 2008.

Sumit Sengupta has worked on several RDBMS and NoSQL databases, such as Oracle, SQL Server, Postgres, MongoDB, and Cassandra. A former employee of MongoDB, he has designed, architected, and managed many distributed and big data solutions, both on-premise and on AWS and Azure. Currently, he works on the Azure Data and AI Platform for Microsoft, helping partners to create innovative solutions based on data.

Packt is searching for authors like you

If you're interested in becoming an author for Packt, please visit authors.packtpub.com and apply today. We have worked with thousands of developers and tech professionals, just like you, to help them share their insight with the global tech community. You can make a general application, apply for a specific hot topic that we are recruiting an author for, or submit your own idea.

Title Page

Copyright and Credits

Mastering MongoDB 4.x Second Edition

About Packt

Why subscribe?

Packt.com

Contributors

About the author

About the reviewers

Packt is searching for authors like you

Preface

Who this book is for

What this book covers

To get the most out of this book

Download the example code files

Conventions used

Get in touch

Reviews

Section 1: Basic MongoDB – Design Goals and Architecture

MongoDB – A Database for Modern Web

Technical requirements

The evolution of SQL and NoSQL

The evolution of MongoDB

Major feature set for versions 1.0 and 1.2

Version 2

Version 3

Version 4

MongoDB for SQL developers

MongoDB for NoSQL developers

MongoDB's key characteristics and use cases

Key characteristics

Use cases for MongoDB

MongoDB criticism

MongoDB configuration and best practices

Operational best practices

Schema design best practices

Best practices for write durability

Best practices for replication

Best practices for sharding

Best practices for security

Best practices for AWS

Reference documentation

MongoDB documentation

Packt references

Further reading

Summary

Schema Design and Data Modeling

Relational schema design

MongoDB schema design

Read-write ratio

Data modeling

Data types

Comparing different data types

Date type

ObjectId

Modeling data for atomic operations

Write isolation

Read isolation and consistency

Modeling relationships

One-to-one

One-to-many and many-to-many

Modeling data for keyword searches

Connecting to MongoDB

Connecting using Ruby

Mongoid ODM

Inheritance with Mongoid models

Connecting using Python

PyMODM ODM

Inheritance with PyMODM models

Connecting using PHP

Doctrine ODM

Inheritance with Doctrine

Summary

Section 2: Querying Effectively

MongoDB CRUD Operations

CRUD using the shell

Scripting for the mongo shell

The differences between scripting for the mongo shell and using it directly

Batch inserts using the shell

Batch operations using the mongo shell

Administration

fsync

compact

currentOp and killOp

collMod

touch

MapReduce in the mongo shell

MapReduce concurrency

Incremental MapReduce

Troubleshooting MapReduce

Aggregation framework

SQL to aggregation

Aggregation versus MapReduce

Securing the shell

Authentication and authorization

Authorization with MongoDB

Security tips for MongoDB

Encrypting communication using TLS/SSL

Encrypting data

Limiting network exposure

Firewalls and VPNs

Auditing

Using secure configuration options

Authentication with MongoDB

Enterprise Edition

Kerberos authentication

LDAP authentication

Summary

Advanced Querying

MongoDB CRUD operations

CRUD using the Ruby driver

Creating documents

Read

Chaining operations in find()

Nested operations

Update

Delete

Batch operations

CRUD in Mongoid

Read

Scoping queries

Create, update, and delete

CRUD using the Python driver

Creating and deleting

Finding documents

Updating documents

CRUD using PyMODM

Creating documents

Updating documents

Deleting documents

Querying documents

CRUD using the PHP driver

Creating and deleting

BulkWrite

Read

Updating documents

CRUD using Doctrine

Creating, updating, and deleting

Read

Best practices

Comparison operators

Update operators

Smart querying

Using regular expressions

Querying results and cursors

Storage considerations for the delete operation

Change streams

Introduction

Setup

Using change streams

Specification

Important notes

Production recommendations

Replica sets

Sharded clusters

Summary

Multi-Document ACID Transactions

Background

ACID

Atomicity

Consistency

Isolation

Phantom reads

Non-repeatable reads

Dirty reads

Durability

When do we need ACID in MongoDB ?

Building a digital bank using MongoDB

Setting up our data

Transferring between accounts – part 1

Transferring between accounts – part 2

Transferring between accounts – part 3

E-commerce using MongoDB

The best practices and limitations of multi-document ACID transactions

Summary

Aggregation

Why aggregation?

Aggregation operators

Aggregation stage operators

Expression operators

Expression Boolean operators

Expression comparison operators

Set expression and array operators

Expression date operators

Expression string operators

Expression arithmetic operators

Aggregation accumulators

Conditional expressions

Type conversion operators

Other operators

Text search

Variable

Literal

Parsing data type

Limitations

Aggregation use case

Summary

Indexing

Index internals

Index types

Single field indexes

Dropping indexes

Indexing embedded fields

Indexing embedded documents

Background indexes

Compound indexes

Sorting with compound indexes

Reusing compound indexes

Multikey indexes

Special types of indexes

Text indexes

Hashed indexes

Time to live indexes

Partial indexes

Sparse indexes

Unique indexes

Case-insensitive

Geospatial indexes

2d geospatial indexes

2dsphere geospatial indexes

geoHaystack indexes

Building and managing indexes

Forcing index usage

Hint and sparse indexes

Building indexes on replica sets

Managing indexes

Naming indexes

Special considerations

Using indexes efficiently

Measuring performance

Improving performance

Index intersection

Further reading

Summary

Section 3: Administration and Data Management

Monitoring, Backup, and Security

Monitoring

What should we monitor?

Page faults

Resident memory

Virtual and mapped memory

Working sets

Monitoring memory usage in WiredTiger

Tracking page faults

Tracking B-tree misses

I/O wait

Read and write queues

Lock percentage

Background flushes

Tracking free space

Monitoring replication

Oplog size

Working set calculations

Monitoring tools

Hosted tools

Open source tools

Backups

Backup options

Cloud-based solutions

Backups with filesystem snapshots

Making a backup of a sharded cluster

Making backups using mongodump

Backing up by copying raw files

Making backups using queuing

EC2 backup and restore

Incremental backups

Security

Authentication

Authorization

User roles

Database administration roles

Cluster administration roles

Backup and restore roles

Roles across all databases

Superuser

Network-level security

Auditing security

Special cases

Overview

Summary

Storage Engines

Pluggable storage engines

WiredTiger

Document-level locking

Snapshots and checkpoints

Journaling

Data compression

Memory usage

readConcern

WiredTiger collection-level options

WiredTiger performance strategies

WiredTiger B-tree versus LSM indexes

Encrypted

In-memory

MMAPv1

MMAPv1 storage optimization

Mixed usage

Other storage engines

RocksDB

TokuMX

Locking in MongoDB

Lock reporting

Lock yield

Commonly used commands and locks

Commands requiring a database lock

Further reading

Summary

MongoDB Tooling

Introduction

MongoDB Atlas

Creating a new cluster

Important notes

MongoDB Cloud Manager

MongoDB Ops Manager

MongoDB Charts

MongoDB Compass

MongoDB Connector for Business Intelligence (BI)

An introduction to Kubernetes

Enterprise Kubernetes Operator

MongoDB Mobile

MongoDB Stitch

QueryAnywhere

Rules

Functions

Triggers

Mobile Sync

Summary

Harnessing Big Data with MongoDB

What is big data?

The big data landscape

Message queuing systems

Apache ActiveMQ

RabbitMQ

Apache Kafka

Data warehousing

Apache Hadoop

Apache Spark

Comparing Spark with Hadoop MapReduce

MongoDB as a data warehouse

A big data use case

Setting up Kafka

Setting up Hadoop

Steps for Hadoop setup

Using a Hadoop to MongoDB pipeline

Setting up Spark to MongoDB

Further reading

Summary

Section 4: Scaling and High Availability

Replication

Logical or physical replication

Different high availability types

An architectural overview

How do elections work?

What is the use case for a replica set?

Setting up a replica set

Converting a standalone server into a replica set

Creating a replica set

Read preference

Write concern

Custom write concerns

Priority settings for replica set members

Zero priority replica set members

Hidden replica set members

Delayed replica set members

Production considerations

Connecting to a replica set

Replica set administration

How to perform maintenance on replica sets

Re-syncing a member of a replica set

Changing the oplog's size

Reconfiguring a replica set when we have lost the majority of our servers

Chained replication

Cloud options for a replica set

mLab

MongoDB Atlas

Replica set limitations

Summary

Sharding

Why do we use sharding?

Architectural overview

Development, continuous deployment, and staging environments

Planning ahead with sharding

Sharding setup

Choosing the shard key

Changing the shard key

Choosing the correct shard key

Range-based sharding

Hash-based sharding

Coming up with our own key

Location-based data

Sharding administration and monitoring

Balancing data – how to track and keep our data balanced

Chunk administration

Moving chunks

Changing the default chunk size

Jumbo chunks

Merging chunks

Adding and removing shards

Sharding limitations

Querying sharded data

The query router

Find

Sort/limit/skip

Update/remove

Querying using Ruby

Performance comparison with replica sets

Sharding recovery

mongos

mongod

Config server

A shard goes down

The entire cluster goes down

Further reading

Summary

Fault Tolerance and High Availability

Application design

Schema-less doesn't mean schema design-less

Read performance optimization

Consolidating read querying

Defensive coding

Monitoring integrations

Operations

Security

Enabling security by default

Isolating our servers

Checklists

Further reading

Summary

Other Books You May Enjoy

Leave a review - let other readers know what you think

Preface

MongoDB has grown to become the de facto NoSQL database with millions of users, from small start-ups to Fortune 500 companies. Addressing the limitations of SQL schema-based databases, MongoDB pioneered a shift of focus for DevOps and offered sharding and replication that can be maintained by DevOps teams. This book is based on MongoDB 4.0 and covers topics ranging from database querying using the shell, built-in drivers, and popular ODM mappers, to more advanced topics such as sharding, high availability, and integration with big data sources.

You will get an overview of MongoDB and will learn how to play to its strengths, with relevant use cases. After that, you will learn how to query MongoDB effectively and make use of indexes as much as possible. The next part deals with the administration of MongoDB installations, whether on-premises or on the cloud. We deal with database internals in the following section, explaining storage systems and how they can affect performance. The last section of this book deals with replication and MongoDB scaling, along with integration with heterogeneous data sources. By the end this book, you will be equipped with all the required industry skills and knowledge to become a certified MongoDB developer and administrator.

Who this book is for

Mastering MongoDB 4.0 is a book for database developers, architects, and administrators who want to learn how to use MongoDB more effectively and productively. If you have experience with, and are interested in working with, NoSQL databases to build apps and websites, then this book is for you.

What this book covers

Chapter 1, MongoDB – A Database for Modern Web, takes us on a journey through web, SQL, and NoSQL technologies, from their inception to their current states.

Chapter 2, Schema Design and Data Modeling, teaches you schema design for relational databases and MongoDB, and how we can achieve the same goal from a different starting point.

Chapter 3, MongoDB CRUD Operations, provides a bird's-eye view of CRUD operations.

Chapter 4, Advanced Querying, covers advanced querying concepts using Ruby, Python, and PHP, using both the official drivers and an ODM.

Chapter 5, Multi-Document ACID Transactions, explores transactions following ACID characteristics, which is a new functionality introduced in MongoDB 4.0.

Chapter 6, Aggregation, dives deep into the aggregation framework. We also discuss when and why we should use aggregation, as opposed to MapReduce and querying the database.

Chapter 7, Indexing, explores one of the most important properties of every database, which is indexing.

Chapter 8, Monitoring, Backup, and Security, discusses the operational aspects of MongoDB. Monitoring, backup, and security should not be an afterthought, but rather are necessary processes that need to be taken care of before deploying MongoDB in a production environment.

Chapter 9, Storage Engines, teaches you about the different storage engines in MongoDB. We identify the pros and cons of each one, and the use cases for choosing each storage engine.

Chapter 10, MongoDB Tooling, covers all the different tools, both on-premises and in the cloud, that we can utilize in the MongoDB ecosystem.

Chapter 11, Harnessing Big Data with MongoDB, provides more detail on how MongoDB fits into the wider big data landscape and ecosystem.

Chapter 12, Replication, discusses replica sets and how to administer them. Starting from an architectural overview of replica sets and replica set internals around elections, we dive deep into setting up and configuring a replica set.

Chapter 13, Sharding, explores sharding, one of the most interesting features of MongoDB. We start with an architectural overview of sharding and move on to discuss how to design a shard, and especially, how to choose the right shard key.

Chapter 14, Fault Tolerance and High Availability, tries to fit in the information that we didn't manage to discuss in the previous chapters, and places emphasis on security and a series of checklists that developers and DBAs should keep into mind.

To get the most out of this book

You will need the following software to be able to smoothly sail through the chapters:

MongoDB version 4+

Apache Kafka version 1

Apache Spark version 2+

Apache Hadoop version 2+

Download the example code files

You can download the example code files for this book from your account at www.packt.com. If you purchased this book elsewhere, you can visit www.packt.com/support and register to have the files emailed directly to you.

You can download the code files by following these steps:

Select the SUPPORT tab.

Click on Code Downloads & Errata.

Enter the name of the book in the Search box and follow the onscreen instructions.

Once the file is downloaded, please make sure that you unzip or extract the folder using the latest version of:

WinRAR/7-Zip for Windows

Zipeg/iZip/UnRarX for Mac

7-Zip/PeaZip for Linux

The code bundle for the book is also hosted on GitHub at https://github.com/PacktPublishing/Mastering-MongoDB-4.x-Second-Edition. In case there's an update to the code, it will be updated on the existing GitHub repository.

We also have other code bundles from our rich catalog of books and videos available at https://github.com/PacktPublishing/. Check them out!

Conventions used

In this book, you will find a number of text styles that distinguish between different kinds of information. Here are some examples of these styles and an explanation of their meaning.

Code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles are shown as follows: In a sharded environment, each mongod applies its own locks, thus greatly improving concurrency.

A block of code is set as follows:

db.account.find( { balance : { $type : 16 } } );

db.account.find( { balance : { $type : integer } } );

Any command-line input or output is written as follows:

> db.types.insert({a:4})

WriteResult({ nInserted : 1 })

Bold: Indicates a new term, an important word, or words that you see onscreen. For example, words in menus or dialog boxes appear in the text like this. Here is an example: The following screenshot shows the Zone configuration summary:

Warnings or important notes appear like this.

Tips and tricks appear like this.

Get in touch

Feedback from our readers is always welcome.

General feedback: If you have questions about any aspect of this book, mention the book title in the subject of your message and email us at customercare@packtpub.com.

Errata: Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you have found a mistake in this book, we would be grateful if you would report this to us. Please visit www.packt.com/submit-errata, selecting your book, clicking on the Errata Submission Form link, and entering the details.

Piracy: If you come across any illegal copies of our works in any form on the Internet, we would be grateful if you would provide us with the location address or website name. Please contact us at copyright@packt.com with a link to the material.

If you are interested in becoming an author: If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, please visit authors.packtpub.com.

Reviews

Please leave a review. Once you have read and used this book, why not leave a review on the site that you purchased it from? Potential readers can then see and use your unbiased opinion to make purchase decisions, we at Packt can understand what you think about our products, and our authors can see your feedback on their book. Thank you!

For more information about Packt, please visit packt.com.

Section 1: Basic MongoDB – Design Goals and Architecture

In this section, we will go through the history of databases and how we arrived at the need for non-relational databases. We will also learn how to model our data so that storage and retrieval from MongoDB can be as efficient as possible. Even though MongoDB is schemaless, designing how data will be organized into documents can have a great effect in terms of performance.

This section consists of the following chapters:

Chapter 1, MongoDB – A Database for Modern Web

Chapter 2, Schema Design and Data Modeling

MongoDB – A Database for Modern Web

In this chapter, we will lay the foundations for understanding MongoDB, and how it claims to be a database that's designed for the modern web. Learning in the first place is as important as knowing how to learn. We will go through the references that have the most up-to-date information about MongoDB, for both new and experienced users. We will cover the following topics:

SQL and MongoDB's history and evolution

MongoDB from the perspective of SQL and other NoSQL technology users

MongoDB's common use cases and why they matter

MongoDB's configuration and best practices

Technical requirements

You will require MongoDB version 4+, Apache Kafka, Apache Spark and Apache Hadoop installed to smoothly sail through the chapter. The codes that have been used for all the chapters can be found at: https://github.com/PacktPublishing/Mastering-MongoDB-4.x-Second-Edition.

The evolution of SQL and NoSQL

Structured Query Language (SQL) existed even before the WWW. Dr. E. F. Codd originally published the paper, A Relational Model of Data for Large Shared Data Banks, in June, 1970, in the Association of Computer Machinery (ACM) journal, Communications of the ACM. SQL was initially developed at IBM by Chamberlin and Boyce, in 1974. Relational Software (now Oracle Corporation) was the first to develop a commercially available implementation of SQL, targeted at United States governmental agencies.

The first American National Standards Institute (ANSI) SQL standard came out in 1986. Since then, there have been eight revisions, with the most recent being published in 2016 (SQL:2016).

SQL was not particularly popular at the start of the WWW. Static content could just be hardcoded into the HTML page without much fuss. However, as the functionality of websites grew, webmasters wanted to generate web page content driven by offline data sources, in order to generate content that could change over time without redeploying code.

Common Gateway Interface (CGI) scripts, developing Perl or Unix shells, were driving early database-driven websites in Web 1.0. With Web 2.0, the web evolved from directly injecting SQL results into the browser to using two-tier and three-tier architectures that separated views from the business and model logic, allowing for SQL queries to be modular and isolated from the rest of the web application.

On the other hand, Not only SQL (NoSQL) is much more modern and supervened web evolution, rising at the same time as Web 2.0 technologies. The term was first coined by Carlo Strozzi in 1998, for his open source database that did not follow the SQL standard, but was still relational.

This is not what we currently expect from a NoSQL database. Johan Oskarsson, a developer at Last.fm at the time, reintroduced the term in early 2009, in order to group a set of distributed, non-relational data stores that were being developed. Many of them were based on Google's Bigtable and MapReduce papers, or Amazon's DynamoDB, a highly available key-value based storage system.

NoSQL's foundations grew upon relaxed atomicity, consistency, isolation, and durability (ACID) properties, which guarantee the performance, scalability, flexibility, and reduced complexity. Most NoSQL databases have gone one way or another in providing as many of the previously mentioned qualities as possible, even offering adjustable guarantees to the developer. The following diagram describes the evolution of SQL and NoSQL:

The evolution of MongoDB

10gen started to develop a cloud computing stack in 2007 and soon realized that the most important innovation was centered around the document-oriented database that they built to power it, which was MongoDB. MongoDB was initially released on August 27, 2009.

Version 1 of MongoDB was pretty basic in terms of features, authorization, and ACID guarantees but it made up for these shortcomings with performance and flexibility.

In the following sections, we will highlight the major features of MongoDB, along with the version numbers with which they were introduced.

Major feature set for versions 1.0 and 1.2

The different features of versions 1.0 and 1.2 are as follows:

Document-based model

Global lock (process level)

Indexes on collections

CRUD operations on documents

No authentication (authentication was handled at the server level)

Master and slave replication

MapReduce (introduced in v1.2)

Stored JavaScript functions (introduced in v1.2)

Version 2

The different features of version 2.0 are as follows:

Background index creation (since v1.4)

Sharding (since v1.6)

More query operators (since v1.6)

Journaling (since v1.8)

Sparse and covered indexes (since v1.8)

Compact commands to reduce disk usage

Memory usage more efficient

Concurrency improvements

Index performance enhancements

Replica sets are now more configurable and data center aware

MapReduce improvements

Authentication (since 2.0, for sharding and most database commands)

Geospatial features introduced

Aggregation framework (since v2.2) and enhancements (since v2.6)

TTL collections (since v2.2)

Concurrency improvements, among which is DB-level locking (since v2.2)

Text searching (since v2.4) and integration (since v2.6)

Hashed indexes (since v2.4)

Security enhancements and role-based access (since v2.4)

V8 JavaScript engine instead of SpiderMonkey (since v2.4)

Query engine improvements (since v2.6)

Pluggable storage engine API

WiredTiger storage engine introduced, with document-level locking, while previous storage engine (now called MMAPv1) supports collection-level locking

Version 3

The different features of version 3.0 are as follows:

Replication and sharding enhancements (since v3.2)

Document validation (since v3.2)

Aggregation framework enhanced operations (since v3.2)

Multiple storage engines (since v3.2, only in Enterprise Edition)

Query language and indexes collation (since v3.4)

Read-only database views (since v3.4)

Linearizable read concern (since v3.4)

Version 4

The different features of version 4.0 are as follows:

Multi-document ACID transactions

Change streams

MongoDB tools (Stitch, Mobile, Sync, and Kubernetes Operator)

The following diagram shows MongoDB's evolution:

As we can observe, version 1 was pretty basic, whereas version 2 introduced most of the features present in the current version, such as sharding, usable and special indexes, geospatial features, and memory and concurrency improvements.

On the way from version 2 to version 3, the aggregation framework was introduced, mainly as a supplement to the ageing (and never up to par with dedicated frameworks, such as Hadoop) MapReduce framework. Then, text search was added, and slowly but surely, the framework was improving performance, stability, and security, to adapt to the increasing enterprise load of customers using MongoDB.

With WiredTiger's introduction in version 3, locking became much less of an issue for MongoDB, as it was brought down from the process (global lock) to the document level, almost the most granular level possible.

Version 4 marked a major transition, bridging the SQL and NoSQL world with the introduction of multi-document ACID transactions. This allowed for a wider range of applications

Enjoying the preview?

Page 1 of 1

Mastering MongoDB 4.x - Second Edition: Expert techniques to run high-volume and fault-tolerant database solutions using MongoDB 4.x, 2nd Edition

About this ebook

Alex Giamas

Related authors

Related to Mastering MongoDB 4.x - Second Edition

Related ebooks

Node.JS Guidebook: Comprehensive guide to learn Node.js

MongoDB Complete Guide: Develop Strong Understanding of Administering MongoDB, CRUD Operations, MongoDB Commands, MongoDB Compass, MongoDB Server, MongoDB Replication and MongoDB Sharding (English Edition)

Mongoose for Application Development

MongoDB Cookbook - Second Edition: Modern Database Management Made Easy

Mastering JavaScript Single Page Application Development

Flask By Example: Unleash the full potential of the Flask web framework by creating simple yet powerful web applications

Code with Java 21: A practical approach for building robust and efficient applications (English Edition)

Mastering AWS Serverless: Architecting, developing, and deploying serverless solutions on AWS (English Edition)

ElasticSearch Cookbook

Practical C++ Backend Programming: Crafting Databases, APIs, and Web Servers for High-Performance Backend

Full Stack iOS Development with Swift and Vapor: Full stack iOS development made easy (English Edition)

Instant MongoDB

Node.js Design Patterns - Second Edition

Getting Started with React

Cross-Platform Modern Apps with VS Code: Combine the power of EF Core, ASP.NET Core and Xamarin.Forms to build multi-platform applications on Visual Studio Code

Redis® Deep Dive: Explore Redis - Its Architecture, Data Structures and Modules like Search, JSON, AI, Graph, Timeseries (English Edition)

Exploring Web Components: Build Reusable UI Web Components with Standard Technologies (English Edition)

JavaScript and JSON Essentials

Node.js: Novice to Ninja

Learning JavaScript Data Structures and Algorithms - Second Edition

Object-Oriented JavaScript - Third Edition

Designing Microservices using Django: Structuring, Deploying and Managing the Microservices Architecture with Django

Learning Angular for .NET Developers

JavaScript for Gurus: Use JavaScript programming features, techniques and modules to solve everyday problems

ASP.NET Core 3 and React: Hands-On full stack web development using ASP.NET Core, React, and TypeScript 3

Building Server-side and Microservices with Go: Building Modern Backends and Microservices Using Go, Docker and Kubernetes

Learn AngularJS in 24 Hours

Mastering Reactive JavaScript

Learning Single-page Web Application Development

The Joy of JavaScript

Enterprise Applications For You

QuickBooks Online For Dummies, 2025 Edition

Scrivener Superpowers

Excel All-in-One For Dummies

QuickBooks 2023 All-in-One For Dummies

Excel Dashboards & Reports For Dummies

Mastering ChatGPT: Create Highly Effective Prompts, Strategies, and Best Practices to Go From Novice to Expert

Some Future Day: How AI Is Going to Change Everything

Bitcoin For Dummies

Microsoft Excel 365 Bible

Excel VBA Programming For Dummies

QuickBooks 2024 All-in-One For Dummies

Microsoft Office 365 Bible: 10:1 Mastery | Excel in Your Profession, Enhance Time Management, and Foster Exceptional Collaboration [III EDITION]

CompTIA Project+ Study Guide: Exam PK0-005

QuickBooks 2021 For Dummies

Dragon NaturallySpeaking For Dummies

Excel 2021

Scrivener For Dummies

QuickBooks Online For Dummies

Salesforce.com For Dummies

Microsoft Access Guide to Success: From Fundamentals to Mastery in Crafting Databases, Optimizing Tasks, & Making Unparalleled Impressions [III EDITION]

Learning Microsoft Endpoint Manager: Unified Endpoint Management with Intune and the Enterprise Mobility + Security Suite

Microsoft Copilot For Dummies

Learning AWS

Excel Macros For Dummies

Enterprise AI For Dummies

Microsoft Office 365 Non-Tech Savvy Bible: [10 in 1]: From Beginner to Pro with This Fully Illustrated Guide– Word, Excel, PowerPoint, Access, Outlook, Skype, Teams, OneDrive, OneNote, and Publisher

Microsoft 365 Excel For Dummies

Microsoft Power Platform A Deep Dive: Dig into Power Apps, Power Automate, Power BI, and Power Virtual Agents (English Edition)

SQL 101 Crash Course: Comprehensive Guide to SQL Fundamentals and Practical Applications

Agile Project Management: Scrum for Beginners

Related categories

What did you think?

Book preview

Mastering MongoDB 4.x - Second Edition - Alex Giamas

Mastering MongoDB 4.x Second Edition

Why subscribe?

Packt.com

Contributors

About the author

About the reviewers

Packt is searching for authors like you

Table of Contents

Preface