Expert SQL Server Transactions and Locking: Concurrency Internals for SQL Server Practitioners
Ebook · 482 pages · 3 hours


About this ebook

Master SQL Server’s Concurrency Model so you can implement high-throughput systems that deliver transactional consistency to your application customers. This book explains how to troubleshoot and address blocking problems and deadlocks, and write code and design database schemas to minimize concurrency issues in the systems you develop.
SQL Server’s Concurrency Model is one of the least understood parts of the SQL Server Database Engine. Almost every SQL Server system experiences hard-to-explain concurrency and blocking issues, and it can be extremely confusing to solve those issues without a base of knowledge in the internals of the Engine. While confusing from the outside, the SQL Server Concurrency Model is based on several well-defined principles that are covered in this book.
Understanding the internals surrounding SQL Server’s Concurrency Model helps you build high-throughput systems in multi-user environments. This book guides you through the Concurrency Model and explains how SQL Server supports transactional consistency in databases. The book covers all versions of SQL Server, including Microsoft Azure SQL Database, and it includes coverage of new technologies such as In-Memory OLTP and Columnstore Indexes.

What You'll Learn
  • Know how transaction isolation levels affect locking behavior and concurrency
  • Troubleshoot and address blocking issues and deadlocks
  • Provide required data consistency while minimizing concurrency issues
  • Design efficient transaction strategies that lead to scalable code
  • Reduce concurrency problems through good schema design
  • Understand concurrency models for In-Memory OLTP and Columnstore Indexes
  • Reduce blocking during index maintenance, batch data load, and similar tasks

Who This Book Is For
SQL Server developers, database administrators, and application architects who are developing highly concurrent applications. The book is for anyone interested in the technical aspects of creating and troubleshooting high-throughput systems that respond swiftly to user requests.

Language: English
Publisher: Apress
Release date: Oct 8, 2018
ISBN: 9781484239575

    Book preview

    Expert SQL Server Transactions and Locking - Dmitri Korotkevitch

    © Dmitri Korotkevitch 2018

    Dmitri Korotkevitch, Expert SQL Server Transactions and Locking, https://doi.org/10.1007/978-1-4842-3957-5_1

    1. Data Storage and Access Methods

    Dmitri Korotkevitch, Land O Lakes, Florida, USA

    It is impossible to grasp the SQL Server concurrency model without understanding how SQL Server stores and accesses the data. This knowledge helps you to comprehend various aspects of locking behavior in the system, and it is also essential when troubleshooting concurrency issues.

    Nowadays, SQL Server and Microsoft Azure SQL Databases support three different technologies that dictate how data is stored and manipulated in the system. The classic Storage Engine implements row-based storage. This technology persists the data in disk-based tables, combining all columns from a table together into data rows. The data rows, in turn, reside on 8 KB data pages, each of which may have one or multiple rows.

    Starting with SQL Server 2012, you can store data in a columnar format using columnstore indexes. SQL Server splits the data into row groups of up to 1,048,576 rows each. The data in the row group is combined and stored on a per-column rather than a per-row basis. This format is optimized for reporting and analytics queries.

    Finally, the In-Memory OLTP Engine, introduced in SQL Server 2014, allows you to define memory-optimized tables, which keep all data entirely in memory. The data rows in memory are linked into data row chains through memory pointers. This technology is optimized for heavy OLTP workloads.

    We will discuss locking behavior in In-Memory OLTP and columnstore indexes later in the book, after we cover the concurrency model of the classic Storage Engine. This knowledge is a cornerstone of understanding how SQL Server behaves in a multi-user environment.

    The goal of this chapter is to give a high-level overview of row-based storage in SQL Server. It will explain how SQL Server stores the data in disk-based tables, illustrate the structure of B-Tree indexes, and demonstrate how SQL Server accesses data from them.

    You should not consider this chapter as a deep dive into the SQL Server Storage Engine. It should provide, however, enough information to discuss the concurrency model in SQL Server.

    Anatomy of a Table

    The internal structure of a disk-based table is rather complex and consists of multiple elements and internal objects, as shown in Figure 1-1.

    Figure 1-1. Internal structure of a table

    The data in the tables is stored either completely unsorted (those tables are called heap tables or heaps) or sorted according to the value of a clustered index key when a table has such an index defined.

    In addition to a single clustered index, every table may have a set of nonclustered indexes. These indexes are separate data structures that store a copy of the data from a table sorted according to the index key column(s). For example, if a column was included in three nonclustered indexes, SQL Server would store that data four times—once in the clustered index or heap and once in each of the three nonclustered indexes.

    You can create up to 249 or 999 nonclustered indexes per table, depending on the SQL Server version. However, it is clearly not a good idea to create a large number of them due to the overhead they introduce. In addition to storage overhead, SQL Server needs to insert into or delete from each nonclustered index during data modifications. Moreover, an update operation requires SQL Server to modify data in every index in which the updated columns are present.
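    As a sketch of that write overhead, consider a hypothetical dbo.Orders table (the table and index names below are assumptions, not objects from this chapter). An update touches only the indexes that contain the modified columns:

```sql
-- Hypothetical table; every index below adds write overhead.
create table dbo.Orders
(
    OrderId int not null,
    CustomerId int not null,
    OrderDate datetime not null,
    Total money not null
);

create unique clustered index IDX_Orders_OrderId
on dbo.Orders(OrderId);

create nonclustered index IDX_Orders_CustomerId
on dbo.Orders(CustomerId);

create nonclustered index IDX_Orders_OrderDate
on dbo.Orders(OrderDate) include(Total);

-- This UPDATE changes Total, so SQL Server must modify the clustered
-- index row and the IDX_Orders_OrderDate row that includes Total,
-- but it does not need to touch IDX_Orders_CustomerId.
update dbo.Orders set Total = Total + 10 where OrderId = 1;
```

    An INSERT or DELETE, by contrast, has to modify every index on the table, which is why each additional index makes data modifications more expensive.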

    Internally, each index (and heap) consists of one or multiple partitions. Every partition, in a nutshell, is an internal data structure (index or heap) independent from other partitions in the object. SQL Server allows the use of a different partition strategy for every index in the table; however, in most cases, all indexes are partitioned in the same way and aligned with each other.

    Note

    Every table/index in SQL Server is partitioned. Non-partitioned tables are treated as single-partition tables/indexes internally.

    As I already mentioned, the actual data is stored in data rows on 8 KB data pages with 8,060 bytes available to users. The pages that store users’ data may belong to three different categories called allocation units based on the type of data they store.

    IN_ROW_DATA allocation unit pages store the main data row objects, which consist of internal attributes and the data from fixed-length columns, such as int, datetime, float, and others. The in-row part of a data row must fit on a single data page and, therefore, cannot exceed 8,060 bytes. The data from variable-length columns, such as (n)varchar(max), (n)varbinary(max), xml, and others, may also be stored in-row in the main row object when it fits into this limit.

    In cases when variable-length data does not fit in-row, SQL Server stores it off-row on different data pages, referencing them through in-row pointers. Variable-length data that exceeds 8,000 bytes is stored on LOB_DATA allocation unit data pages (LOB stands for large objects). Otherwise, the data is stored in ROW_OVERFLOW_DATA allocation unit pages.

    Let’s look at an example and create a table that contains several fixed- and variable-length columns and insert one row there, as shown in Listing 1-1.

    create table dbo.DataRows
    (
        ID int not null,
        ADate datetime not null,
        VarCol1 varchar(max),
        VarCol2 varchar(5000),
        VarCol3 varchar(5000)
    );

    insert into dbo.DataRows(ID, ADate, VarCol1, VarCol2, VarCol3)
    values
    (
        1
        ,'1974-08-22'
        ,replicate(convert(varchar(max),'A'),32000)
        ,replicate(convert(varchar(max),'B'),5000)
        ,replicate(convert(varchar(max),'C'),5000)
    );

    Listing 1-1

    Data row storage: Creating the test table

    The data from the fixed-length columns (ID, ADate) will be stored in-row on an IN_ROW_DATA allocation unit page. The data from the VarCol1 column is 32,000 bytes and will be stored on LOB_DATA data pages.

    The VarCol2 and VarCol3 columns have 5,000 bytes of data each. SQL Server would keep one of them in-row (it would fit into the 8,060-byte limit) and place the other one on a single ROW_OVERFLOW_DATA page.

    Note

    Off-row column pointers use 16 or 24 bytes in-row, which counts toward the 8,060 maximum row size. In practice, this may limit the number of columns you can have in a table.

    Figure 1-2 illustrates this state.

    Figure 1-2. Data row storage: Data pages after the first INSERT

    The sys.dm_db_index_physical_stats dynamic management function is usually used to analyze index fragmentation. It also displays the information about data pages on a per–allocation unit basis.

    Listing 1-2 shows the query that returns the information about the dbo.DataRows table.

    select
        index_id, partition_number, alloc_unit_type_desc
        ,page_count, record_count, min_record_size_in_bytes
        ,max_record_size_in_bytes, avg_record_size_in_bytes
    from
        sys.dm_db_index_physical_stats
        (
            db_id()
            ,object_id(N'dbo.DataRows')
            ,0  /* IndexId = 0 -> Table Heap */
            ,NULL /* All Partitions */
            ,'DETAILED'
        );

    Listing 1-2

    Data row storage: Analyzing the table using sys.dm_db_index_physical_stats DMO

    Figure 1-3 illustrates the output of the code. As expected, the table has one IN_ROW_DATA, one ROW_OVERFLOW_DATA, and four LOB_DATA pages. The IN_ROW_DATA page has about 2,900 free bytes available.

    Figure 1-3. Data row storage: sys.dm_db_index_physical_stats output after the first INSERT

    Let’s insert another row using the code from Listing 1-3.

    insert into dbo.DataRows(ID, ADate, VarCol1, VarCol2, VarCol3)
    values(2,'2006-09-29','DDDDD','EEEEE','FFFFF');

    Listing 1-3

    Data row storage: Inserting the second row

    All three variable-length columns store five-character strings, and, therefore, the row would fit on the already-allocated IN_ROW_DATA page. Figure 1-4 illustrates data pages at this phase.

    Figure 1-4. Data row storage: Data pages after the second INSERT

    You can confirm it by running the code from Listing 1-2 again. Figure 1-5 illustrates the output from the view.

    Figure 1-5. Data row storage: sys.dm_db_index_physical_stats output after the second INSERT

    SQL Server logically groups eight pages into 64 KB units called extents. There are two types of extents: mixed extents store data that belongs to different objects, while uniform extents store data for a single object.

    By default, when a new object is created, SQL Server stores the first eight object pages in mixed extents. After that, all subsequent space allocation for that object is done with uniform extents.

    Tip

    Disabling mixed extents allocation may help to improve tempdb throughput in the system. In SQL Server prior to 2016, you can achieve that by enabling server-level trace flag T1118. This trace flag is not required in SQL Server 2016 and above, where tempdb does not use mixed extents anymore.
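    As a sketch of how that trace flag is enabled (applies only to SQL Server 2014 and earlier; confirm against your environment before using it):

```sql
-- Enable trace flag 1118 server-wide so new allocations
-- use uniform extents (pre-SQL Server 2016 only).
dbcc traceon (1118, -1);

-- Verify the flag status.
dbcc tracestatus (1118);
```

    To make the flag survive restarts, it is normally specified as the -T1118 startup parameter rather than enabled with DBCC TRACEON.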

    SQL Server uses a special kind of pages, called allocation maps, to track extent and page usage in database files. Index Allocation Maps (IAM) pages track extents that belong to an allocation unit on a per-partition basis. Those pages are, in a nutshell, bitmaps, where each bit indicates if the extent belongs to a specific allocation unit from the object partition.

    Each IAM page covers about 64,000 extents, or almost 4 GB of data in a data file. For larger files, multiple IAM pages are linked together into IAM chains.

    Note

    There are many other types of allocation maps used for database management. You can read about them at https://docs.microsoft.com/en-us/sql/relational-databases/pages-and-extents-architecture-guide or in my Pro SQL Server Internals book.

    Heap Tables

    Heap tables are tables without a clustered index. The data in heap tables is unsorted. SQL Server does not guarantee, nor does it maintain, a sorting order of the data in heap tables.

    When you insert data into heap tables, SQL Server tries to fill pages as much as possible, although it does not analyze the actual free space available on a page. It uses another type of allocation map page, called Page Free Space (PFS), which tracks the amount of free space on each page. This tracking is imprecise, however. SQL Server uses three bits per page, which indicate whether the page is empty or is 1 to 50, 51 to 80, 81 to 95, or above 95 percent full. It is entirely possible that SQL Server would not store a new row on a page even when that page has enough space available.

    When you select data from the heap table, SQL Server uses IAM pages to find the pages and extents that belong to the table, processing them based on their order on the IAM pages rather than on the order in which the data was inserted. Figure 1-6 illustrates this point. This operation is shown as Table Scan in the execution plan.

    Figure 1-6. Selecting data from the heap table

    When you update the row in the heap table, SQL Server tries to accommodate it on the same page. If there is no free space available, SQL Server moves the new version of the row to another page and replaces the old row with a special 16-byte row called a forwarding pointer. The new version of the row is called a forwarded row. Figure 1-7 illustrates this point.

    Figure 1-7. Forwarding pointers

    There are two main reasons why forwarding pointers are used. First, they prevent updates of nonclustered index keys, which reference the row. We will talk about nonclustered indexes in more detail later in this chapter.

    In addition, forwarding pointers help minimize the number of duplicated reads; that is, the situation when a single row is read multiple times during the table scan. Let’s look at Figure 1-7 as an example and assume that SQL Server scans the pages in left-to-right order. Let’s further assume that the row in page 3 was modified at the time when SQL Server reads page 4 (after page 3 has already been read). The new version of the row would be moved to page 5, which has yet to be processed. Without forwarding pointers, SQL Server would not know that the old version of the row had already been read, and it would read it again during the page 5 scan. With forwarding pointers, SQL Server skips the forwarded rows—they have a flag in their internal attributes indicating that condition.

    Although forwarding pointers help minimize duplicated reads, they introduce additional read operations at the same time. SQL Server follows the forwarding pointers and reads the new versions of the rows at the time it encounters them. That behavior can introduce an excessive number of I/O operations when heap tables are frequently updated and have a large number of forwarded rows.

    Note

    You can analyze the number of forwarded rows in the table by checking the forwarded_record_count column in the sys.dm_db_index_physical_stats view.
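    For example, a query in the spirit of Listing 1-2 reports forwarded rows for the dbo.DataRows table (a sketch; forwarded_record_count is populated only in the DETAILED scan mode):

```sql
select
    alloc_unit_type_desc, page_count
    ,record_count, forwarded_record_count
from
    sys.dm_db_index_physical_stats
    (
        db_id()
        ,object_id(N'dbo.DataRows')
        ,0    /* IndexId = 0 -> Table Heap */
        ,NULL /* All Partitions */
        ,'DETAILED'
    );
```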

    When the size of the forwarded row is reduced by another update, and the data page with the forwarding pointer has enough space to accommodate the updated version of the row, SQL Server may move it back to its original data page and remove the forwarding pointer row. Nevertheless, the only reliable way to get rid of all forwarding pointers is by rebuilding the heap table. You can do that by using an ALTER TABLE REBUILD statement.
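    For instance, rebuilding the dbo.DataRows heap from the earlier listings removes all of its forwarding pointers:

```sql
-- Rebuild the heap; forwarded rows are written back as regular
-- rows and the forwarding pointers disappear.
alter table dbo.DataRows rebuild;
```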

    Heap tables can be useful in staging environments where you want to import a large amount of data into the system as quickly as possible. Inserting data into heap tables can often be faster than inserting it into tables with clustered indexes. Nevertheless, during a regular workload, tables with clustered indexes usually outperform heap tables as a result of heap tables’ suboptimal space control and extra I/O operations introduced by forwarding pointers.

    Note

    You can find the scripts that demonstrate forwarding pointers’ overhead and suboptimal space control in heap tables in this book’s companion materials.

    Clustered Indexes and B-Trees

    A clustered index dictates the physical order of the data in a table, which is sorted according to the clustered index key. The table can have only one clustered index defined.

    Let’s assume that you want to create a clustered index on the heap table with the data. As a first step, which is shown in Figure 1-8, SQL Server creates another copy of the data and sorts it based on the value of the clustered index key. The data pages are linked in a double-linked list, where every page contains pointers to the next and previous pages in the chain. This list is called the leaf level of the index, and it contains the actual table data.

    Figure 1-8. Clustered index structure: Leaf level
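    As a sketch, a clustered index on the dbo.Customers table used in the following examples could be created like this (the table definition is an assumption; only the CustomerId and Name columns appear in this chapter):

```sql
-- Hypothetical definition matching the columns used in this chapter.
create table dbo.Customers
(
    CustomerId int not null,
    Name varchar(64) not null
);

-- Sort the data by CustomerId; SQL Server converts the heap
-- into a B-Tree whose leaf level holds the table data.
create unique clustered index IDX_Customers_CustomerId
on dbo.Customers(CustomerId);
```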

    Note

    The pages reference each other through page addresses, which consist of two values: file_id in the database and sequential number of the page in the file.

    When the leaf level consists of multiple pages, SQL Server starts to build an intermediate level of the index, as shown in Figure 1-9.

    Figure 1-9. Clustered index structure: Intermediate levels

    The intermediate level stores one row for each leaf-level page. Each row stores two pieces of information: the physical address and the minimum value of the index key from the page it references. The only exception is the very first row on the first page, where SQL Server stores NULL rather than the minimum index key value. With this optimization, SQL Server does not need to update non-leaf-level rows when you insert the row with the lowest key value into the table.

    The pages on the intermediate level are also linked in a double-linked list. SQL Server adds more and more intermediate levels until there is a level that includes just a single page. This level is called the root level, and it becomes the entry point to the index, as shown in Figure 1-10.

    Note

    This index structure is called a B-Tree Index, which stands for Balanced Tree.

    Figure 1-10. Clustered index structure: Root level

    As you can see, the index always has one leaf level, one root level, and zero or more intermediate levels. The only exception is when the index data fits into a single page. In that case, SQL Server does not create the separate root-level page, and the index consists of just the single leaf-level page.
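    One way to observe those levels (a sketch against the assumed dbo.Customers table; index_id 1 always refers to the clustered index) is the same DMO used earlier in DETAILED mode, which returns one row per index level:

```sql
select index_level, page_count, record_count
from
    sys.dm_db_index_physical_stats
    (
        db_id()
        ,object_id(N'dbo.Customers')
        ,1    /* IndexId = 1 -> Clustered Index */
        ,NULL /* All Partitions */
        ,'DETAILED'
    )
order by index_level desc; -- root level first, leaf level (0) last
```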

    SQL Server always maintains the order of the data in the index, inserting new rows on the data pages to which they belong. In cases when a data page does not have enough free space, SQL Server allocates a new page and places the row there, adjusting pointers in the double-linked page list to maintain a logical sorting order in the index. This operation is called page split and leads to index fragmentation.

    Figure 1-11 illustrates this condition. When Original Page does not have enough space to accommodate the new row, SQL Server performs a page split, moving about half of the data from Original Page to New Page, adjusting page pointers afterward.

    Figure 1-11. Leaf-level data pages after page split

    A page split may also occur during data modifications. SQL Server does not use forwarding pointers with B-Tree indexes. Instead, when an update cannot be done in-place—for example, during data row increase—SQL Server performs a page split and moves updated and subsequent rows from the page to another page. Nevertheless, the index sorting order is maintained through the page pointers.

    SQL Server may read the data from the index in three different ways. The first is an allocation order scan. SQL Server accesses the table data through IAM pages, similar to how it does with heap tables. This method, however, could introduce data consistency phenomena—with page splits, rows may be skipped or read more than once—and, therefore, allocation order scans are rarely used. We will discuss conditions that may lead to allocation order scans later in the book.

    The second method is called an ordered scan. Let’s assume that we want to run the SELECT Name FROM dbo.Customers query. All data rows reside on the leaf level of the index, and SQL Server can scan it and return the rows to the client.

    SQL Server starts with the root page of the index and reads the first row from there. That row references the intermediate page with the minimum key value from the table. SQL Server reads that page and repeats the process until it finds the first page on the leaf level. Then, SQL Server starts to read rows one by one, moving through the linked list of the pages until all rows have been read. Figure 1-12 illustrates this process.

    Figure 1-12. Ordered index scan

    Both allocation order scan and ordered scan are represented as Index Scan operators in the execution plans.

    Note

    SQL Server can navigate through indexes in both directions, forward and backward. However, SQL Server does not use parallelism during backward index scans.

    The last index access method is called index seek. Let’s assume we want to run the following query: SELECT Name FROM dbo.Customers WHERE CustomerId BETWEEN 4 AND 7. Figure 1-13 illustrates how SQL Server may process it.

    Figure 1-13. Index seek

    In order to read the range of rows from the table, SQL Server needs to find the row with the minimum value of the key from the range, which is 4. SQL Server starts with the root page, where the second row references the page with the minimum key value of 350. It is greater than the key value that we are looking for, and SQL Server reads the intermediate-level data page (1:170) referenced by the first row on the root page.

    Similarly, the intermediate page leads SQL Server to the first leaf-level page (1:176). SQL Server reads that page, then it reads the rows with CustomerId equal to 4 and 5, and, finally, it reads the two remaining rows from the second page.

    Technically speaking, there are two kinds of index seek operations. The first is called a point-lookup (or, sometimes, singleton lookup), where SQL Server seeks and returns a single row. You can think about the WHERE CustomerId = 2 predicate as an example.

    The other type is called a range scan, and it requires SQL Server to find the lowest or highest value of the key and scan (either forward or backward) the set of rows until it reaches the end of the scan range. The predicate WHERE CustomerId BETWEEN 4 AND 7 leads to a range scan. Both cases are shown as Index Seek operators in the execution plans.
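    To see both flavors of the seek side by side (a sketch; dbo.Customers with a clustered index on CustomerId is assumed from the earlier examples), you can compare the logical reads each query performs:

```sql
set statistics io on;

-- Point lookup: a single row, one root-to-leaf traversal.
select Name from dbo.Customers where CustomerId = 2;

-- Range scan: traverse to the first qualifying row,
-- then follow the leaf-level page chain.
select Name from dbo.Customers where CustomerId between 4 and 7;

set statistics io off;
```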

    As you can guess, index seek is more efficient than index scan, because SQL Server processes just the subset of rows and data pages rather than scanning the entire index.
