Ebook886 pages4 hours

Solr Cookbook - Third Edition

Name: Solr Cookbook - Third Edition
Author: Rafał Kuć
ISBN: 9781783553167

By Rafał Kuć

Rating: 0 out of 5 stars

()

Read preview

About this ebook

About This Book

Solve performance, setup, configuration, analysis, and querying problems in no time
Learn to efficiently utilize faceting and grouping
Explore real-life examples of Apache Solr and how to deal with any issues that might arise using this practical guide

Who This Book Is For

This book is for intermediate Solr Developers who are willing to learn and implement Pro-level practices, techniques, and solutions. This edition will specifically appeal to developers who wish to quickly get to grips with the changes and new features of Apache Solr 5.

Skip carousel

LanguageEnglish

PublisherPackt Publishing

Release dateJan 23, 2015

ISBN9781783553167

Author

Rafał Kuć

Related to Solr Cookbook - Third Edition

Related ebooks

Skip carousel

Neo4j Cookbook
Ebook
Neo4j Cookbook
byAnkur Goel
Rating: 0 out of 5 stars
0 ratings
Apache Hive Cookbook
Ebook
Apache Hive Cookbook
byShrey Mehrotra
Rating: 0 out of 5 stars
0 ratings
PostgreSQL 9 High Availability Cookbook
Ebook
PostgreSQL 9 High Availability Cookbook
byShaun M. Thomas
Rating: 5 out of 5 stars
5/5
Hadoop 2.x Administration Cookbook
Ebook
Hadoop 2.x Administration Cookbook
byGurmukh Singh
Rating: 0 out of 5 stars
0 ratings
Elixir Cookbook
Ebook
Elixir Cookbook
byPaulo A Pereira
Rating: 0 out of 5 stars
0 ratings
PostgreSQL High Performance Cookbook
Ebook
PostgreSQL High Performance Cookbook
byChitij Chauhan
Rating: 0 out of 5 stars
0 ratings
Hadoop Real-World Solutions Cookbook - Second Edition
Ebook
Hadoop Real-World Solutions Cookbook - Second Edition
byDeshpande Tanmay
Rating: 0 out of 5 stars
0 ratings
D Cookbook
Ebook
D Cookbook
byAdam D. Ruppe
Rating: 0 out of 5 stars
0 ratings
Apache Camel Developer's Cookbook
Ebook
Apache Camel Developer's Cookbook
byScott Cranton
Rating: 0 out of 5 stars
0 ratings
Git Version Control Cookbook
Ebook
Git Version Control Cookbook
byAske Olsson
Rating: 4 out of 5 stars
4/5
PostgreSQL 9 Administration Cookbook - Second Edition
Ebook
PostgreSQL 9 Administration Cookbook - Second Edition
bySimon Riggs
Rating: 0 out of 5 stars
0 ratings
Apache Solr for Indexing Data
Ebook
Apache Solr for Indexing Data
byHandiekar Sachin
Rating: 0 out of 5 stars
0 ratings
Apache Solr Search Patterns
Ebook
Apache Solr Search Patterns
byJayant Kumar
Rating: 0 out of 5 stars
0 ratings
Building Python Real-Time Applications with Storm
Ebook
Building Python Real-Time Applications with Storm
byBhatnagar Kartik
Rating: 0 out of 5 stars
0 ratings
Administrating Solr
Ebook
Administrating Solr
bySurendra Mohan
Rating: 0 out of 5 stars
0 ratings
Learning HBase
Ebook
Learning HBase
byShashwat Shriparv
Rating: 0 out of 5 stars
0 ratings
Apache Cassandra Essentials
Ebook
Apache Cassandra Essentials
byPadalia Nitin
Rating: 4 out of 5 stars
4/5
Hadoop in Practice
Ebook
Hadoop in Practice
byAlex Holmes
Rating: 0 out of 5 stars
0 ratings
Practical OneOps
Ebook
Practical OneOps
byNilesh Nimkar
Rating: 0 out of 5 stars
0 ratings
Monitoring Elasticsearch
Ebook
Monitoring Elasticsearch
byDan Noble
Rating: 0 out of 5 stars
0 ratings
Learning Apache Mahout Classification
Ebook
Learning Apache Mahout Classification
byGupta Ashish
Rating: 0 out of 5 stars
0 ratings
The Illustrated AWS Cloud: A Guide to Help You on Your Cloud Practitioner Journey
Ebook
The Illustrated AWS Cloud: A Guide to Help You on Your Cloud Practitioner Journey
byJen Looper
Rating: 0 out of 5 stars
0 ratings
Apache ZooKeeper Essentials
Ebook
Apache ZooKeeper Essentials
bySaurav Haloi
Rating: 5 out of 5 stars
5/5
Elasticsearch 8 for Developers - 2nd Edition: A beginner's guide to indexing, analyzing, searching, and aggregating data (English Edition)
Ebook
Elasticsearch 8 for Developers - 2nd Edition: A beginner's guide to indexing, analyzing, searching, and aggregating data (English Edition)
byAnurag Srivastava
Rating: 0 out of 5 stars
0 ratings
Securing Hadoop
Ebook
Securing Hadoop
bySudheesh Narayanan
Rating: 4 out of 5 stars
4/5
Mastering Apache Cassandra - Second Edition
Ebook
Mastering Apache Cassandra - Second Edition
byNishant Neeraj
Rating: 0 out of 5 stars
0 ratings
Exploring Hadoop Ecosystem (Volume 1): Batch Processing
Ebook
Exploring Hadoop Ecosystem (Volume 1): Batch Processing
byWei Liu
Rating: 0 out of 5 stars
0 ratings
Schematron: A language for validating XML
Ebook
Schematron: A language for validating XML
byErik Siegel
Rating: 0 out of 5 stars
0 ratings
PostgreSQL Development Essentials
Ebook
PostgreSQL Development Essentials
byManpreet Kaur
Rating: 5 out of 5 stars
5/5
Mastering PostgreSQL 9.6
Ebook
Mastering PostgreSQL 9.6
byHans-Jürgen Schönig
Rating: 0 out of 5 stars
0 ratings

Computers For You

Skip carousel

Mastering ChatGPT: 21 Prompts Templates for Effortless Writing
Ebook
Mastering ChatGPT: 21 Prompts Templates for Effortless Writing
byCea West
Rating: 5 out of 5 stars
5/5
SQL QuickStart Guide: The Simplified Beginner's Guide to Managing, Analyzing, and Manipulating Data With SQL
Ebook
SQL QuickStart Guide: The Simplified Beginner's Guide to Managing, Analyzing, and Manipulating Data With SQL
byWalter Shields
Rating: 4 out of 5 stars
4/5
AI Crash Course: A fun and hands-on introduction to machine learning, reinforcement learning, deep learning, and artificial intelligence with Python
Ebook
AI Crash Course: A fun and hands-on introduction to machine learning, reinforcement learning, deep learning, and artificial intelligence with Python
byHadelin de Ponteves
Rating: 0 out of 5 stars
0 ratings
Excel Essentials: A Step-by-Step Guide with Pictures for Absolute Beginners to Master the Basics and Start Using Excel with Confidence
Ebook
Excel Essentials: A Step-by-Step Guide with Pictures for Absolute Beginners to Master the Basics and Start Using Excel with Confidence
byNigel Tillery
Rating: 0 out of 5 stars
0 ratings
How to Create Cpn Numbers the Right way: A Step by Step Guide to Creating cpn Numbers Legally
Ebook
How to Create Cpn Numbers the Right way: A Step by Step Guide to Creating cpn Numbers Legally
byAlex Parkinson
Rating: 4 out of 5 stars
4/5
Creating Online Courses with ChatGPT | A Step-by-Step Guide with Prompt Templates
Ebook
Creating Online Courses with ChatGPT | A Step-by-Step Guide with Prompt Templates
byCea West
Rating: 4 out of 5 stars
4/5
Deep Search: How to Explore the Internet More Effectively
Ebook
Deep Search: How to Explore the Internet More Effectively
byAlan Pearce
Rating: 5 out of 5 stars
5/5
Machine Learning for Beginners: An Introduction for Beginners, Why Machine Learning Matters Today and How Machine Learning Networks, Algorithms, Concepts and Neural Networks Really Work
Ebook
Machine Learning for Beginners: An Introduction for Beginners, Why Machine Learning Matters Today and How Machine Learning Networks, Algorithms, Concepts and Neural Networks Really Work
bySteven Cooper
Rating: 4 out of 5 stars
4/5
Grokking Algorithms: An illustrated guide for programmers and other curious people
Ebook
Grokking Algorithms: An illustrated guide for programmers and other curious people
byAditya Bhargava
Rating: 4 out of 5 stars
4/5
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
Ebook
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
bySteven Cooper
Rating: 4 out of 5 stars
4/5
CompTIA IT Fundamentals (ITF+) Study Guide: Exam FC0-U61
Ebook
CompTIA IT Fundamentals (ITF+) Study Guide: Exam FC0-U61
byQuentin Docter
Rating: 0 out of 5 stars
0 ratings
CompTIA Security+ Practice Questions
Ebook
CompTIA Security+ Practice Questions
byIP Specialist
Rating: 2 out of 5 stars
2/5
The ChatGPT Millionaire Handbook: Make Money Online With the Power of AI Technology
Ebook
The ChatGPT Millionaire Handbook: Make Money Online With the Power of AI Technology
byTJ Books
Rating: 0 out of 5 stars
0 ratings
Network+ Study Guide & Practice Exams
Ebook
Network+ Study Guide & Practice Exams
byRobert Shimonski
Rating: 4 out of 5 stars
4/5
The Simulation Hypothesis: An MIT Computer Scientist Shows Why AI, Quantum Physics and Eastern Mystics All Agree We Are In a Video Game
Ebook
The Simulation Hypothesis: An MIT Computer Scientist Shows Why AI, Quantum Physics and Eastern Mystics All Agree We Are In a Video Game
byRizwan Virk
Rating: 5 out of 5 stars
5/5
Ultimate Guide to Mastering Command Blocks!: Minecraft Keys to Unlocking Secret Commands
Ebook
Ultimate Guide to Mastering Command Blocks!: Minecraft Keys to Unlocking Secret Commands
byTriumph Books
Rating: 5 out of 5 stars
5/5
Procreate for Beginners: Introduction to Procreate for Drawing and Illustrating on the iPad
Ebook
Procreate for Beginners: Introduction to Procreate for Drawing and Illustrating on the iPad
byAaron Smith
Rating: 0 out of 5 stars
0 ratings
Practical Lock Picking: A Physical Penetration Tester's Training Guide
Ebook
Practical Lock Picking: A Physical Penetration Tester's Training Guide
byDeviant Ollam
Rating: 5 out of 5 stars
5/5
ChatGPT Ultimate User Guide - How to Make Money Online Faster and More Precise Using AI Technology
Ebook
ChatGPT Ultimate User Guide - How to Make Money Online Faster and More Precise Using AI Technology
byMaximus Wilson
Rating: 0 out of 5 stars
0 ratings
AP Computer Science Principles Premium, 2024: 6 Practice Tests + Comprehensive Review + Online Practice
Ebook
AP Computer Science Principles Premium, 2024: 6 Practice Tests + Comprehensive Review + Online Practice
bySeth Reichelson
Rating: 0 out of 5 stars
0 ratings
Python for Beginners. A Smarter Way to Learn Python in 5 Days and Remember it Longer. With Easy Step by Step Guidance and Hands on Examples. (Python Crash Course-Programming for Beginners)
Ebook
Python for Beginners. A Smarter Way to Learn Python in 5 Days and Remember it Longer. With Easy Step by Step Guidance and Hands on Examples. (Python Crash Course-Programming for Beginners)
byArthur T. Brooks
Rating: 0 out of 5 stars
0 ratings
Childhood Unplugged: Practical Advice to Get Kids Off Screens and Find Balance
Ebook
Childhood Unplugged: Practical Advice to Get Kids Off Screens and Find Balance
byKatherine Johnson Martinko
Rating: 0 out of 5 stars
0 ratings
The Professional Voiceover Handbook: Voiceover training, #1
Ebook
The Professional Voiceover Handbook: Voiceover training, #1
byPeter Baker
Rating: 5 out of 5 stars
5/5
Summary of Dotcom Secrets: by Russell Brunson - The Underground Playbook for Growing Your Company Online with Sales Funnels - A Comprehensive Summary
Ebook
Summary of Dotcom Secrets: by Russell Brunson - The Underground Playbook for Growing Your Company Online with Sales Funnels - A Comprehensive Summary
byAlexander Cooper
Rating: 5 out of 5 stars
5/5
Dark Aeon: Transhumanism and the War Against Humanity
Ebook
Dark Aeon: Transhumanism and the War Against Humanity
byJoe Allen
Rating: 5 out of 5 stars
5/5
Elon Musk
Ebook
Elon Musk
byWalter Isaacson
Rating: 4 out of 5 stars
4/5
Master Builder Roblox: The Essential Guide
Ebook
Master Builder Roblox: The Essential Guide
byTriumph Books
Rating: 4 out of 5 stars
4/5
101 Awesome Builds: Minecraft® Secrets from the World's Greatest Crafters
Ebook
101 Awesome Builds: Minecraft® Secrets from the World's Greatest Crafters
byTriumph Books
Rating: 4 out of 5 stars
4/5
How to Write a Book: An 11-Step Process to Build Habits, Stop Procrastinating, Fuel Self-Motivation, Quiet Your Inner Critic, Bust Through Writer's Block, & Let Your Creative Juices Flow (Short Read)
Ebook
How to Write a Book: An 11-Step Process to Build Habits, Stop Procrastinating, Fuel Self-Motivation, Quiet Your Inner Critic, Bust Through Writer's Block, & Let Your Creative Juices Flow (Short Read)
byDavid Kadavy
Rating: 5 out of 5 stars
5/5
Hacking: Ultimate Beginner's Guide for Computer Hacking in 2018 and Beyond: Hacking in 2018, #1
Ebook
Hacking: Ultimate Beginner's Guide for Computer Hacking in 2018 and Beyond: Hacking in 2018, #1
byDexter Jackson
Rating: 4 out of 5 stars
4/5

Related podcast episodes

Skip carousel

Ali Ghodsi – The Past, Present, and Future of Big Data – [Founder’s Field Guide, EP.18]: My Guest today is Ali Ghodsi, founder and CEO of Databricks, a data analytics platform for data scientists and developers. He's also the founder of Apache Spark, the open-source project that Databricks is built on, and is an accomplished researcher at...
Podcast episode
Ali Ghodsi – The Past, Present, and Future of Big Data – [Founder’s Field Guide, EP.18]: My Guest today is Ali Ghodsi, founder and CEO of Databricks, a data analytics platform for data scientists and developers. He's also the founder of Apache Spark, the open-source project that Databricks is built on, and is an accomplished researcher at...
byInvest Like the Best with Patrick O'Shaughnessy
0 ratings
0% found this document useful
433: Falling for FastAPI: Mike's falling in love with FastAPI and gives us a hint at the next project he's building.
Podcast episode
433: Falling for FastAPI: Mike's falling in love with FastAPI and gives us a hint at the next project he's building.
byCoder Radio
0 ratings
0% found this document useful
The future of programming and defining success as a software engineer: On this episode Abadesi talks to Cassidy Williams. Cassidy is a great follow on social media and is a software engineer at CodePen. Prior to CodePen, she worked for Venmo, Amazon, Clarify and others. She is a true maker and a huge mechanical keyboard nerd (which you hear a bit about on the show). In this episode they discuss... * How she got to where she is today, including lessons learned from working at big and small companies. * Her personal definition of success as a software engineer. * The future of programming. * Why she loves mechanical keyboards so much. We’ll be back next week so be sure to subscribe wherever you listen to your favorite podcasts. Big thanks to Copper for their support. ?
Podcast episode
The future of programming and defining success as a software engineer: On this episode Abadesi talks to Cassidy Williams. Cassidy is a great follow on social media and is a software engineer at CodePen. Prior to CodePen, she worked for Venmo, Amazon, Clarify and others. She is a true maker and a huge mechanical keyboard nerd (which you hear a bit about on the show). In this episode they discuss... * How she got to where she is today, including lessons learned from working at big and small companies. * Her personal definition of success as a software engineer. * The future of programming. * Why she loves mechanical keyboards so much. We’ll be back next week so be sure to subscribe wherever you listen to your favorite podcasts. Big thanks to Copper for their support. ?
byProduct Hunt Radio
0 ratings
0% found this document useful
Cloud Dataflow with Eric Anderson: Batch and stream processing systems have been evolving for the past decade. From MapReduce to Apache Storm to Dataflow, the best practices for large volume data processing have become more sophisticated as the industry and open source communities have ...
Podcast episode
Cloud Dataflow with Eric Anderson: Batch and stream processing systems have been evolving for the past decade. From MapReduce to Apache Storm to Dataflow, the best practices for large volume data processing have become more sophisticated as the industry and open source communities have ...
byCloud Engineering Archives - Software Engineering Daily
0 ratings
0% found this document useful
55: Go on The Web: Summary Andrew Gerrand (@enneff), Developer Advocate at Google & Go core contributor, talks about GoLang and how it is being used in Web Development today as well as the plans for the future of the Go as a platform for the web. Resources Go...
Podcast episode
55: Go on The Web: Summary Andrew Gerrand (@enneff), Developer Advocate at Google & Go core contributor, talks about GoLang and how it is being used in Web Development today as well as the plans for the future of the Go as a platform for the web. Resources Go...
byThe Web Platform Podcast
100%
100% found this document useful
Putting Airflow Into Production With James Meickle - Episode 43: Lessons Learned While Building A Data Science Platform With Airflow (Interview)
Podcast episode
Putting Airflow Into Production With James Meickle - Episode 43: Lessons Learned While Building A Data Science Platform With Airflow (Interview)
byData Engineering Podcast
0 ratings
0% found this document useful
Building Data Flows In Apache NiFi With Kevin Doran and Andy LoPresto - Episode 39: Self Service Data Flows With Apache NiFi (Interview)
Podcast episode
Building Data Flows In Apache NiFi With Kevin Doran and Andy LoPresto - Episode 39: Self Service Data Flows With Apache NiFi (Interview)
byData Engineering Podcast
0 ratings
0% found this document useful
Open Source Object Storage For All Of Your Data - Episode 99: An interview on the open source MinIO platform for fast and flexible object storage for data intensive applications and analytics that runs everywhere
Podcast episode
Open Source Object Storage For All Of Your Data - Episode 99: An interview on the open source MinIO platform for fast and flexible object storage for data intensive applications and analytics that runs everywhere
byData Engineering Podcast
0 ratings
0% found this document useful
How ChatGPT Changes Tech + The End of Remote Work? — With Aaron Levie
Podcast episode
How ChatGPT Changes Tech + The End of Remote Work? — With Aaron Levie
byBig Technology Podcast
100%
100% found this document useful
Level Up Your Data Platform With Active Metadata: A conversation with Atlan co-founder Prukalpa Sankar about the idea of active metadata and how it can reduce the toil involved in managing a data platform
Podcast episode
Level Up Your Data Platform With Active Metadata: A conversation with Atlan co-founder Prukalpa Sankar about the idea of active metadata and how it can reduce the toil involved in managing a data platform
byData Engineering Podcast
0 ratings
0% found this document useful
Reflections On Designing A Data Platform From Scratch: A monologue by Tobias Macey, the host of the show, about the design considerations involved in building a data platform and how the lessons learned from running the Data Engineering Podcast are influencing the choices made.
Podcast episode
Reflections On Designing A Data Platform From Scratch: A monologue by Tobias Macey, the host of the show, about the design considerations involved in building a data platform and how the lessons learned from running the Data Engineering Podcast are influencing the choices made.
byData Engineering Podcast
100%
100% found this document useful
Open Source TensorFlow with Yifei Feng: Yifei Feng, a TensorFlow software engineer, shares with Melanie and Mark about her work on the open source TensorFlow project and the tools she builds.
Podcast episode
Open Source TensorFlow with Yifei Feng: Yifei Feng, a TensorFlow software engineer, shares with Melanie and Mark about her work on the open source TensorFlow project and the tools she builds.
byGoogle Cloud Platform Podcast
100%
100% found this document useful
Data Visualization and D3.js with Irene Ros: Scott talks to Data Visualization expert Irene Ros. When she isn't contributing to the Miso Project, teaching her d3.js class, or working on making OpenVis Conf the best data visualization conference it can be, she's working on projects that focus on creating engaging interactive visual displays of information.
Podcast episode
Data Visualization and D3.js with Irene Ros: Scott talks to Data Visualization expert Irene Ros. When she isn't contributing to the Miso Project, teaching her d3.js class, or working on making OpenVis Conf the best data visualization conference it can be, she's working on projects that focus on creating engaging interactive visual displays of information.
byHanselminutes with Scott Hanselman
0 ratings
0% found this document useful
240: Important Kotlin Constructs: In this episode, Donn and Kaushik talk about 5 new-ish Kotlin constructs that you might not be aware of.
Podcast episode
240: Important Kotlin Constructs: In this episode, Donn and Kaushik talk about 5 new-ish Kotlin constructs that you might not be aware of.
byFragmented - An Android Developer Podcast
0 ratings
0% found this document useful
API First, Lifecycles and Governance
Podcast episode
API First, Lifecycles and Governance
byThe Cloudcast
0 ratings
0% found this document useful
Data Security in Snowflake’s Data Cloud with Dan Myers: Snowflake went public last year and is one of the fastest growing companies in the data cloud space. Businesses from all over the world are utilizing Snowflake for data storage, processing, and analytics. Businesses using Snowflake are storing massive am...
Podcast episode
Data Security in Snowflake’s Data Cloud with Dan Myers: Snowflake went public last year and is one of the fastest growing companies in the data cloud space. Businesses from all over the world are utilizing Snowflake for data storage, processing, and analytics. Businesses using Snowflake are storing massive am...
byPartially Redacted: Data Privacy, Security & Compliance
0 ratings
0% found this document useful
The Undocumented Web: scraping, private APIs, proxies and “alternative solutions”: What is the undocumented web? Scott and Wes dive into it, discussing APIs, faking, scraping, automation, proxies as well as tips and tricks for best practices. Kyle Prinsloo’s Freelancing & Beyond — Sponsor Kyle Prinsloo teaches you everything...
Podcast episode
The Undocumented Web: scraping, private APIs, proxies and “alternative solutions”: What is the undocumented web? Scott and Wes dive into it, discussing APIs, faking, scraping, automation, proxies as well as tips and tricks for best practices. Kyle Prinsloo’s Freelancing & Beyond — Sponsor Kyle Prinsloo teaches you everything...
bySyntax - Tasty Web Development Treats
0 ratings
0% found this document useful
Putting Apache Spark Into Action with Jean Georges Perrin - Episode 60: Tackling Apache Spark From The Data Engineer's Perspective (Interview)
Podcast episode
Putting Apache Spark Into Action with Jean Georges Perrin - Episode 60: Tackling Apache Spark From The Data Engineer's Perspective (Interview)
byData Engineering Podcast
0 ratings
0% found this document useful
108: PySpark - Jonathan Rioux: Apache Spark is a unified analytics engine for large-scale data processing. PySpark blends the powerful Spark big data processing engine with the Python programming language to provide a data analysis platform that can scale up for nearly any task.
Podcast episode
108: PySpark - Jonathan Rioux: Apache Spark is a unified analytics engine for large-scale data processing. PySpark blends the powerful Spark big data processing engine with the Python programming language to provide a data analysis platform that can scale up for nearly any task.
byTest and Code
0 ratings
0% found this document useful
A Chaos Engineering & Jeli Sandwich with Nora Jones: Nora Jones is the founder and CEO at Jeli, makers of an incident analysis platform that leverages data to recommend productive solutions to the problems at hand. Before this role, she was Head of Chaos Engineering and Human Factors at Slack, a senior soft
Podcast episode
A Chaos Engineering & Jeli Sandwich with Nora Jones: Nora Jones is the founder and CEO at Jeli, makers of an incident analysis platform that leverages data to recommend productive solutions to the problems at hand. Before this role, she was Head of Chaos Engineering and Human Factors at Slack, a senior soft
byScreaming in the Cloud
0 ratings
0% found this document useful
DynamoDB The Database of Choice for Serverless Applications with Alex DeBrie: Alex DeBrie is the founder of DeBrie, LLC, a cloud-native training and AWS consulting company with a focus on DynamoDB and serverless technologies. He’s also the author of The DynamoDB Book, a 450-page tome that offers tips, strategies, and more about dat
Podcast episode
DynamoDB The Database of Choice for Serverless Applications with Alex DeBrie: Alex DeBrie is the founder of DeBrie, LLC, a cloud-native training and AWS consulting company with a focus on DynamoDB and serverless technologies. He’s also the author of The DynamoDB Book, a 450-page tome that offers tips, strategies, and more about dat
byScreaming in the Cloud
0 ratings
0% found this document useful
Python, Django, and Channels: with Andrew Godwin, creator of Django Channels
Podcast episode
Python, Django, and Channels: with Andrew Godwin, creator of Django Channels
byThe Changelog: Software Development, Open Source
0 ratings
0% found this document useful
Speed Up And Simplify Your Streaming Data Workloads With Red Panda - Episode 152: An interview with Vectorized founder Alexander Gallego about the Red Panda streaming engine and building a drop-in replacement for Kafka with better performance and throughput.
Podcast episode
Speed Up And Simplify Your Streaming Data Workloads With Red Panda - Episode 152: An interview with Vectorized founder Alexander Gallego about the Red Panda streaming engine and building a drop-in replacement for Kafka with better performance and throughput.
byData Engineering Podcast
0 ratings
0% found this document useful
The hard parts of data architecture: Following on from our earlier episode on the Software Architecture: the hard parts, we’re joined by the other two co-authors of that book to explore issues around data architecture and how that fits into these broader concepts of architecture. We...
Podcast episode
The hard parts of data architecture: Following on from our earlier episode on the Software Architecture: the hard parts, we’re joined by the other two co-authors of that book to explore issues around data architecture and how that fits into these broader concepts of architecture. We...
byThoughtworks Technology Podcast
0 ratings
0% found this document useful
001: The PHP Community, PHP CLI, & ElePHPants: A discussion that examines the zeitgeist of PHP. From elePHPants, to user groups, to PHP internals, to testing, to the awesome people in the community.
Podcast episode
001: The PHP Community, PHP CLI, & ElePHPants: A discussion that examines the zeitgeist of PHP. From elePHPants, to user groups, to PHP internals, to testing, to the awesome people in the community.
byPHPRoundtable Podcast
0 ratings
0% found this document useful
Open Source at Google Cloud Platform with Sarah Novotny: Mark and Melanie are joined by Sarah Novotny, Head of Open Source Strategy for GCP, to talk all about Open Source, the Cloud Native Compute Foundation & their relationships to Google Cloud Platform.
Podcast episode
Open Source at Google Cloud Platform with Sarah Novotny: Mark and Melanie are joined by Sarah Novotny, Head of Open Source Strategy for GCP, to talk all about Open Source, the Cloud Native Compute Foundation & their relationships to Google Cloud Platform.
byGoogle Cloud Platform Podcast
100%
100% found this document useful
#37 Prophet, Time Series & Causal Inference, with Sean Taylor
Podcast episode
#37 Prophet, Time Series & Causal Inference, with Sean Taylor
byLearning Bayesian Statistics
0 ratings
0% found this document useful
Beam and Spark with Holden Karau: This week our colleague, Holden Karau, joins us to talk about Spark and Beam.
Podcast episode
Beam and Spark with Holden Karau: This week our colleague, Holden Karau, joins us to talk about Spark and Beam.
byGoogle Cloud Platform Podcast
0 ratings
0% found this document useful
Build A Full Stack ML Powered App In An Afternoon With Baseten: An interview with Tuhin Srivastava about how the Baseten platform allows data scientists and ML engineers to build a full stack machine learning powered application by themselves in an afternoon
Podcast episode
Build A Full Stack ML Powered App In An Afternoon With Baseten: An interview with Tuhin Srivastava about how the Baseten platform allows data scientists and ML engineers to build a full stack machine learning powered application by themselves in an afternoon
byThe Python Podcast.__init__
0 ratings
0% found this document useful
Microservices with Rafi Schloming: Microservices are a widely adopted pattern for breaking an application up into pieces that can be well-understood by the individual teams within the company. Microservices also allow these individual pieces to be scaled independently and updated in iso...
Podcast episode
Microservices with Rafi Schloming: Microservices are a widely adopted pattern for breaking an application up into pieces that can be well-understood by the individual teams within the company. Microservices also allow these individual pieces to be scaled independently and updated in iso...
byCloud Engineering Archives - Software Engineering Daily
0 ratings
0% found this document useful

Skip carousel

Build A Search And Analytic Engine
Linux Format
Article
Build A Search And Analytic Engine
Mar 10, 2020
7 min read
Create Asynchronous Code With Python
Linux Format
Article
Create Asynchronous Code With Python
Jun 29, 2021
8 min read
Build A Static Analysis Development Pipeline
Linux Format
Article
Build A Static Analysis Development Pipeline
Jul 27, 2021
9 min read
Ad-blocking To Get Harder
Linux Format
Article
Ad-blocking To Get Harder
Nov 15, 2022
A focus of this issue’s main feature is chrome is shifting from Manifest V2 extensions to V3; the process is expected to be complete in January 2023. According to the Chrome peeps, it will offer “increased safety and peace of mind”. Until then, Manif
1 min read
In Brief
Linux Format
Article
In Brief
Jun 1, 2021
Mu is a code editor for many forms of Python. We can write standard Python 3 code, create web apps and write code for microcontrollers such as the new Raspberry Pi Pico. Mu is designed for new users and does away with complicated IDEs in favour of a
1 min read
Elasticsearch And Kibana Basics
Linux Format
Article
Elasticsearch And Kibana Basics
Dec 15, 2020
1 min read
Traefik Configuration
Linux Format
Article
Traefik Configuration
Mar 10, 2020
In this tutorial we have configured Traefik using command-line switches in our Docker Compose file (the section starting command:). This is the equivalent of starting the application with a whole bunch of command options each time, and while this wou
1 min read
Automatically Provision Devices With Ansible
Linux Format
Article
Automatically Provision Devices With Ansible
Nov 15, 2022
Matt Holder has worked in IT support for over a decade, and always tries to utilise Linux alongside other installed systems. C loud computing is a term that means a number of things. Software as a Service (SaaS) is one such example of what can be hos
9 min read
Route Traffic Between Networks Using A Pi
Linux Format
Article
Route Traffic Between Networks Using A Pi
Jun 2, 2020
A deep-dive into Pi networking solutions resulted in this tutorial. The goal was to uncover a Pi configuration that would enable the routing of network traffic from a wired network to a wireless network. The aim is to build a network router using a R
10 min read
Join the Pod, Man!
Linux Format
Article
Join the Pod, Man!
May 30, 2023
8 min read
Docker vs Podman
APC
Article
Docker vs Podman
Apr 19, 2021
When Cockpit was first developed, it had plug-in support for administering your Docker containers remotely via its user-friendly web interface. But then Red Hat OS became a major backer of Cockpit, and when Red Hat developed its own alternative to Do
1 min read
Tensor Flow 101
APC
Article
Tensor Flow 101
Jan 27, 2020
4 min read
Why Are We Stuck With M.2 When U.2 Is So Much Better?
APC
Article
Why Are We Stuck With M.2 When U.2 Is So Much Better?
May 22, 2023
4 min read
Understanding CPUs
PC Powerplay
Article
Understanding CPUs
Sep 2, 2019
10 min read
How To Develop A RESTful Client In Go
Linux Format
Article
How To Develop A RESTful Client In Go
Nov 16, 2021
Mihalis Tsoukalos is a systems engineer and technical writer. He’s the author of Go Systems Programming and Mastering Go. You can reach him at @mactsouk. The subject of this month’s tutorial is RESTful services. In particular, you’re going to learn h
9 min read
What Is The Future Of Game Streaming Now That Stadia Is Dead?
APC
Article
What Is The Future Of Game Streaming Now That Stadia Is Dead?
Oct 31, 2022
Once hyped as being ‘the future of gaming’, the Google Stadia game streaming service was officially, just three years after launch and before even making it to Australian shores. When game streaming first launched we did have some apprehension about
2 min read
A Place For Everything
Outdoor Photographer
Article
A Place For Everything
Aug 10, 2019
9 min read
Scan And Scrape Websites Using Python
Linux Format
Article
Scan And Scrape Websites Using Python
Nov 14, 2023
David Bolton once accidentally boosted the traffic for his firm’s website by 25% in one day by running a web scraper on it. Luckily, they never found out! Ever since the web made an appearance back in the mid-’90s, programmers have been writing softw
6 min read
ORGANIZING YOUR PHOTOS, PART 2: Using Keywords
Outdoor Photographer
Article
ORGANIZING YOUR PHOTOS, PART 2: Using Keywords
Sep 14, 2019
10 min read
Google Answer Box Strategy
Techfastly
Article
Google Answer Box Strategy
Sep 21, 2020
Leveraging the Google PAA (People Also Ask) element on a Search Results Page for Targeted Content Creation with a Python Scraper All businesses that are online today are creating content at a furious pace. According to Technavio, a research firm, con
7 min read
Observability Of The Kernel And Containers
Linux Format
Article
Observability Of The Kernel And Containers
Apr 4, 2023
Mihalis Tsoukalos is currently working on Time Series. You can reach him at: @mactsouk. For our final delve into eBPF, we’re tackling applications, the kernel and Docker containers. At the end of the day, all Linux machines execute code for applicat
10 min read
Enterprise-grade Monitoring Made Easy
Linux Format
Article
Enterprise-grade Monitoring Made Easy
Mar 10, 2020
9 min read
Tweaking System Components
Linux Format
Article
Tweaking System Components
Nov 19, 2019
4 min read
Cerebro
Linux Format
Article
Cerebro
Jul 26, 2022
1 min read
Mac 911
MacWorld
Article
Mac 911
Sep 18, 2018
5 min read
Find And Clean Up Your Config Files
Linux Format
Article
Find And Clean Up Your Config Files
Feb 11, 2020
10 min read
Mac 911
MacWorld
Article
Mac 911
Apr 20, 2021
7 min read
“There’s No Single ‘Best’ Language To Learn. I Think The Real Key Is To Learn How To Write Code”
PC Pro Magazine
Article
“There’s No Single ‘Best’ Language To Learn. I Think The Real Key Is To Learn How To Write Code”
Oct 8, 2022
9 min read
Scikit-Learn: The Ultimate Python Library
APC
Article
Scikit-Learn: The Ultimate Python Library
Jul 15, 2019
4 min read
FLASK Web Frameworks
Linux Format
Article
FLASK Web Frameworks
Jun 4, 2019
The main focus of Python has always been to get you cracking on with your coding – the language was never made for web programming. However, this has just made it more interesting to extend the language for the web, or to create an interface to web-b
9 min read

Related categories

Skip carousel

Reviews for Solr Cookbook - Third Edition

Rating: 0 out of 5 stars

0 ratings

0 ratings0 reviews

Book preview

Solr Cookbook - Third Edition - Rafał Kuć

Solr Cookbook Third Edition

Credits

About the Author

Acknowledgments

About the Reviewers

www.PacktPub.com

Support files, eBooks, discount offers, and more

Why subscribe?

Free access for Packt account holders

Preface

What this book covers

What you need for this book

Who this book is for

Sections

Getting ready

How to do it…

How it works…

There's more…

See also

Conventions

Reader feedback

Customer support

Downloading the example code

Errata

Piracy

Questions

1. Apache Solr Configuration

Introduction

Running Solr on a standalone Jetty

Getting ready

How to do it...

How it works...

There's more...

I want Jetty to run on a different port

Buffer size is too small

Installing ZooKeeper for SolrCloud

Getting ready

How to do it...

How it works...

Migrating configuration from master-slave to SolrCloud

Getting ready

How to do it...

How it works...

Choosing the proper directory configuration

How to do it...

How it works...

Configuring the Solr spellchecker

How to do it...

How it works...

There's more...

More than one spellchecker

Using Solr in a schemaless mode

How to do it...

How it works...

Limiting I/O usage

Getting ready

How to do it...

How it works...

Using core discovery

How to do it...

How it works...

There's more...

Configuring SolrCloud for NRT use cases

How to do it...

How it works...

Configuring SolrCloud for high-indexing use cases

Getting ready

How to do it...

How it works...

Configuring SolrCloud for high-querying use cases

Getting ready

How to do it...

How it works...

Configuring the Solr heartbeat mechanism

How to do it...

How it works...

There's more...

Enabling and disabling the heartbeat mechanism

Changing similarity

Getting ready

How to do it...

How it works...

There's more...

Changing the global similarity

2. Indexing Your Data

Introduction

Indexing PDF files

How to do it...

How it works...

Counting the number of fields

How to do it...

How it works...

Using parsing update processors to parse data

Getting ready

How to do it...

How it works...

See also

Using scripting update processors to modify documents

Getting ready

How to do it...

How it works...

See also

Indexing data from a database using Data Import Handler

How to do it...

How it works...

There's more...

How to change the default behavior of deleting index contents at the beginning of a full import

Incremental imports with DIH

Getting ready

How to do it...

How it works...

See also

Transforming data when using DIH

Getting ready

How to do it...

How it works...

There's more...

Using scripts other than JavaScript

Indexing multiple geographical points

How to do it...

How it works...

See also

Updating document fields

How to do it...

How it works...

Detecting the document language during indexation

How to do it...

How it works...

There's more...

Language identification based on Apache Tika

Optimizing the primary key indexation

How to do it...

How it works...

See also

Handling multiple currencies

How to do it...

How it works...

There's more...

Setting up your own currency provider

3. Analyzing Your Text Data

Introduction

Using the enumeration type

How to do it...

How it works...

Removing HTML tags during indexing

How to do it...

How it works...

There's more...

Preserving defined tags

See also

Storing data outside of Solr index

How to do it...

How it works...

Using synonyms

How to do it...

How it works...

There's more...

Equivalent synonyms setup

See also

Stemming different languages

How to do it...

How it works...

There's more...

Using nonaggressive stemmers

How to do it...

How it works...

There's more...

Using the n-gram approach to do performant trailing wildcard searches

How to do it...

How it works...

Using position increment to divide sentences

How to do it...

How it works...

Using patterns to replace tokens

How to do it...

How it works...

There's more...

Using solr.PatternReplaceCharFilterFactory

4. Querying Solr

Introduction

Understanding and using the Lucene query language

How to do it...

How it works...

See also

Using position aware queries

How to do it...

How it works...

There's more...

Too many generated queries

Using boosting with autocomplete

How to do it...

How it works...

Phrase queries with shingles

How to do it...

How it works...

See also

Handling user queries without errors

Getting ready

How to do it...

How it works...

See also

Handling hierarchies with nested documents

How to do it...

How it works...

There's more...

Returning children documents in the query

Sorting data on the basis of a function value

How to do it...

How it works...

Controlling the number of terms needed to match

Getting ready

How to do it...

How it works...

See also

Affecting document score using function queries

How to do it...

How it works...

See also

Using simple nested queries

How to do it...

How it works...

Using the Solr document query join functionality

How to do it...

How it works...

Handling typos with n-grams

How to do it...

How it works...

Rescoring query results

How to do it...

How it works...

5. Faceting

Introduction

Getting the number of documents with the same field value

How to do it...

How it works...

There's more...

How to show facets with counts greater than zero

Lexicographical sorting of the faceting results

Getting the number of documents with the same value range

How to do it...

How it works...

Getting the number of documents matching the query and subquery

How to do it...

How it works...

Removing filters from faceting results

Getting ready

How to do it...

How it works...

Using decision tree faceting

How to do it...

How it works...

Calculating faceting for relevant documents in groups

Getting ready

How to do it...

How it works...

Improving faceting performance for low cardinality fields

Getting ready

How to do it...

How it works...

There's more...

Using per segment field cache for faceting calculation

Specifying the number of faceting threads

6. Improving Solr Performance

Introduction

Handling deep paging efficiently

How to do it...

How it works...

See also

Configuring the document cache

Getting ready

How to do it...

How it works...

Configuring the query result cache

Getting ready

How to do it...

How it works...

Configuring the filter cache

Getting ready

How to do it...

How it works...

Improving Solr query performance after the start and commit operations

How to do it...

How it works...

There's more...

Improving Solr performance after committing operations

Lowering the memory consumption of faceting and sorting

How to do it...

How it works...

Speeding up indexing with Solr segment merge tuning

How to do it...

How it works...

There's more...

Increasing the RAM buffer size to improve the indexing throughput

Speeding up querying with merge policy tuning

See also

Avoiding caching of rare filters to improve the performance

How to do it...

How it works...

Controlling the filter execution to improve expensive filter performance

Getting ready

How to do it...

How it works...

Configuring numerical fields for high-performance sorting and range queries

How to do it...

How it works...

See also

7. In the Cloud

Introduction

Creating a new SolrCloud cluster

Getting ready

How to do it...

How it works...

There's more...

Starting an embedded ZooKeeper server

Specifying the Solr server name

Setting up multiple collections on a single cluster

Getting ready

How to do it...

How it works...

Splitting shards

Getting ready

How to do it...

How it works...

Having more than a single shard from a collection on a node

Getting ready

How to do it...

How it works...

Creating a collection on defined nodes

Getting ready

How to do it...

How it works...

Adding replicas after collection creation

Getting ready

How to do it...

How it works...

Removing replicas

Getting ready

How to do it...

How it works...

Moving shards between nodes

Getting ready

How to do it...

How it works...

Using aliasing

Getting ready

How to do it...

How it works...

Using routing

Getting ready

How to do it...

How it works...

8. Using Additional Functionalities

Introduction

Finding similar documents

How to do it...

How it works...

Highlighting fragments found in documents

How to do it...

How it works...

There's more...

Changing the default HTML tags that surround the matched content

Efficient highlighting

How to do it...

How it works...

Using versioning

Getting ready

How to do it...

How it works...

Retrieving information about the index structure

How to do it...

How it works...

There's more...

Retrieving the index structure information in XML

Retrieving information about dynamic fields

Retrieving information about copy fields

See also

Altering the index structure on a live collection

Getting ready

How to do it...

How it works...

See also

Grouping documents by the field value

How to do it...

How it works...

There's more...

Having more than a single document in a group

Modifying the number of returned groups

Grouping documents by the query value

Getting ready

How to do it…

How it works...

Grouping documents by the function value

Getting ready

How to do it...

How it works...

Efficient documents grouping using the post filter

Getting ready

How to do it...

How it works...

There's more...

Expanding collapsed groups

9. Dealing with Problems

Introduction

Dealing with the too many opened files exception

How to do it...

How it works...

Diagnosing and dealing with memory problems

How to do it...

How it works...

There's more...

Seeing heap when out of memory error occurs

Configuring sorting for non-English languages

How to do it...

How it works...

Migrating data to another collection

Getting ready

How to do it...

How it works...

SolrCloud read-side fault tolerance

Getting ready

How to do it...

How it works...

There's more...

Defining the achieved replication factor

Using the check index functionality

How to do it...

How it works...

There's more...

Checking the index without the repair procedure

Adjusting the Jetty configuration to avoid deadlocks

Getting ready

How to do it...

How it works...

Tuning segment merging

How to do it...

How it works...

See also

Avoiding swapping

Getting ready

How to do it...

How it works...

10. Real-life Situations

Introduction

Implementing the autocomplete functionality for products

How to do it...

How it works...

Implementing the autocomplete functionality for categories

How to do it...

How it works...

Handling time-sliced data using aliases

Getting ready

How to do it...

How it works...

There's more...

Deleting an alias

Boosting words closer to each other

How to do it...

How it works...

Using the Solr spellchecking functionality

Getting ready

How to do it...

How it works...

Using the Solr administration panel for monitoring

How to do it...

How it works...

There's more...

SPM Performance Monitoring & Alerting

Automatically expiring Solr documents

How to do it...

How it works...

There's more...

Changing the time to live parameter name

Exporting whole query results

How to do it...

How it works...

Index

Solr Cookbook Third Edition

All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author, nor Packt Publishing, and its dealers and distributors will be held liable for any damages caused or alleged to be caused directly or indirectly by this book.

Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

First published: July 2011

Second edition: January 2013

Third edition: January 2015

Production reference: 1200115

Published by Packt Publishing Ltd.

Livery Place

35 Livery Street

Birmingham B3 2PB, UK.

ISBN 978-1-78355-315-0

www.packtpub.com

Credits

Author

Rafał Kuć

Reviewers

Sunil Gulabani

Charles Lee

Stefan Matheis

Marcelo Ochoa

Walt Stoneburner

Ning Sun

Commissioning Editor

Ashwin Nair

Acquisition Editor

Richard Brookes-Bland

Content Development Editor

Prachi Bisht

Technical Editors

Mrunal M. Chavan

Dennis John

Copy Editors

Sayanee Mukherjee

Rashmi Sawant

Project Coordinator

Sageer Parkar

Proofreaders

Simran Bhogal

Samuel Redman Birch

Maria Gould

Ameesha Green

Paul Hindle

Indexer

Tejal Soni

Graphics

Sheetal Aute

Production Coordinator

Nitesh Thakur

Cover Work

Nitesh Thakur

About the Author

Rafał Kuć is a born team leader and software developer. He currently works as a consultant and software engineer at Sematext Group, Inc., where he concentrates on open source technologies such as Apache Lucene and Solr, Elasticsearch, and Hadoop stack. He has more than 14 years of experience in various software branches—from banking software to e-commerce products. He focuses mainly on Java but is open to every tool and programming language that will make the achievement of his goal easier and faster. Rafał is also one of the founders of the solr.pl site, where he tries to share his knowledge and help people with the problems they face with Solr and Lucene. He is also a speaker at various conferences around the world, such as Lucene Eurocon, Berlin Buzzwords, ApacheCon, Lucene Revolution, and DevOps Days.

Rafał began his journey with Lucene in 2002, and it wasn't love at first sight. When he came back to Lucene in late 2003, he revised his thoughts about the framework and saw the potential in search technologies. Then, Solr came along and that was it. He started working with Elasticsearch in the middle of 2010. Currently, Lucene, Solr, Elasticsearch, and information retrieval are his main points of interest.

Rafał is also the author of Apache Solr 3.1 Cookbook, and the update to it, Apache Solr 4.0 Cookbook, both published by Packt Publishing. He also authored Elasticsearch-related books, ElasticSearch Server and its second edition, and the first and second editions of Mastering ElasticSearch, all published by Packt Publishing.

This book is a second update to the first book I ever wrote— Apache Solr 3.1 Cookbook, Packt Publishing. Again, similar to Apache Solr Cookbook 4.0, Packt Publishing, what meant to be an update turned out to be almost a complete rewrite because of the pending release of Solr 5.0 and the changes to Solr itself. Between Solr 4.0 and 5.0, there were a lot of changes and additions to Solr, and I know I didn't manage to gather them all in the recipes that are present in the book you are holding. However, I hope that if you are either using Solr 4.x or Solr 5.0, this book will help you overcome some common problems and will push your knowledge about Solr a bit further.

Acknowledgments

Although I would go the same way if I could go back in time, the time during the writing of this book was not easy for my family. The ones that suffered from this the most were my wife, Agnes, and my two great kids—son Philip and daughter Susanna. Without their patience and understanding, writing this book wouldn't have been possible. I would also like to thank my and Agnes' parents for their support and help.

I would like to thank all the people involved in creating, developing, and maintaining Lucene and Solr projects for their work and passion. Without them, this book wouldn't have been written.

Once again, thank you.

About the Reviewers

Sunil Gulabani is a technical geek in software development based in Ahmedabad, Gujarat, India. He graduated in commerce from S. M. Patel Institute of Commerce (SMPIC) and has a master's degree in computer applications from Ahmedabad Education Society Institute of Computer Studies (AESICS). He had been a top ranker while pursuing his master's degree.

He has also presented a paper Effective Label Matching For Automated Evaluation of Use -- Case Diagrams on Technology For Education (T4E)—IIIT Hyderabad, an IEEE conference, along with senior lecturers, Vinay Vachharajani and Dr. Jyoti Pareek.

Since 2011, he has been working as a software engineer and is cloud technology savvy. He has experience in developing enterprise solutions using Java (EE), Apache Solr, RESTful Web Services, GWT, Smart GWT, Amazon Web Services (AWS), Redis, Memcache, MongoDB, and others. He has a keen interest in system architecture and integration, data modeling, relational databases, and mapping with NoSQL for high throughput.

He is the author of Developing RESTful Web Services with Jersey 2.0, Packt Publishing, that looks at JAX-RS 2.0, which is an enhanced framework based on the RESTful architecture. He also reviewed the book RESTful Web Services with Dropwizard, Packt Publishing.

He also takes interest in writing tech blogs and is actively involved in knowledge-sharing communities such as JUG-Ahmedabad, GDG Ahmedabad, and Ahmedabad University.

You can visit him online at http://www.sunilgulabani.com and follow him on Twitter at @sunil_gulabani.He can be reached directly at .

Stefan Matheis is a freelance backend engineer, currently living in Zurich, Switzerland. He likes to work on projects around API development, natural language processing, graph databases, and infrastructure management. Lately, he got involved in payment and logistics projects. Stefan is an Apache Lucene/Solr committer since 2012 as well as a member of the project management committee. His main contribution was the new Admin UI, which is shipped with all Solr releases since 4.0.

Marcelo Ochoa works at the System Laboratory of Facultad de Ciencias Exactas of the Universidad Nacional del Centro de la Provincia de Buenos Aires and is the CTO at Scotas.com, a company specialized in near real-time search solutions using Apache Solr and Oracle. He divides his time between university jobs and external projects related to Oracle and Big Data technologies. He has worked on several Oracle-related projects such as translation of Oracle manuals and multimedia CBTs. His background is in database, network, Web, and Java technologies. In the XML world, he is known as the developer of the DB Generator for the Apache Cocoon project, the open source projects DBPrism and DBPrism CMS, the Lucene-Oracle integration using Oracle JVM Directory implementation, and in the Restlet.org project, the Oracle XDB Restlet Adapter (an alternative to writing native REST web services inside the database-resident JVM).

Since 2006, he has been part of the Oracle ACE program; Oracle ACEs are known for their strong credentials as Oracle community enthusiasts and advocates, with candidates nominated by ACEs in the Oracle Technology and Applications communities.

He is the author of Chapter 17, 360-Degree Programming the Oracle Database, of the book, Oracle Database Programming using Java and Web Services, Kuassi Mensah, Elsevier Digital Press, and Chapter 21, DB Prism: A Framework to Generate Dynamic XML from a Database, of the book, Professional XML Databases, Kevin Williams, Wrox Press.

Walt Stoneburner is a software architect with over 25 years of commercial application development and consulting experience. Fringe passions involve quality assurance, configuration management, and security. If cornered, he might actually admit to liking statistics and authoring documentation as well.

He is easily amused by programming language design, collaborative applications, Big Data, knowledge management, data visualization, and ASCII art. Self-described as a closet geek, Walt also evaluates software products and consumer electronics, draws comics, runs a freelance photography studio specializing in portraits and art (CharismaticMoments.com), writes humor pieces, performs sleights of hand, enjoys game design, and can occasionally be found on ham radio.

Walt can be reached directly via email at <wls@wwco.com> or . He publishes a tech and humor blog called the Walt-O-Matic at http://www.wwco.com/~wls/blog/.

His other book reviews and contributions include:

AntiPatterns and Patterns in Software Configuration Management, John Wiley & Sons (ISBN 978-0-471-32929-9, p. xi)

Exploiting Software: How to Break Code, Addison-Wesley Professional (ISBN 978-0-201-78695-8, p. xxxiii)

Ruby on Rails Web Mashup Projects, Packt Publishing (ISBN 978-1-847193-93-3)

Building Dynamic Web 2.0 Websites with Ruby on Rails, Packt Publishing (ISBN 978-1-847193-41-4)

Instant Sinatra Starter, Packt Publishing (ISBN 978-1782168218)

C++ Multithreading Cookbook, Packt Publishing (978-1-78328-979-0)

Learning Selenium Testing Tools with Python, Packt Publishing (978-1-78398-350-6)

Whittier (ASIN B00GTD1RBS)

Cooter Brown's South Mouth Book of Hillbilly Wisdom, CreateSpace Independent Publishing Platform (ISBN 978-1-482340-99-0)

Ning Sun is a software engineer currently working for a China-based start-up, LeanCloud, providing one-stop Backend as a Service (BaaS) for mobile apps. Being a startup engineer, he solves various kinds of problems and plays different kinds of roles. However, he has always been an enthusiast for open source technology. He contributes to several open source projects and has also learned a lot from them.

Ning worked on Delicious.com in 2013, which is known as one of the most important websites in early Web 2.0 EAR. The search for Delicious is fully powered by a Solr cluster, and it might be one of the largest deployments for Solr.

You can always find Ning on Github.com/sunng87 and Twitter.com/Sunng.

www.PacktPub.com

Support files, eBooks, discount offers, and more

For support files and downloads related to your book, please visit www.PacktPub.com.

Did you know that Packt offers eBook versions of every book published, with PDF and ePub files available? You can upgrade to the eBook version at www.PacktPub.com and as a print book customer, you are entitled to a discount on the eBook copy. Get in touch with us at for more details.

At www.PacktPub.com, you can also read a collection of free technical articles, sign up for a range of free newsletters and receive exclusive discounts and offers on Packt books and eBooks.

https://www2.packtpub.com/books/subscription/packtlib

Do you need instant solutions to your IT questions? PacktLib is Packt's online digital book library. Here, you can search, access, and read Packt's entire library of books.

Why subscribe?

Fully searchable across every book published by Packt

Copy and paste, print, and bookmark content

On demand and accessible via a web browser

Free access for Packt account holders

If you have an account with Packt at www.PacktPub.com, you can use this to access PacktLib today and view 9 entirely free books. Simply use your login credentials for immediate access.

Preface

Welcome to Solr Cookbook, Third Edition. You will be taken on a tour of the most common problems that a user might face while dealing with Apache Solr. You will also explore some of the features that were recently introduced in Solr. You will learn how to deal with the problems when configuring and setting up Solr, handle common queries, fine-tune Solr instances, set up and use SolrCloud, use faceting and grouping, fighting common problems, and many more things. Each and every recipe is based on real-life problems and provides solutions along with detailed descriptions of the configuration and code that was used.

What this book covers

Chapter 1, Apache Solr Configuration, covers Solr configuration recipes, along with setting up ZooKeeper, migrating from master to slave, and configuring Solr for different use cases.

Chapter 2, Indexing Your Data, as the name suggests, explains data indexing, such as binary files indexing, using Data Import Handler, language detection, updating a single field of document, and much more.

Chapter 3, Analyzing Your Text Data, concentrates on common problems when analyzing your data, such as stemming, geographical location indexing, or using synonyms.

Chapter 4, Querying Solr, describes querying Apache Solr, such as nesting queries, affecting the scoring of documents, phrase searching, or using the parent-child relationship.

Chapter 5, Faceting, is dedicated to the faceting mechanism in which you can find the information needed to overcome some problems that you might encounter while working with Solr and faceting.

Chapter 6, Improving Solr Performance, focuses on improving your Apache Solr cluster performance with information such as cache configuration, indexing speed up, and much more.

Chapter 7, In the Cloud, covers the cloud side of Solr—SolrCloud, setting up collections, replicas configuration, distributed indexing and searching, as well as aliasing and shard manipulation.

Chapter 8, Using Additional Functionalities, explains how we can highlight long text fields, sort results on the basis of function value, check user spelling mistakes, and use the grouping functionality.

Chapter 9, Dealing with Problems, is a small chapter dedicated to the most common situations such as memory problems, tuning segment merges, and others.

Chapter 10, Real-life Situations, describes how to handle real-life situations such as implementing different autocomplete functionalities, using near real-time search, or improving query relevance.

What you need for this book

In order to run most of the examples in this book, you will need Java Runtime Environment 1.7 or the newer version and of course, the 4.10 or the newer version of Apache Solr search server. To run examples found in this book, you might need a web browser or a command-line tool that is able to run HTTP requests such as curl.

The recipes in this book (unless stated otherwise) are tested in a Linux environment with the latest available Version of Solr 5.0. For Windows-based hosts, the single quotes should be replaced with double quotes in the commands. Remember that during the writing of this book, the final Version of Solr 5.0 was not released and there might have been changes between the version used during testing and the released Version of Solr 5.0.

A few chapters in this book require additional software such as Apache ZooKeeper 3.4.3 or Jetty.

Who this book is for

This book is for intermediate Solr Developers who are willing to learn and implement pro-level practices, techniques, and solutions. This edition will specifically appeal to developers who wish to quickly get to grips with the changes and new features of Apache Solr 5.

Sections

In this book, you will find several headings that appear frequently (Getting ready, How to do it, How it works, There's more, and See also).

To give clear instructions on how to complete a recipe, we use these sections as follows:

Getting ready

This section tells you what to expect in the recipe, and describes how to set up any software or any preliminary settings required for the recipe.

How to do it…

This section contains the steps required to follow the recipe.

How it works…

This section usually consists of a detailed explanation of what happened in the previous section.

There's more…

This section consists of additional information about the recipe in order to make the reader more knowledgeable about the recipe.

Conventions

In this book, you will find a number of text styles that distinguishes between different kinds of information. Here are some examples of these styles, and an explanation of their meaning.

Code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles are shown as follows: The lib entry in the solrconfig.xml file tells Solr to look for all the JAR files from the ../../langid directory.

A block of code is set as follows:

id type=string indexed=true stored=true required=true multiValued=false />

name type=text_general indexed=true stored=true/>

description type=text_general indexed=true stored=true />

langId type=string indexed=true stored=true />

When we wish to draw your attention to a particular part of a code block, the relevant lines or items are set in bold:

id type=string indexed=true stored=true required=true multiValued=false />

name type=text_general indexed=true stored=true/>

description type=text_general indexed=true stored=true />

langId type=string indexed=true stored=true />

Any command-line input or output is written as follows:

curl 'localhost:8983/solr/update?commit=true' -H 'Content-type:application/json' -d '[{id:1,file:{set:New file name}}]'

New terms and important words are shown in bold. Words that you see on the screen, for example, in menus or dialog boxes, appear in the text like this: The Overview page for a collection gives you basic statistics about the core of the collection such as number of documents, heap memory usage, version of the index, number of segments, and so on.

Note

Warnings or important notes appear in a box like this.

Tip

Tips and tricks appear like this.

Reader feedback

Feedback from our readers is always welcome. Let us know what you think about this book—what you liked or may have disliked. Reader feedback is important for us to develop titles that you really get the most out of.

To send us general feedback, simply send an e-mail to <feedback@packtpub.com>, and mention the book title via the subject of your message.

If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, see our author guide on www.packtpub.com/authors.

Customer support

Now that you are the proud owner of a Packt book, we have

Enjoying the preview?

Page 1 of 1

Solr Cookbook - Third Edition

About this ebook

Rafał Kuć

Read more from Rafał Kuć

Related authors

Related to Solr Cookbook - Third Edition

Related ebooks

Computers For You

Related podcast episodes

Related articles

Related categories

Reviews for Solr Cookbook - Third Edition

What did you think?

Book preview

Solr Cookbook - Third Edition - Rafał Kuć

Table of Contents

Solr Cookbook Third Edition

Solr Cookbook Third Edition

Credits

About the Author

Acknowledgments

About the Reviewers

Support files, eBooks, discount offers, and more

Why subscribe?

Preface

What this book covers

What you need for this book

Who this book is for

Sections

Getting ready

There's more…

See also

Conventions

Note

Tip

Reader feedback

Customer support