MacLife

macOS

Find missing words in Spotlight search

Why can’t Spotlight find words in the content of some of my PDF documents?

PDF docs can contain two notional layers, the first containing images perhaps from printed pages that were originally scanned in, and a second containing laid–out text that might have been generated by OCR from those scanned images. Those created directly from Word and Pages normally only have the second layer, together with any images embedded in their text.

Currently, when Spotlight indexes PDF documents, it only uses text from that second layer, whereOCR on any page images. It also faces the problem that both layers are intended to appear accurate visual representations of the document, rather than providing structured access to text contents, although that’s starting to change with the growing use of newer standards such as PDF/A.

You’re reading a preview, subscribe to read more.

More from MacLife

MacLife3 min read
Mac Hardware
What should I replace my two Time Capsules with, to store all the Time Machine backups for our four different Macs? Apple made its last Time Capsules in 2018 and even that late model is now approaching the end of its support period. At the very least
MacLife3 min read
Multi–room Audio & Video
ONE OF OUR very favorite things about Apple–compatible smart home technology is how entertaining it is — and we mean that literally. Being able to stream music from your Mac, iPad or iPhone to your home hi–fi, or to have different music playing in di
MacLife2 min read
Help! My Mac Is Broken!
We’ve all seen them. Those irritating error windows that pop up while you’re trying to do something on your Mac are never welcome, but if you know how to read them, they can tell you much about what’s gone wrong. Error code – 36, for example, occurs

Related Books & Audiobooks