Zscaler’s Data Protection platform integrates Optical Character Recognition (OCR) into its inline Data Loss Prevention (DLP) capabilities. OCR enables Zscaler to extract text embedded within images—such as screenshots, scanned documents, or photos of forms—and subject that text to the same DLP inspection engines that normally analyze plain text content.
Once OCR has converted image content into text, Zscaler can apply predefined dictionaries, custom dictionaries, and advanced classifiers to detect sensitive data types, including personally identifiable information (PII) such as national ID numbers, passport numbers, addresses, or other regulated personal data. This is crucial because many data leaks occur via screenshots or scanned documents that traditional, text-only DLP engines would miss.
While OCR could, in theory, detect patterns related to network configurations, software licenses, or financial transactions, Zscaler’s training and exam materials emphasize its use to protect sensitive data in images—especially user-related regulated data such as PII and other compliance-relevant information. Network configurations and software licenses are better addressed through configuration management and IP protection policies, and “financial transactions” describes activities rather than a specific information pattern. Therefore, Personally Identifiable Information (PII) is the best and most exam-accurate answer for the type of sensitive information protected using OCR.
===========