Which stage of the indexing pipeline divides text into tokens?
A.
Sectioner
B.
Tokenizer
C.
Filter
D.
Lexer
The Answer Is:
D
This question includes an explanation.
Explanation:
The indexing pipeline in Oracle Text processes text for search:
Correct Answer (D): “Lexer” divides text into tokens (words, symbols) based on language rules and settings (e.g., whitespace, punctuation). It’s the stage responsible for tokenization in Oracle’s text indexing process.
Incorrect Options:
A: Sectioner identifies document sections (e.g., headers), not tokens.
B: Tokenizer is a generic term, but in Oracle Text, “Lexer” is the specific component.
[Reference:Oracle Text Indexing, ]
1z0-931-25 PDF/Engine
Printable Format
Value of Money
100% Pass Assurance
Verified Answers
Researched by Industry Experts
Based on Real Exams Scenarios
100% Real Questions
Get 65% Discount on All Products,
Use Coupon: "ac4s65"