Package net.sf.saxon.expr.parser
Class Tokenizer
- java.lang.Object
- 
- net.sf.saxon.expr.parser.Tokenizer
 
- 
 public final class Tokenizer extends java.lang.ObjectTokenizer for expressions and inputs.This code was originally derived from James Clark's xt, though it has been greatly modified since. See copyright notice at end of file. 
- 
- 
Field SummaryFields Modifier and Type Field Description booleanallowSaxonExtensionsFlag to allow Saxon extensionsstatic intBARE_NAME_STATEState in which a name is NOT to be merged with what comes next, for example "("intcurrentTokenThe number identifying the most recently read tokenintcurrentTokenStartOffsetThe position in the input expression where the current token startsjava.lang.StringcurrentTokenValueThe string value of the most recently read tokenstatic intDEFAULT_STATEInitial default state of the TokenizerbooleandisallowUnionKeywordFlag to disallow "union" as a synonym for "|" when parsing XSLT 2.0 patternsjava.lang.StringinputThe string being parsedintinputOffsetThe current position within the input stringbooleanisXQueryFlag to indicate that this is XQuery as distinct from XPathintlanguageLevelXPath language level: e.g.static intOPERATOR_STATEState in which the next thing to be read is an operatorstatic intSEQUENCE_TYPE_STATEState in which the next thing to be read is a SequenceType
 - 
Constructor SummaryConstructors Constructor Description Tokenizer()
 - 
Method SummaryAll Methods Instance Methods Concrete Methods Modifier and Type Method Description intgetColumnNumber()Get the column number of the current tokenintgetColumnNumber(int offset)Return the column number corresponding to a given offset in the expressionintgetLineNumber()Get the line number of the current tokenintgetLineNumber(int offset)Return the line number corresponding to a given offset in the expressionintgetState()Get the current tokenizer statevoidincrementLineNumber(int offset)Increment the line number, making a record of where in the input string the newline character occurred.voidlookAhead()Look ahead by one token.voidnext()Get the next token from the input expression.charnextChar()Read next character directly.voidsetState(int state)Set the tokenizer into a special statevoidtokenize(java.lang.String input, int start, int end)Prepare a string for tokenization.voidtreatCurrentAsOperator()Force the current token to be treated as an operator if possiblevoidunreadChar()Step back one character.
 
- 
- 
- 
Field Detail- 
DEFAULT_STATEpublic static final int DEFAULT_STATE Initial default state of the Tokenizer- See Also:
- Constant Field Values
 
 - 
BARE_NAME_STATEpublic static final int BARE_NAME_STATE State in which a name is NOT to be merged with what comes next, for example "("- See Also:
- Constant Field Values
 
 - 
SEQUENCE_TYPE_STATEpublic static final int SEQUENCE_TYPE_STATE State in which the next thing to be read is a SequenceType- See Also:
- Constant Field Values
 
 - 
OPERATOR_STATEpublic static final int OPERATOR_STATE State in which the next thing to be read is an operator- See Also:
- Constant Field Values
 
 - 
currentTokenpublic int currentToken The number identifying the most recently read token
 - 
currentTokenValuepublic java.lang.String currentTokenValue The string value of the most recently read token
 - 
currentTokenStartOffsetpublic int currentTokenStartOffset The position in the input expression where the current token starts
 - 
inputpublic java.lang.String input The string being parsed
 - 
inputOffsetpublic int inputOffset The current position within the input string
 - 
disallowUnionKeywordpublic boolean disallowUnionKeyword Flag to disallow "union" as a synonym for "|" when parsing XSLT 2.0 patterns
 - 
isXQuerypublic boolean isXQuery Flag to indicate that this is XQuery as distinct from XPath
 - 
languageLevelpublic int languageLevel XPath language level: e.g. 2.0, 3.0, or 3.1
 - 
allowSaxonExtensionspublic boolean allowSaxonExtensions Flag to allow Saxon extensions
 
- 
 - 
Method Detail- 
getStatepublic int getState() Get the current tokenizer state- Returns:
- the current state
 
 - 
setStatepublic void setState(int state) Set the tokenizer into a special state- Parameters:
- state- the new state
 
 - 
tokenizepublic void tokenize(java.lang.String input, int start, int end) throws XPathExceptionPrepare a string for tokenization. The actual tokens are obtained by calls on next()- Parameters:
- input- the string to be tokenized
- start- start point within the string
- end- end point within the string (last character not read): -1 means end of string
- Throws:
- XPathException- if a lexical error occurs, e.g. unmatched string quotes
 
 - 
nextpublic void next() throws XPathExceptionGet the next token from the input expression. The type of token is returned in the currentToken variable, the string value of the token in currentTokenValue.- Throws:
- XPathException- if a lexical error is detected
 
 - 
treatCurrentAsOperatorpublic void treatCurrentAsOperator() Force the current token to be treated as an operator if possible
 - 
lookAheadpublic void lookAhead() throws XPathExceptionLook ahead by one token. This method does the real tokenization work. The method is normally called internally, but the XQuery parser also calls it to resume normal tokenization after dealing with pseudo-XML syntax.- Throws:
- XPathException- if a lexical error occurs
 
 - 
nextCharpublic char nextChar() throws java.lang.StringIndexOutOfBoundsExceptionRead next character directly. Used by the XQuery parser when parsing pseudo-XML syntax- Returns:
- the next character from the input
- Throws:
- java.lang.StringIndexOutOfBoundsException- if an attempt is made to read beyond the end of the string. This will only occur in the event of a syntax error in the input.
 
 - 
incrementLineNumberpublic void incrementLineNumber(int offset) Increment the line number, making a record of where in the input string the newline character occurred.- Parameters:
- offset- the place in the input string where the newline occurred
 
 - 
unreadCharpublic void unreadChar() Step back one character. If this steps back to a previous line, adjust the line number.
 - 
getLineNumberpublic int getLineNumber() Get the line number of the current token- Returns:
- the line number. Line numbers reported by the tokenizer start at zero.
 
 - 
getColumnNumberpublic int getColumnNumber() Get the column number of the current token- Returns:
- the column number. Column numbers reported by the tokenizer start at zero.
 
 - 
getLineNumberpublic int getLineNumber(int offset) Return the line number corresponding to a given offset in the expression- Parameters:
- offset- the byte offset in the expression
- Returns:
- the line number. Line and column numbers reported by the tokenizer start at zero.
 
 - 
getColumnNumberpublic int getColumnNumber(int offset) Return the column number corresponding to a given offset in the expression- Parameters:
- offset- the byte offset in the expression
- Returns:
- the column number. Line and column numbers reported by the tokenizer start at zero.
 
 
- 
 
-