| java.lang.Object | |
| ↳ | com.sforce.ws.parser.MXParser |
Absolutely minimal implementation of XMLPULL V1 API
| Constants | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| String | FEATURE_NAMES_INTERNED | ||||||||||
| String | FEATURE_XML_ROUNDTRIP | ||||||||||
| int | LOOKUP_MAX | ||||||||||
| char | LOOKUP_MAX_CHAR | ||||||||||
| String | PROPERTY_LOCATION | ||||||||||
| String | PROPERTY_XMLDECL_CONTENT | ||||||||||
| String | PROPERTY_XMLDECL_STANDALONE | ||||||||||
| String | PROPERTY_XMLDECL_VERSION | ||||||||||
| int | READ_CHUNK_SIZE | ||||||||||
| boolean | TRACE_SIZING | ||||||||||
| String | XMLNS_URI | ||||||||||
| String | XML_URI | ||||||||||
|
[Expand]
Inherited Constants | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
From interface
com.sforce.ws.parser.XmlPullParser
| |||||||||||
| Fields | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| NCODING | |||||||||||
| NO | |||||||||||
| TANDALONE | |||||||||||
| VERSION | |||||||||||
| YES | |||||||||||
| allStringsInterned | Implementation notice: the is instance variable that controls if newString() is interning. | ||||||||||
| attributeCount | |||||||||||
| attributeName | |||||||||||
| attributeNameHash | |||||||||||
| attributePrefix | |||||||||||
| attributeUri | |||||||||||
| attributeValue | |||||||||||
| buf | |||||||||||
| bufAbsoluteStart | |||||||||||
| bufEnd | |||||||||||
| bufLoadFactor | |||||||||||
| bufSoftLimit | |||||||||||
| bufStart | |||||||||||
| charRefOneCharBuf | |||||||||||
| columnNumber | |||||||||||
| depth | |||||||||||
| elName | |||||||||||
| elNamespaceCount | |||||||||||
| elPrefix | |||||||||||
| elRawName | |||||||||||
| elRawNameEnd | |||||||||||
| elRawNameLine | |||||||||||
| elUri | |||||||||||
| emptyElementTag | |||||||||||
| entityEnd | |||||||||||
| entityName | |||||||||||
| entityNameBuf | |||||||||||
| entityNameHash | |||||||||||
| entityRefName | |||||||||||
| entityReplacement | |||||||||||
| entityReplacementBuf | |||||||||||
| eventType | |||||||||||
| inputEncoding | |||||||||||
| lineNumber | |||||||||||
| location | |||||||||||
| lookupNameChar | |||||||||||
| lookupNameStartChar | |||||||||||
| namespaceEnd | |||||||||||
| namespacePrefix | |||||||||||
| namespacePrefixHash | |||||||||||
| namespaceUri | |||||||||||
| pastEndTag | |||||||||||
| pc | |||||||||||
| pcEnd | |||||||||||
| pcStart | |||||||||||
| pos | |||||||||||
| posEnd | |||||||||||
| posStart | |||||||||||
| preventBufferCompaction | |||||||||||
| processNamespaces | |||||||||||
| reachedEnd | |||||||||||
| reader | |||||||||||
| roundtripSupported | |||||||||||
| seenAmpersand | |||||||||||
| seenDocdecl | |||||||||||
| seenEndTag | |||||||||||
| seenMarkup | |||||||||||
| seenRoot | |||||||||||
| seenStartTag | |||||||||||
| text | |||||||||||
| tokenize | |||||||||||
| usePC | |||||||||||
| xmlDeclContent | |||||||||||
| xmlDeclStandalone | |||||||||||
| xmlDeclVersion | |||||||||||
|
[Expand]
Inherited Fields | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
From interface
com.sforce.ws.parser.XmlPullParser
| |||||||||||
| Public Constructors | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Public Methods | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
Set new value for entity replacement text as defined in
XML 1.0 Section 4.5
Construction of Internal Entity Replacement Text.
| |||||||||||
Returns the number of attributes of the current start tag, or
-1 if the current event type is not START_TAG
| |||||||||||
Returns the local name of the specified attribute
if namespaces are enabled or just attribute name if namespaces are disabled.
| |||||||||||
Returns the namespace URI of the attribute
with the given index (starts from 0).
| |||||||||||
Returns the prefix of the specified attribute
Returns null if the element has no prefix.
| |||||||||||
Returns the type of the specified attribute
If parser is non-validating it MUST return CDATA.
| |||||||||||
Returns the attributes value identified by namespace URI and namespace localName.
| |||||||||||
Returns the given attributes value.
| |||||||||||
Returns the current column number, starting from 0.
| |||||||||||
Returns the current depth of the element.
| |||||||||||
Returns the type of the current event (START_TAG, END_TAG, TEXT, etc.)
| |||||||||||
Unknown properties are
| |||||||||||
Returns the input encoding if known, null otherwise.
| |||||||||||
Returns the current line number, starting from 1.
| |||||||||||
For START_TAG or END_TAG events, the (local) name of the current
element is returned when namespaces are enabled.
| |||||||||||
Returns the namespace URI of the current element.
| |||||||||||
Returns the URI corresponding to the given prefix,
depending on current state of the parser.
| |||||||||||
Returns the numbers of elements in the namespace stack for the given
depth.
| |||||||||||
Returns the namespace prefixe for the given position
in the namespace stack.
| |||||||||||
Returns the namespace URI for the given position in the
namespace stack
If the position is out of range, an exception is thrown.
| |||||||||||
Return string describing current position of parsers as
text 'STATE [seen %s...] @line:column'.
| |||||||||||
Returns the prefix of the current element.
| |||||||||||
Look up the value of a property.
| |||||||||||
Returns the text content of the current event as String.
| |||||||||||
Returns the buffer that contains the text of the current event,
as well as the start offset and length relevant for the current
event.
| |||||||||||
Returns if the specified attribute was not in input was declared in XML.
| |||||||||||
Returns true if the current event is START_TAG and the tag
is degenerated
(e.g.
| |||||||||||
Checks whether the current TEXT event contains only whitespace
characters.
| |||||||||||
Get next parsing event - element content wil be coalesced and only one
TEXT event must be returned for whole element content
(comments and processing instructions will be ignored and emtity references
must be expanded or exception mus be thrown if entity reerence can not be exapnded).
| |||||||||||
Call next() and return event if it is START_TAG or END_TAG
otherwise throw an exception.
| |||||||||||
If current event is START_TAG then if next element is TEXT then element content is returned
or if next event is END_TAG then empty string is returned, otherwise exception is thrown.
| |||||||||||
This method works similarly to next() but will expose
additional event types (COMMENT, CDSECT, DOCDECL, ENTITY_REF, PROCESSING_INSTRUCTION, or
IGNORABLE_WHITESPACE) if they are available in input.
| |||||||||||
Test if the current event is of the given type and if the
namespace and name do match.
| |||||||||||
Method setFeature
| |||||||||||
Set the input source for parser to the given reader and
resets the parser.
| |||||||||||
Sets the input stream the parser is going to process.
| |||||||||||
Set the value of a property.
| |||||||||||
Skip sub tree that is currently porser positioned on.
| |||||||||||
| Protected Methods | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
Make sure that in attributes temporary array is enough space.
| |||||||||||
Make sure that we have enough space to keep element stack if passed size.
| |||||||||||
simplistic implementation of hash function that has constant
time to compute - so it also means diminishing hash quality for long strings
but for XML parsing it should be good enough ...
| |||||||||||
|
[Expand]
Inherited Methods | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
From class
java.lang.Object
| |||||||||||
From interface
com.sforce.ws.parser.XmlPullParser
| |||||||||||
Implementation notice: the is instance variable that controls if newString() is interning.
NOTE: newStringIntern always returns interned strings and newString MAY return interned String depending on this variable.
NOTE: by default in this minimal implementation it is false!
Set new value for entity replacement text as defined in XML 1.0 Section 4.5 Construction of Internal Entity Replacement Text. If FEATURE_PROCESS_DOCDECL or FEATURE_VALIDATION are set, calling this function will result in an exception -- when processing of DOCDECL is enabled, there is no need to the entity replacement text manually.
The motivation for this function is to allow very small implementations of XMLPULL that will work in J2ME environments. Though these implementations may not be able to process the document type declaration, they still can work with known DTDs by using this function.
Please notes: The given value is used literally as replacement text and it corresponds to declaring entity in DTD that has all special characters escaped: left angle bracket is replaced with <, ampersnad with & and so on.
Note: The given value is the literal replacement text and must not contain any other entity reference (if it contains any entity reference there will be no further replacement).
Note: The list of pre-defined entity names will always contain standard XML entities such as amp (&), lt (<), gt (>), quot ("), and apos ('). Those cannot be redefined by this method!
| entityName | |
|---|---|
| replacementText |
| XmlPullParserException |
|---|
Returns the number of attributes of the current start tag, or -1 if the current event type is not START_TAG
Returns the local name of the specified attribute if namespaces are enabled or just attribute name if namespaces are disabled. Throws an IndexOutOfBoundsException if the index is out of range or current event type is not START_TAG.
| index |
|---|
Returns the namespace URI of the attribute with the given index (starts from 0). Returns an empty string ("") if namespaces are not enabled or the attribute has no namespace. Throws an IndexOutOfBoundsException if the index is out of range or the current event type is not START_TAG.
NOTE: if FEATURE_REPORT_NAMESPACE_ATTRIBUTES is set then namespace attributes (xmlns:ns='...') must be reported with namespace http://www.w3.org/2000/xmlns/ (visit this URL for description!). The default namespace attribute (xmlns="...") will be reported with empty namespace.
NOTE:The xml prefix is bound as defined in Namespaces in XML specification to "http://www.w3.org/XML/1998/namespace".
| index |
|---|
Returns the prefix of the specified attribute Returns null if the element has no prefix. If namespaces are disabled it will always return null. Throws an IndexOutOfBoundsException if the index is out of range or current event type is not START_TAG.
| index |
|---|
Returns the type of the specified attribute If parser is non-validating it MUST return CDATA.
| index |
|---|
Returns the attributes value identified by namespace URI and namespace localName. If namespaces are disabled namespace must be null. If current event type is not START_TAG then IndexOutOfBoundsException will be thrown.
NOTE: attribute value must be normalized (including entity replacement text if PROCESS_DOCDECL is false) as described in XML 1.0 section 3.3.3 Attribute-Value Normalization
| namespace | Namespace of the attribute if namespaces are enabled otherwise must be null |
|---|---|
| name | If namespaces enabled local name of attribute otherwise just attribute name |
Returns the given attributes value. Throws an IndexOutOfBoundsException if the index is out of range or current event type is not START_TAG.
NOTE: attribute value must be normalized (including entity replacement text if PROCESS_DOCDECL is false) as described in XML 1.0 section 3.3.3 Attribute-Value Normalization
| index |
|---|
Returns the current column number, starting from 0. When the parser does not know the current column number or can not determine it, -1 is returned (e.g. for WBXML).
Returns the current depth of the element. Outside the root element, the depth is 0. The depth is incremented by 1 when a start tag is reached. The depth is decremented AFTER the end tag event was observed.
<!-- outside --> 0
<root> 1
sometext 1
<foobar> 2
</foobar> 2
</root> 1
<!-- outside --> 0
Returns the type of the current event (START_TAG, END_TAG, TEXT, etc.)
| XmlPullParserException |
|---|
Unknown properties are
| name | The name of feature to be retrieved. |
|---|
Returns the input encoding if known, null otherwise. If setInput(InputStream, inputEncoding) was called with an inputEncoding value other than null, this value must be returned from this method. Otherwise, if inputEncoding is null and the parser suppports the encoding detection feature (http://xmlpull.org/v1/doc/features.html#detect-encoding), it must return the detected encoding. If setInput(Reader) was called, null is returned. After first call to next if XML declaration was present this method will return encoding declared.
Returns the current line number, starting from 1. When the parser does not know the current line number or can not determine it, -1 is returned (e.g. for WBXML).
For START_TAG or END_TAG events, the (local) name of the current element is returned when namespaces are enabled. When namespace processing is disabled, the raw name is returned. For ENTITY_REF events, the entity name is returned. If the current event is not START_TAG, END_TAG, or ENTITY_REF, null is returned.
Please note: To reconstruct the raw element name when namespaces are enabled and the prefix is not null, you will need to add the prefix and a colon to localName..
Returns the namespace URI of the current element. The default namespace is represented as empty string. If namespaces are not enabled, an empty String ("") is always returned. The current event must be START_TAG or END_TAG; otherwise, null is returned.
Returns the URI corresponding to the given prefix, depending on current state of the parser.
If the prefix was not declared in the current scope, null is returned. The default namespace is included in the namespace table and is available via getNamespace (null).
This method is a convenience method for
for (int i = getNamespaceCount(getDepth ())-1; i >= 0; i--) {
if (getNamespacePrefix(i).equals( prefix )) {
return getNamespaceUri(i);
}
}
return null;
Please note: parser implementations may provide more efifcient lookup, e.g. using a Hashtable. The 'xml' prefix is bound to "http://www.w3.org/XML/1998/namespace", as defined in the Namespaces in XML specification. Analogous, the 'xmlns' prefix is resolved to http://www.w3.org/2000/xmlns/
| prefix |
|---|
Returns the numbers of elements in the namespace stack for the given depth. If namespaces are not enabled, 0 is returned.
NOTE: when parser is on END_TAG then it is allowed to call this function with getDepth()+1 argument to retrieve position of namespace prefixes and URIs that were declared on corresponding START_TAG.
NOTE: to retrieve lsit of namespaces declared in current element:
XmlPullParser pp = ...
int nsStart = pp.getNamespaceCount(pp.getDepth()-1);
int nsEnd = pp.getNamespaceCount(pp.getDepth());
for (int i = nsStart; i < nsEnd; i++) {
String prefix = pp.getNamespacePrefix(i);
String ns = pp.getNamespaceUri(i);
// ...
}
| depth |
|---|
| XmlPullParserException |
|---|
Returns the namespace prefixe for the given position in the namespace stack. Default namespace declaration (xmlns='...') will have null as prefix. If the given index is out of range, an exception is thrown.
Please note: when the parser is on an END_TAG, namespace prefixes that were declared in the corresponding START_TAG are still accessible although they are no longer in scope.
| pos |
|---|
| XmlPullParserException |
|---|
Returns the namespace URI for the given position in the namespace stack If the position is out of range, an exception is thrown.
NOTE: when parser is on END_TAG then namespace prefixes that were declared in corresponding START_TAG are still accessible even though they are not in scope
| pos |
|---|
| XmlPullParserException |
|---|
Return string describing current position of parsers as text 'STATE [seen %s...] @line:column'.
Returns the prefix of the current element. If the element is in the default namespace (has no prefix), null is returned. If namespaces are not enabled, or the current event is not START_TAG or END_TAG, null is returned.
Look up the value of a property.
The property name is any fully-qualified URI.NOTE: unknown properties are always returned as null.
| name | The name of property to be retrieved. |
|---|
Returns the text content of the current event as String. The value returned depends on current event type, for example for TEXT event it is element content (this is typical case when next() is used).
See description of nextToken() for detailed description of possible returned values for different types of events.NOTE: in case of ENTITY_REF, this method returns the entity replacement text (or null if not available). This is the only case where getText() and getTextCharacters() return different values.
Returns the buffer that contains the text of the current event, as well as the start offset and length relevant for the current event. See getText(), next() and nextToken() for description of possible returned values.
Please note: this buffer must not be modified and its content MAY change after a call to next() or nextToken(). This method will always return the same value as getText(), except for ENTITY_REF. In the case of ENTITY ref, getText() returns the replacement text and this method returns the actual input buffer containing the entity name. If getText() returns null, this method returns null as well and the values returned in the holder array MUST be -1 (both start and length).
| holderForStartAndLength | Must hold an 2-element int array into which the start offset and length values will be written. |
|---|
Returns if the specified attribute was not in input was declared in XML. If parser is non-validating it MUST always return false. This information is part of XML infoset:
| index |
|---|
Returns true if the current event is START_TAG and the tag is degenerated (e.g. <foobar/>).
NOTE: if the parser is not on START_TAG, an exception will be thrown.
| XmlPullParserException |
|---|
Checks whether the current TEXT event contains only whitespace characters. For IGNORABLE_WHITESPACE, this is always true. For TEXT and CDSECT, false is returned when the current event text contains at least one non-white space character. For any other event type an exception is thrown.
Please note: non-validating parsers are not able to distinguish whitespace and ignorable whitespace, except from whitespace outside the root element. Ignorable whitespace is reported as separate event, which is exposed via nextToken only.
| XmlPullParserException |
|---|
Get next parsing event - element content wil be coalesced and only one TEXT event must be returned for whole element content (comments and processing instructions will be ignored and emtity references must be expanded or exception mus be thrown if entity reerence can not be exapnded). If element content is empty (content is "") then no TEXT event will be reported.
NOTE: empty element (such as <tag/>) will be reported with two separate events: START_TAG, END_TAG - it must be so to preserve parsing equivalency of empty element to <tag></tag>. (see isEmptyElementTag ())
| IOException | |
|---|---|
| XmlPullParserException |
Call next() and return event if it is START_TAG or END_TAG otherwise throw an exception. It will skip whitespace TEXT before actual tag if any.
essentially it does this
int eventType = next();
if(eventType == TEXT && isWhitespace()) { // skip whitespace
eventType = next();
}
if (eventType != START_TAG && eventType != END_TAG) {
throw new XmlPullParserException("expected start or end tag", this, null);
}
return eventType;
| IOException | |
|---|---|
| XmlPullParserException |
If current event is START_TAG then if next element is TEXT then element content is returned or if next event is END_TAG then empty string is returned, otherwise exception is thrown. After calling this function successfully parser will be positioned on END_TAG.
The motivation for this function is to allow to parse consistently both empty elements and elements that has non empty content, for example for input:
p.nextTag() p.requireEvent(p.START_TAG, "", "tag"); String content = p.nextText(); p.requireEvent(p.END_TAG, "", "tag");This function together with nextTag make it very easy to parse XML that has no mixed content.
Essentially it does this
if(getEventType() != START_TAG) {
throw new XmlPullParserException(
"parser must be on START_TAG to read next text", this, null);
}
int eventType = next();
if(eventType == TEXT) {
String result = getText();
eventType = next();
if(eventType != END_TAG) {
throw new XmlPullParserException(
"event TEXT it must be immediately followed by END_TAG", this, null);
}
return result;
} else if(eventType == END_TAG) {
return "";
} else {
throw new XmlPullParserException(
"parser must be on START_TAG or TEXT to read text", this, null);
}
| IOException | |
|---|---|
| XmlPullParserException |
This method works similarly to next() but will expose additional event types (COMMENT, CDSECT, DOCDECL, ENTITY_REF, PROCESSING_INSTRUCTION, or IGNORABLE_WHITESPACE) if they are available in input.
If special feature FEATURE_XML_ROUNDTRIP (identified by URI: http://xmlpull.org/v1/doc/features.html#xml-roundtrip) is enabled it is possible to do XML document round trip ie. reproduce exectly on output the XML input using getText(): returned content is always unnormalized (exactly as in input). Otherwise returned content is end-of-line normalized as described XML 1.0 End-of-Line Handling and. Also when this feature is enabled exact content of START_TAG, END_TAG, DOCDECL and PROCESSING_INSTRUCTION is available.
Here is the list of tokens that can be returned from nextToken() and what getText() and getTextCharacters() returns:
" titlepage SYSTEM "http://www.foo.bar/dtds/typo.dtd" [<!ENTITY % active.links "INCLUDE">]"
for input document that contained:
<!DOCTYPE titlepage SYSTEM "http://www.foo.bar/dtds/typo.dtd" [<!ENTITY % active.links "INCLUDE">]>otherwise if FEATURE_XML_ROUNDTRIP is false and PROCESS_DOCDECL is true then what is returned is undefined (it may be even null)
NOTE: there is no gurantee that there will only one TEXT or IGNORABLE_WHITESPACE event from nextToken() as parser may chose to deliver element content in multiple tokens (dividing element content into chunks)
NOTE: whether returned text of token is end-of-line normalized is depending on FEATURE_XML_ROUNDTRIP.
NOTE: XMLDecl (<?xml ...?>) is not reported but its content is available through optional properties (see class description above).
| IOException | |
|---|---|
| XmlPullParserException |
Test if the current event is of the given type and if the namespace and name do match. null will match any namespace and any name. If the test is not passed, an exception is thrown. The exception text indicates the parser position, the expected event and the current event that is not meeting the requirement.
Essentially it does this
if (type != getEventType()
|| (namespace != null && !namespace.equals( getNamespace () ) )
|| (name != null && !name.equals( getName() ) ) )
throw new XmlPullParserException( "expected "+ TYPES[ type ]+getPositionDescription());
| type | |
|---|---|
| namespace | |
| name |
| IOException | |
|---|---|
| XmlPullParserException |
Set the input source for parser to the given reader and resets the parser. The event type is set to the initial value START_DOCUMENT. Setting the reader to null will just stop parsing and reset parser state, allowing the parser to free internal resources such as parsing buffers.
| in |
|---|
| XmlPullParserException |
|---|
Sets the input stream the parser is going to process. This call resets the parser state and sets the event type to the initial value START_DOCUMENT.
NOTE: If an input encoding string is passed, it MUST be used. Otherwise, if inputEncoding is null, the parser SHOULD try to determine input encoding following XML 1.0 specification (see below). If encoding detection is supported then following feature http://xmlpull.org/v1/doc/features.html#detect-encoding MUST be true amd otherwise it must be false
| inputStream | Contains a raw byte input stream of possibly unknown encoding (when inputEncoding is null). |
|---|---|
| inputEncoding | If not null it MUST be used as encoding for inputStream |
| XmlPullParserException |
|---|
Set the value of a property.
The property name is any fully-qualified URI.| name | |
|---|---|
| value |
| XmlPullParserException |
|---|
Skip sub tree that is currently porser positioned on.
NOTE: parser must be on START_TAG and when funtion returns
parser will be positioned on corresponding END_TAG
| IOException | |
|---|---|
| XmlPullParserException |
Make sure that in attributes temporary array is enough space.
| size |
|---|
Make sure that we have enough space to keep element stack if passed size. It will always create one additional slot then current depth
| size |
|---|
| end |
|---|
simplistic implementation of hash function that has constant time to compute - so it also means diminishing hash quality for long strings but for XML parsing it should be good enough ...
| ch | |
|---|---|
| off | |
| len |
| ch |
|---|
| ch |
|---|
| ch |
|---|
| cbuf | |
|---|---|
| off | |
| len |
| cbuf | |
|---|---|
| off | |
| len |
| ch |
|---|
| s |
|---|