XQuery IntelliJ Plugin 1.9 Data and Processing Model

Abstract

This document defines the data and processing model for vendor and plugin specific functionality that extends XQuery and associated W3C extensions. The plugin data model extensions are to fill in gaps within the type system and to provide static type analysis.

This document also documents the internals of how the plugin models and processes XPath and XQuery constructs. These are used to implement IntelliJ integration such as inlay parameters, and static analysis for various inspections.

Introduction
- PSI Tree and Data Model Construction
Basics
- Type System
Type Manipulation
- Upper and Lower Bounds
- Aggregate Types
Data Model
Operation Tree
Accessors
- namespace-attributes Accessor
  - Nodes
  - namespace attributes Property
References
- W3C References
- XPath NG Proposals

1 Introduction

This document defines the data model for vendor and plugin specific functionality that extends XQuery 3.1, XQuery and XPath Full Text 3.0, XQuery Update Facility 3.0, and XQuery Scripting Extension 1.0.

The plugin supports BaseX, eXist-db, MarkLogic, and Saxon vendor extensions. The type system extensions needed to support these extensions is detailed in this document.

The plugin-specific data model extensions are to fill in gaps within the type system and to support static type analysis.

This document also defines the processing model (operation tree) used by the plugin to integrate with the IDE and provide static analysis.

1.1 PSI Tree and Data Model Construction

An XQuery document is parsed according to the combined XQuery, Full Text, Updating, Scripting, and Vendor syntax described in the XQuery IntelliJ Plugin 1.6 XQuery document. The XQuery parser extends the XPath 2.0 parser to avoid duplicating code between the two parsers.

An XPath 2.0 and later expression, pattern, and sequence-type are parsed according to the combined XPath, Full Text, and Vendor syntax described in the XQuery IntelliJ Plugin 1.6 XPath document. The pattern syntax is validated using the conformance checking intention used to check vendor specific, extension, or later version syntax constructs.¹

An XPath 1.0 expression, pattern, and sequence-type is parsed according to the XPath 1.0 as 2.0 EBNF Grammar document.¹ The extra-grammatical constraints and pattern syntax are validated using the conformance checking intention used to check vendor specific, extension, or later version syntax constructs.¹

The XPath expression or XQuery file is parsed into an AST tree called a PSI tree by IntelliJ. These nodes are modelled on the XPath and XQuery symbols with a concrete class and corresponding interface. The XQuery symbols that are also XPath symbols are modelled as XPath PSI elements to avoid duplicating logic between the XPath and XQuery PSI trees.

Symbols that are just a list of possible types, such as ExprSingle are only defined as an interface, to avoid adding unnecessary nodes to the PSI tree. If a symbol is only forwarding to another symbol, such as when a RangeExpr does not have a to part, the corresponding PSI element is omitted from the PSI tree to further simplify the model in memory.

Example

The expression 5 instance of xs:string? has the following PSI tree from the XPath parser:
XPath
    InstanceofExpr
       IntegerLiteral       "5"
       XmlNCName            "instance"
       XmlNCName            "of"
       SequenceType
          AtomicOrUnionType
             QName
                XmlNCName   "xs"
                Token       ":"
                XmlNCName   "string"
          Token             "?"
A similar PSI tree is built for XQuery expressions. If the occurrence indicator is missing in the above expression, the SequenceType node is not included, only the AtomicOrUnionType node.

The PSI tree elements implement the model described in this document. These symbols provide the statically known information for that given context. This is used for static analysis and IDE integration such as symbol navigation and code completion.

The atomic type values determined statically are not checked to see if they conform to the given type aside from that specified by the EBNF grammar. The validation and normalisation is deferred to later processing and validation checks after the PSI tree has been constructed. This is to permit partially typed or incorrect values without throwing invalid/malformed type exceptions.

This functionality is not currently supported in the XQuery IntelliJ Plugin. This is a specification for how that functionality is intended to be implemented.

2 Basics

This document uses the following namespace prefixes to represent the namespace URIs with which they are listed. Although these prefixes are used within this specification to refer to the corresponding namespaces, not all of these bindings will necessarily be present in the static context of every expression, and authors are free to use different prefixes for these namespaces, or to bind these prefixes to different namespaces.

xs = http://www.w3.org/2001/XMLSchema
fn = http://www.w3.org/2005/xpath-functions

In addition to the prefixes in the above list, this document uses the following namespace prefixes to represent the namespace URIs with which they are listed. These namespace prefixes are not predeclared and their use in this document is not normative.

err = http://www.w3.org/2005/xqt-errors
error = http://marklogic.com/xdmp/error
xdm = http://reecedunn.co.uk/xquery-datamodel

2.1 Type System

The names in square brackets in the type diagrams are the Java/Kotlin interfaces in the uk.co.reecedunn.intellij.plugin.xdm.model package of lang-xdm that are used to model the specified type.

2.1.1 Part 1: Items

item() [XdmItem]
├─── node() [XdmNode]
│    ├─── attribute() [XdmAttributeNode]
│    │    └─── user-defined attribute types
│    ├─── document-node() [XdmDocumentNode]
│    │    └─── document types with more precise content type
│    ├─── element() [XdmElementNode]
│    │    └─── user-defined element types
│    ├─── comment() [XdmCommentNode]
│    ├─── namespace-node() [XdmNamespaceNode]
│    ├─── processing-instruction() [XdmProcessingInstructionNode]
│    ├─── text() [XdmTextNode]
│    ├─── array-node() [XdmArrayNode]
│    ├─── boolean-node() [XdmBooleanNode]
│    ├─── null-node() [XdmNullNode]
│    ├─── number-node() [XdmNumberNode]
│    └─── object-node() [XdmObjectNode]
├─── attribute-decl() [XdmAttributeDecl]
├─── binary() [XdmBinary]
├─── complex-type() [XdmComplexType]
├─── element-decl() [XdmElementDecl]
├─── model-group() [XdmModelGroup]
├─── schema-component() [XdmSchemaComponent]
├─── schema-facet() [XdmSchemaFacet]
├─── schema-particle() [XdmSchemaParticle]
├─── schema-root() [XdmSchemaRoot]
├─── schema-type() [XdmSchemaType]
├─── schema-wildcard() [XdmSchemaWildcard]
├─── simple-type() [XdmSimpleType]
├─── function(*) [XdmFunction]
│    ├─── map(*) [XdmMap]
│    └─── array(*) [XdmArray]
├─── annotation(*) [XdmAnnotation]
└─── xs:anyAtomicType [XsAnyAtomicType] ────────────────── See Part 3

The array-node(), boolean-node(), null-node(), number-node(), and object-node() types are MarkLogic JSON types.

The attribute-decl(), complex-type(), element-decl(), model-group(), schema-component(), schema-facet(), schema-particle(), schema-root(), schema-type(), schema-wildcard(), and simple-type() types are MarkLogic schema types.

The binary() type is a MarkLogic item type.

The annotation(*) type is from the XPath NG annotation sequence types proposal.

2.1.2 Part 2: Simple and Complex Types

xs:anyType [XsAnyType]
├─── xdm:anyComplexType [XdmAnyComplexType]
│    ├─── xs:untyped
│    └─── user-defined complex types
└─── xs:anySimpleType [XsAnySimpleType]
     ├─── xs:anyAtomicType [XsAnyAtomicType] ───────────── See Part 3
     ├─── xdm:anyListType [XdmAnyListType]
     │    ├─── xs:IDREFS
     │    ├─── xs:NMTOKENS
     │    ├─── xs:ENTITIES
     │    └─── user-defined list types
     └─── xdm:anyUnionType [XdmAnyUnionType]
          ├─── xs:numeric
          ├─── xs:error¹
          └─── user-defined union types

xs:error is defined in XML Schema 1.1 Part 2, and in the Types section of the XPath 3.1 and XQuery 3.1 specifications, but not in XQuery and XPath 3.1 Data Model. Support for this type is dependent on whether the implementation supports XML Schema 1.1.

The data model defines three additional simple and complex types: xdm:anyComplexType, xdm:anyListType, and xdm:anyUnionType. These types are defined in an XQuery IntelliJ Plugin specific namespace.

xdm:anyComplexType

The datatype xdm:anyComplexType is a complex type that includes all complex types (and no values that are not complex). Its base type is xs:anyType from which all schema types, including simple, and complex types are derived.

xdm:anyListType

The datatype xdm:anyListType is a list type that includes all list types (and no values that are not lists). Its base type is xs:anySimpleType from which all simple types, including atomic, list, and union types are derived.

xdm:anyUnionType

The datatype xdm:anyUnionType is a union type that includes all union types (and no values that are not unions). Its base type is xs:anySimpleType from which all simple types, including atomic, list, and union types are derived.

2.1.3 Part 3: Atomic Types

xs:anyAtomicType¹ [XsAnyAtomicType]
├─── xs:anyURI [XsAnyUriValue]
├─── xs:base64Binary
├─── xs:boolean
├─── xs:date
├─── xs:dateTime
│    └─── xs:dateTimeStamp²
├─── xs:decimal [XsDecimalValue³]
│    └─── xs:integer [XsIntegerValue³]
│         ├─── xs:long
│         │    └─── xs:int
│         │         └─── xs:short
│         │              └─── xs:byte
│         ├─── xs:nonNegativeInteger
│         │    ├─── xs:positiveInteger
│         │    └─── xs:unsignedLong
│         │         └─── xs:unsignedInt
│         │              └─── xs:unsignedShort
│         │                   └─── xs:unsignedByte
│         └─── xs:nonPositiveInteger
│              └─── xs:negativeInteger
├─── xs:double [XsDoubleValue]
├─── xs:duration [XsDurationValue]
│    ├─── xs:dayTimeDuration¹
│    └─── xs:yearMonthDuration¹
├─── xs:float
├─── xs:gDay
├─── xs:gMonth
├─── xs:gMonthDay
├─── xs:gYear
├─── xs:gYearMonth
├─── xs:hexBinary
├─── xs:NOTATION
├─── xs:QName [XsQNameValue]
├─── xs:string [XsStringValue]
│    └─── xs:normalizedString [XsNormalizedValue]
│         ├─── xs:token [XsTokenValue]
│         │    ├─── xs:language
│         │    └─── xs:Name [XsNameValue]
│         │         └─── xs:NCName [XsNCNameValue]
│         │              ├─── xs:ENTITY
│         │              ├─── xs:ID [XsIDValue]
│         │              ├─── xs:IDREF
│         │              └─── xdm:wildcard [XdmWildcardValue]
│         └─── xs:NMTOKEN
├─── xs:time
└─── xs:untypedAtomic [XsUntypedAtomicValue]

xs:anyAtomicType, xs:yearMonthDuration, and xs:dayTimeDuration are defined in XML Schema 1.1 Part 2, and in XQuery and XPath 3.1 Data Model. Support for these types is available on any conforming implementation.
xs:dateTimeStamp is defined in XML Schema 1.1 Part 2, but not in XQuery and XPath 3.1 Data Model. Support for this type is dependent on whether the implementation supports XML Schema 1.1.
In the interface hierarchy, XsIntegerValue is a base class of XsAnyAtomicType, not XsDecimalValue. This is because XsIntegerValue and XsDecimalValue are modelled as having different data value types, so are incompatible in the API.

The data model defines one additional atomic type: xdm:wildcard. This type is defined in an XQuery IntelliJ Plugin specific namespace.

xdm:wildcard

The type xdm:wildcard is derived from xs:NCName. The lexical representation of xdm:wildcard is *. The value space of xdm:wildcard is the empty set.

The unspecified prefix or local name of a Wildcard is an instance of xdm:wildcard.

2.1.4 Part 4: Sequences

[Definition: The lower bound of a sequence type specifies the minimum number of values the sequence can contain.] The value space is restricted to null, 0 and 1.

[Definition: The upper bound of a sequence type specifies the maximum number of values the sequence can contain.] The value space is either null, a non-negative integer, or infinity. Here, infinity signifies many items, an unbounded number of items and is mapped to the maximum integer value.

[Definition: The cardinality of a sequence is the lower bound and upper bound pair.] This constrains the number of items in the sequence.

[Definition: The item type of a sequence type is the single item type associated with the values in the sequence.]

The lower bound, upper bound, and item type values are mapped as follows:

Type	lower bound	upper bound	item type	Description
`xs:error`	`null`	`null`	`xs:error`	An XSD error item.
`xs:error+`	`null`	`null`	`xs:error`	An XSD error sequence.
`xs:error?`	`0`	`0`	`null`	An optional XSD error item.
`xs:error*`	`0`	`0`	`null`	An optional XSD error sequence.
`()`	`0`	`0`	`null`	An empty sequence.
`T?`	`0`	`1`	`T`	An optional item.
`T*`	`0`	`infinity`	`T`	An optional sequence.
list type	`0`	`infinity`	atomic type	An XMLSchema list type.
`T`	`1`	`1`	`T`	A single item.
`T+`	`1`	`infinity`	`T`	A sequence.
`(T1, T2, ..., Tn)?`	`0`	`n`	item type union	An optional restricted sequence.
`(T1, T2, ..., Tn)`	`1`	`n`	item type union	A restricted sequence.

Note:

The list types are only valid in cast as expressions, it cannot be used as the type of a variable. Specifying it here is useful for specifying the static type of the cast expression on a list type, and for providing a replacement suggestion in the IDE.

2.1.5 Sequence Types

XdmSequenceType
├─── XdmItemType
├─── XdmSequenceTypeList
└─── XdmSequenceTypeUnion

An XdmSequenceType is any of the SequenceType symbols. It provides access to the lower bound, upper bound, and item type values of that sequence type, as well as a type name used for presenting the type to the user.

An XdmItemType is any of the ItemType symbols that represent a sequence of size one whose item type is itself. An item type has a type class property that is the Java Class of the item. This is one of the interfaces defined in parts 1 to 3 of this section.

An XdmSequenceTypeList is an instance of the SequenceTypeList symbol used in a TypedFunctionTest. It is a comma-separated list of sequence types that define the type of each item in an ArgumentList.

An XdmSequenceTypeUnion is an instance of the SequenceTypeUnion symbol used in a typeswitch expression or parenthesized sequence type. It is a |-separated list of sequence types that define the possible types of a sequence type such as the operand expression in a TypeswitchExpr.

Each PSI element in the SequenceType EBNF implements one of these interfaces, defining the properties according to the specific sequence type. This is limited to the information available at parse time, so precise values for the lower and upper bounds of XMLSchema types are not defined.¹ It should be possible to calculate the precise bounds as needed on the resolved types.

This is primarily for performance reasons. Identifying the precise XMLSchema type requires expanding the namespace of the AtomicOrUnionType, locating the bound namespace, locating the type within the schema files, and then calculating the bounds from that type. If any of the context above the schema type, such as the schema namespace, changes then the type and everything it depends on need to be recalculated.

3 Type Manipulation

3.1 Upper and Lower Bounds

3.1.1 Minimum of two bounds

The minimum of two bounds is determined by taking the minimum numerical value of each bound. This gives the following results:

	`0`	`1`	`m`	`infinity`
`0`	`0`	`0`	`0`	`0`
`1`	`0`	`1`	`1`	`1`
`n`	`0`	`1`	`min(m,n)`	`n`
`infinity`	`0`	`1`	`m`	`infinity`

Note:

This does not currently handle the rules for xs:error (null bound).

3.1.2 Maximum of two bounds

The maximum of two bounds is determined by taking the maximum numerical value of each bound. This gives the following results:

	`0`	`1`	`m`	`infinity`
`0`	`0`	`1`	`m`	`infinity`
`1`	`1`	`1`	`m`	`infinity`
`n`	`n`	`n`	`max(m,n)`	`infinity`
`infinity`	`infinity`	`infinity`	`infinity`	`infinity`

Note:

This does not currently handle the rules for xs:error (null bound).

3.1.3 Sum of two bounds

The sum of two bounds Ab and Bb is computed using the following rules:

If Ab is infinity, the sum is infinity.
If Bb is infinity, the sum is infinity.
If the sum of Ab and Bb overflows (is greater than the maximum representable integer), the sum is infinity.
Otherwise, the sum is Ab + Bb.

Note:

This does not currently handle the rules for xs:error (null bound).

3.2 Aggregate Types

[Definition: The aggregate type of an expression is the type that best matches the type of each part of that expression, such that the expression could be assigned to a variable set to that aggregate type and not raise a type error.]

[Definition: A disjoint expression is an expression that consists of different values depending on the evaluation of conditions, such as in if, typeswitch, or switch expressions.] The aggregate type of a disjoint expression is the union of the type of each conditional value in that disjoint expression.

Example

The expression if ($x instance of xs:string) then 2 else () has the types xs:integer and empty-sequence() in the then and else clauses that form the conditional values of the if expression. This expression has an aggregate type of xs:integer? that is the union of the conditional types.

[Definition: A sequence expression is an expression that consists of a list of expressions that computes the values in the resulting sequence.] The aggregate type of a sequence expression is the addition of the type of each expression used to construct the sequence.

Example

The expression (2, (), "test" cast as xs:NCName) has the types xs:integer, empty-sequence(), and xs:NCName for each item in the sequence, and an aggregate type of union(xs:integer, xs:NCName)+ that is the combination of the types of the items in the sequence.

3.2.1 Item Type Union

The item type union is the union of the ItemType Ai and the ItemType Bi. It is determined as follows:

If subtype-itemtype(Ai, Bi), then the union is Bi.
If subtype-itemtype(Bi, Ai), then the union is Ai.
If Ai and Bi are union types, then the union is a union type with the member types of Ai and the member types of Bi as its member types.
If Ai is a union type, and Bi is a simple type, then the union is a union type with the member types of Ai and the simple type Bi as its member types.
If Ai is a simple type, and Bi is a union type, then the union is a union type with the simple type Ai and the member types of Bi as its member types.
If Ai and Bi are simple types, then the union is a union type with the simple types Ai and Bi as its member types.
If Ai and Bi are KindTest types, then the union is node().
Otherwise, the union is the item() type.

3.2.2 Sequence Type Union

When computing the union of BT with TT, the resulting sequence type is computed as follows:

The lower bound is the minimum of the lower bound for BT and TT.
The upper bound is the maximum of the upper bound for BT and TT.
The item type is determined as follows:
1. If BT is the empty sequence, then item type is the item type for TT.
2. If TT is the empty sequence, then item type is the item type for BT.
3. The item type is the item type union of the item type for BT and TT.

The union is then:

BT if the computed lower bound, upper bound, and item type are the same as those for BT.
TT if the computed lower bound, upper bound, and item type are the same as those for TT.
A new sequence type using the computed lower bound, upper bound, and item type values. See the mapping table in the type system part 4: sequences section.

3.3.3 Sequence Type Addition

When computing the addition of BT with TT, the resulting sequence type is computed as follows:

The lower bound is the maximum of the lower bound for BT and TT.
The upper bound is the sum of the upper bound for BT and TT.
The item type is determined as follows:
1. If BT is the empty sequence, then item type is the item type for TT.
2. If TT is the empty sequence, then item type is the item type for BT.
3. The item type is the item type union of the item type for BT and TT.

The addition is then:

BT if the computed lower bound, upper bound, and item type are the same as those for BT.
TT if the computed lower bound, upper bound, and item type are the same as those for TT.
A new sequence type using the computed lower bound, upper bound, and item type values. See the mapping table in the type system part 4: sequences section.

4 Data Model

4.1 Literals

Symbol	Type	Interface	Representation
`DecimalLiteral`	`xs:decimal`	`XsDecimalValue`	`BigDecimal`
`DoubleLiteral`	`xs:double`	`XsDoubleValue`	`Double`
`IntegerLiteral`	`xs:integer`	`XsIntegerValue`	`BigInteger`
`StringLiteral`	`xs:string`	`XsStringValue`	`String`
`URILiteral`	`xs:anyURI`	`XsAnyUriValue`	`String`

The PSI elements for the literal symbols implement the interface corresponding to their associated atomic type defined above. These have a data property that is the literal’s value as represented by the given Java type.

The xs:anyURI representation is String as the content is not validated at the point the PSI tree is constructed. This is to permit partially typed URIs, or incorrectly typed URIs, to be represented correctly without throwing malformed URI exceptions.

4.2 EQNames and Wildcards

Symbol	Type	Interface	Representation
`NCName`^Names	`xs:NCName`	`XsNCNameValue`	`String`
`BracedURILiteral`	`xs:anyURI`	`XsAnyUriValue`	`String`
`WildcardIndicator`	`xdm:wildcard`	`XdmWildcardValue`	`String`

The parts that make up an EQName implement the interface corresponding to their associated atomic type defined above. These have a data property that is the atomic type’s value as represented by the given Java type.

The XsQNameValue interface has the following properties:

namespace;
prefix;
local name;
is lexical qname.

Symbol	namespace	prefix	local name	is lexical qname
`QName`	no	yes	yes	true
`NCName`	no	no	yes	true
`URIQualifiedName`	yes	no	yes	false

The Wildcard symbol is also an XsQNameValue, with the properties mirroring the NCName, QName, or URIQualifiedName. The prefix or local parts can be an instance of xdm:wildcard to indicate that any value matches.

4.2.1 Accepts Namespace Types

An EQName accepts prefixed namespace declarations if it is a QName. It matches if the namespace prefix of the declaration is the same as the prefix of the QName.

An EQName accepts default element\/type namespace declarations if it is an NCName that is either an element or type.

An EQName accepts default element namespace declarations if it is an NCName that is an element.

An EQName accepts default type namespace declarations if it is an NCName that is a type.

An EQName accepts default function declaration namespace declarations if it is an NCName that is a function declaration.

An EQName accepts default function reference namespace declarations if it is an NCName that is a function reference.

When evaluating the expanded QName (such as when resolving function calls), the namespace uri of any matching namespace declaration is added to the expanded QName’s namespace.

4.3 Annotations

Symbol	Interface
`CompatibilityAnnotation`	`XdmAnnotation`
`Annotation`	`XdmAnnotation`

The name of the annotation is the unexpanded xs:QName annotation name.

The values of the annotation is the list of xs:string, xs:integer, xs:decimal, and xs:decimal arguments passed to the annotation. For CompatibilityAnnotation this is always empty.

5 Operation Tree

5.1 Expressions

An expression is any EBNF symbol that is documented in the Expressions section of the XPath and XQuery specifications, and the New Kinds of Expressions section of the Update Facility and Scripting extensions.

The XpmExpression interface is used to denote an expression in the AST.

The expression element property identifies the element (token or AST node) that is used to locate this expression. This property is used by the plugin to correctly map to MarkLogic expression breakpoints.

Note:

When constructing the PSI tree for ParenthesizedExpr, if the Expr is missing (()) then an EmptyExpr node is used, otherwise the node is not included in the PSI tree. This way, (2) and 2 have the same tree hierarchy.

Note:

When constructing the PSI tree for Expr, if the Expr is a single ExprSingle node then the Expr is not included in the PSI tree.

5.1.1 Concatenating Expressions

Symbol	Interface
`BlockBody`	`XpmConcatenatingExpression`
`ConcatExpr`	`XpmConcatenatingExpression`
`Expr`	`XpmConcatenatingExpression`
`QueryBody`	`XpmConcatenatingExpression`
`WithExpr`	`XpmConcatenatingExpression`
`XPath`	`XpmConcatenatingExpression`

A concatenating expression is an expression that constructs a sequence by concatenating the results of the expressions contained within it.

The expressions property of a concatenating expression is the list of expressions that are to be concatenated into the resulting sequence.

Note:

In XQuery, a Scripting Extensions ConcatExpr node is created instead of an Expr node for a miltiple ExprSingle expression.

5.1.2 Try/Catch Expressions

Symbol	Interface
`CatchClause`	`XpmCatchClause`
`TryCatchExpr`	`XpmTryCatchExpression`

The try expression of a try/catch expression is the expression to be evaluated.

The catch clauses of a try/catch expression are the expressions that may be evaluated if the try expression resulted in an error.

The error list of a catch clause is the list of error names to check against the raised error.

The catch expression of a catch clause is the expression to be evaluated if a raised error matches the specified error list.

Note:

If a catch clause is a MarkLogic catch clause then there are no error names listed. The error object is bound to the catch variable as defined by the variable binding for the catch clause, and the catch clause automatically matches a raised error.

5.1.3 Context Item Expressions

Symbol	Interface
`ContextItemExpr`	`XpmContextItemExpression`

5.1.4 Lookup Expressions

Symbol	Interface
`PostfixLookup`	`XpmLookupExpression`
`UnaryLookup`	`XpmLookupExpression`

The context expression is the expression used to locate the map or array object to look up the specified key.

The key expression is the expression used to determine the keys to look up in the specified map or array context object.

5.1.5 Sequence Type Expressions

Symbol	Interface
`CastExpr`	`XpmSequenceTypeExpression`
`CastableExpr`	`XpmSequenceTypeExpression`
`InstanceofExpr`	`XpmSequenceTypeExpression`
`TreatExpr`	`XpmSequenceTypeExpression`

The operation property is the action that is performed on the sequence associated with this expression. This is one of: cast as, castable as, instance of, or treat as.

The expression property is the expression used to calculate the sequence that this operation is applied to.

The type property is the sequence type or item type the sequence is to be checked against, cast to, or treated as.

5.1.6 FLWOR Expressions

Symbol	Interface
`ForExpr`	`XpmFlworExpression`
`LetExpr`	`XpmFlworExpression`

The clauses property is the list of clauses associated with the expression. All the clauses are derived from XpmFlworClause.

The return expression property is the expression used to evaluate on the items matching the clauses.

5.1.6.1 For Clauses

Symbol	Interface
`SimpleForBinding`	`XpmForBinding`
`SimpleForClause`	`XpmForClause`

The bindings property of a for clause is the list of for bindings associated with the clause.

A for binding is a collection binding.

The binding collection type property of a for binding is sequence item for sequence-style for bindings, and array member for member-style for bindings.

5.1.6.2 Let Clauses

Symbol	Interface
`SimpleLetBinding`	`XpmLetBinding`
`SimpleLetClause`	`XpmLetClause`

The bindings property of a let clause is the list of let bindings associated with the clause.

A let binding is an assignable variable.

5.2 Path Steps

Symbol	Interface
`AxisStep`	`XpmPathStep`
`FilterStep`	`XpmPathStep`
`ForwardStep`	`XpmPathStep`
`AbbrevForwardStep`	`XpmPathStep`
`ReverseStep`	`XpmPathStep`
`AbbrevReverseStep`	`XpmPathStep`
`NodeTest`	`XpmPathStep`
`NameTest`	`XpmPathStep`
`AbbrevDescendantOrSelfStep`	`XpmPathStep`
`PostfixExpr`	`XpmPathStep`
`FilterExpr`	`XpmPathStep`
`DynamicFunctionCall`	`XpmPathStep`
`PostfixLookup`	`XpmPathStep`

The axis type property is the forward or reverse axis associated with the step in its unabbreviated form.

The node name property is the EQName associated with the NameTest if one is present.

The node type property is the KindTest associated with the given step. If the NodeTest is a KindTest, then the node type is that KindTest. Otherwise, it is the principal node kind associated with the axis type.

The predicate expression property is the expression associated with the Predicate node of the current step.

A PostfixExpr is treated as a path step as it may occur anywhere in a path expression. It is not added to the PSI tree if the PrimaryExpr is not preceded or followed by another step. The axis type is self and the node type is node().

5.2.1 Abbreviated Syntax

The abbreviated syntax permits the following additional abbreviations:

Per item 4 of section 3.3.5 and paragraph 4 of section 3.3 of the XQuery specification, the AbbrevDescendantOrSelfStep symbol is equivalent to /descendant-or-self::node()/.
A / or // at the beginning of a path expression is an abbreviation for the following root step before the / or //:
1. (fn:root(self::node()) treat as document-node()) for standard XQuery expressions;
2. fn:collection() for MarkLogic XQuery expressions evaluated against a MarkLogic database.

5.3 Namespace Declarations

Symbol	Interface	Accepts EQName Type
`DefaultNamespaceDecl`	`XpmNamespaceDeclaration`	see below
`DirNamespaceAttribute`	`XpmNamespaceDeclaration`	see below
`ModuleDecl`	`XpmNamespaceDeclaration`	default function declaration/reference
`ModuleImport`	`XpmNamespaceDeclaration`	prefixed
`NamespaceDecl`	`XpmNamespaceDeclaration`	prefixed
`NamespaceDeclaration`	`XpmNamespaceDeclaration`	see below
`SchemaImport`	`XpmNamespaceDeclaration`	see below
`UsingDecl`	`XpmNamespaceDeclaration`	default function reference

A namespace declaration is an EBNF symbol that adds a namespace to the statically-known namespaces for the scope that it is contained in.

The namespace prefix of a namespace declaration is the xs:NCName used to resolve the prefix part of a QName. If the declaration is a default namespace declaration, then this is null.

The namespace uri of a namespace declaration is the xs:anyURI that the namespace resolves to. When a QName or NCName matches the declaration, the namespace of the expanded QName binds to this value.

If a DefaultNamespaceDecl is a default element declaration then it accepts default element\/type EQNames. If it is a default function declaration then it accepts default function declaration or default function reference EQNames.

If the prefix of a DirNamespaceAttribute is xmlns then it accepts prefixed EQNames. If the local name is xmlns without a prefix then it accepts default element\/type EQNames. Otherwise, it does not accept any EQNames.

If the prefix of a NamespaceDeclaration is xmlns then it accepts prefixed EQNames. If the local name is xmlns without a prefix then it accepts default element EQNames. Otherwise, it does not accept any EQNames.

If a SchemaImport contains a SchemaPrefix then it accepts prefixed EQNames. Otherwise, it accepts default element\/type EQNames.

5.4 Annotated PSI Nodes

Symbol	Interface
`FunctionDecl`	`XpmAnnotated`
`FunctionTest`	`XpmAnnotated`
`InlineFunctionExpr`	`XpmAnnotated`
`ItemTypeDecl`	`XpmAnnotated`
`VarDecl`	`XpmAnnotated`

An annotated PSI node is a declaration, expression, or item type that can have annotations associated with it.

The annotations property lists the user-specified annotations on the PSI node.

5.5 Variables

A variable definition is a construct that introduces a variable that can be referenced in the scope the variable is valid for. The following properties are defined for variable definitions:

The variable name property is the xs:QName associated with that definition.

5.5.1 Variable References

Symbol	Interface
`VarRef`	`XpmVariableReference`
`AssignmentExpr`	`XpmVariableReference`

A variable reference is an expression that references a variable definition.

The variable name property is the xs:QName associated with that reference, used to locate the variable definition in the in-scope variables static context.

5.5.2 Variable Declarations

Symbol	Interface
`BlockVarDecl`	`XpmVariableDeclaration`
`VarDecl`	`XpmVariableDeclaration`

A variable declaration is a variable definition that specifies a variable in the prolog or scripting block scope. The following properties are defined for variable declarations:

The is external property is true if the variable is an external variable.
The variable type property is the SequenceType associated with the declaration, if specified.
The variable expression property is the expression that is used to evaluate the value of this variable. For external variables this is only used when a value is not provided from the caller of the module.

5.5.3 Variable Bindings

Symbol	Interface
`CatchClause`	`XpmVariableBinding`
`CountClause`	`XpmVariableBinding`
`CurrentItem`	`XpmVariableBinding`
`FTScoreVar`	`XpmVariableBinding`
`NextItem`	`XpmVariableBinding`
`PositionalVar`	`XpmVariableBinding`
`PreviousItem`	`XpmVariableBinding`

A variable binding is a variable in an expression that is bound to the result of an expression or other context (such as the position of an item in a FLWOR expression) for the scope of the expression.

5.5.3.1 Parameters

Symbol	Interface
`Param`	`XpmParameter`

A parameter is a variable binding that defines a parameter of a function. The following properties are defined for parameters:

The variable type property is the SequenceType associated with the parameter, if specified.
The default expression property is for optional parameters. This is the value of the parameter if no expression is provided for the parameter at the point at which the function is called.

5.5.3.2 Assignable Variables

Symbol	Interface
`CaseClause`	`XpmAssignableVariable`
`CopyModifyExprBinding`	`XpmAssignableVariable`
`DefaultCaseClause`	`XpmAssignableVariable`
`GroupingSpec`	`XpmAssignableVariable`
`LetBinding`	`XpmAssignableVariable`
`SimpleLetBinding`	`XpmAssignableVariable`

An assignable variable is a variable binding that defines a variable that can have a value specified by the user. The following properties are defined for assignable variables:

The variable type property is the SequenceType associated with the variable, if specified.
The variable expression property is the expression that is used to specify the value of the variable.

5.5.3.3 Collection Bindings

Symbol	Interface
`ForBinding`	`XpmCollectionBinding`
`ForMemberBinding`	`XpmCollectionBinding`
`QuantifierBinding`	`XpmCollectionBinding`
`SimpleForBinding`	`XpmCollectionBinding`
`SlidingWindowClause`	`XpmCollectionBinding`
`TumblingWindowClause`	`XpmCollectionBinding`

A collection binding is an variable binding that is bound to the values in a sequence or array.

The variable type property is the SequenceType associated with the variable, if specified.
The binding expression property is the expression that is used to calculate the binding collection for the symbol.

5.6 Functions

Symbol	Interface
`ArgumentPlaceholder`	`XpmExpression`
`ArrowDynamicFunctionCall`	`XpmArrowFunctionCall`
`ArrowFunctionCall`	`XpmArrowFunctionCall`, `XpmFunctionReference`
`DynamicFunctionCall`	`XpmFunctionCall`
`FunctionCall`	`XpmFunctionCall`, `XpmFunctionReference`
`FunctionDecl`	`XpmFunctionDeclaration`
`InlineFunctionExpr`	`XpmFunctionDeclaration`
`NamedFunctionRef`	`XpmFunctionReference`

A function declaration is a declaration or expression that introduces a named or anonymous function.

A function reference is an expression that specifies the name and arity of a function.

A function call is an expression that evaluates to a function reference. It supplies positional and keyword arguments that are used to evaluate the function. If any of the positional arguments are placeholders, this is a partially applied function call.

An arrow function call is a function call that is used in an arrow expression:

The source expression property references the initial unary expression if this is the first arrow target in the chain; otherwise, it references the previous arrow target.
The operation property is chaining for FatArrowTarget (=>), or mapping for ThinArrowTarget (->).

A static function call is a function call that has the name and arity of the function it references.

A dynamic function call is a function call that has an expression that evaluates to the function to be called.

5.7 Constructable Items

A constructable item is an expression that evaluates to an ItemType.

The item type class of a constructable item is the class instance of the associated type, such as XdmArray for an array(*) item.

The item expression is the associated expression that is used to construct the item of the specified type.

5.7.1 Maps

Symbol	Interface
`KeywordArgument`	`XpmMapEntry`
`MapConstructor`	`XpmMapExpression`
`MapConstructorEntry`	`XpmMapEntry`

A map expression is a constructable item that is used to create map(*) or object-node() items.

The entries of a map expression are the map entries that constitute the constructed map.

The key expression of a map entry is the expression used to calculate the key name. For keyword arguments this is the xs:NCName that specifies the parameter or map key name.

The value expression of a map entry is the expression used to calculate the entry value.

5.7.2 Arrays

Symbol	Interface
`SquareArrayConstructor`	`XpmArrayExpression`

An array expression is a constructable item that is used to create array(*) or array-node() items.

The member expressions of an array expression are the expressions that are used to create the members of the constructed array.

5.7.3 Node Constructors

A node constructor is a constructable item that is used to create node() items that correspond to XML nodes in the XQuery data model.

The node kind corresponds to the dm:node-kind accessor.

5.7.3.1 Document Nodes

Symbol	Interface
`CompDocConstructor`	`XdmDocumentNode`

A document node expression is a constructable item that is used to create document-node() node items.

5.7.3.2 Element Nodes

Symbol	Interface
`CompElemConstructor`	`XdmElementNode`
`DirElemConstructor`	`XdmElementNode`

An element node expression is a constructable item that is used to create element() node items.

The attributes property corresponds to the dm:attributes accessor.

The node name property corresponds to the dm:node-name accessor.

The namespace attributes property corresponds to the dm:namespace-attributes accessor.

5.7.3.3 Attribute Nodes

Symbol	Interface
`CompAttrConstructor`	`XdmAttributeNode`
`DirAttribute`	`XdmAttributeNode`

An attribute node expression is a constructable item that is used to create attribute() node items.

The node name property corresponds to the dm:node-name accessor.

The typed value property corresponds to the dm:typed-value accessor.

5.7.3.4 Namespace Nodes

Symbol	Interface
`CompNamespaceConstructor`	`XdmNamespaceNode`
`DefaultNamespaceDecl`	`XpmNamespaceNode`
`DirNamespaceAttribute`	`XpmNamespaceNode`
`ModuleDecl`	`XpmNamespaceNode`
`ModuleImport`	`XpmNamespaceNode`
`NamespaceDecl`	`XpmNamespaceNode`
`NamespaceDeclaration`	`XpmNamespaceNode`
`SchemaImport`	`XpmNamespaceNode`
`UsingDecl`	`XpmNamespaceNode`

A namespace node expression is a constructable item that is used to create namespace-node() node items.

The namespace prefix property is the prefix name that this namespace node binds to. This can have one of the following values:

An empty string xs:NCName – This represents an xmlns namespace declaration.
A non-empty xs:NCName – This represents an xmlns:* namespace declaration.
An empty sequence – This is for where the namespace prefix cannot be determined, e.g. if evaluating a complex expression statically that is dependent on the dynamic context.

The namespace uri property corresponds to the dm:typed-value accessor.

The parent node property corresponds to the dm:parent accessor.

5.7.3.5 Processing Instruction Nodes

Symbol	Interface
`CompPIConstructor`	`XdmProcessingInstructionNode`
`DirPIConstructor`	`XdmProcessingInstructionNode`

A processing instruction node expression is a constructable item that is used to create processing-instruction() node items.

5.7.3.6 Comment Nodes

Symbol	Interface
`CompCommentConstructor`	`XdmCommentNode`
`DirCommentConstructor`	`XdmCommentNode`

A comment node expression is a constructable item that is used to create comment() node items.

5.7.3.7 Text Nodes

Symbol	Interface
`CompTextConstructor`	`XdmTextNode`
`DirTextConstructor`	`XdmTextNode`

A text node expression is a constructable item that is used to create text() node items.

6 Accessors

In addition to the accessors defined in the XPath and XQuery Data Model specification, this plugin makes use of the following additional accessors.

6.1 namespace-attributes Accessor

dm:namespace-attributes($n as node()) as namespace-node()*

The dm:namespace-attributes accessor returns the attributes of a node that define a namespace as a sequence containing zero or more Namespace Nodes. The order of Namespace Nodes is stable but implementation dependent.

It is defined on all seven node kinds.

Note:

These namespace attributes are present on the element node, but are not returned by the dm:attributes accessor. Unlike other attributes, they are presented to the data model as Namespace Nodes.

6.1.1 Nodes

Document Nodes: Returns the empty sequence.
Element Nodes: Returns the value of the namespace attributes property. The order of Namespace Nodes is stable but implementation dependent.
Attribute Nodes: Returns the empty sequence.
Namespace Nodes: Returns the empty sequence.
Processing Instruction Nodes: Returns the empty sequence.
Comment Nodes: Returns the empty sequence.
Text Nodes: Returns the empty sequence.

6.1.2 namespace attributes Property

Element Node properties are derived from the infoset as follows:

namespace attributes

A set of Namespace Nodes constructed from the attribute information items appearing in the [namespace attributes] property. This is the namespace declaration attributes of the form xmlns:* and xmlns.

An Element Node maps to the following infoset properties:

[namespace attributes]

An unordered set of information items obtained by processing each og the dm:namespace-attributes and mapping each to the appropriate attribute information item(s).

A References

A.1 W3C References

XPath and XQuery

W3C. XML Path Language (XPath) 3.1. W3C Recommendation 21 March 2017. See https://www.w3.org/TR/2017/REC-xpath-31-20170321/.
W3C. XQuery 3.1: An XML Query Language. W3C Recommendation 21 March 2017. See https://www.w3.org/TR/2017/REC-xquery-31-20170321/.
W3C. XQuery and XPath Data Model 3.1. W3C Recommendation 21 March 2017. See https://www.w3.org/TR/2017/REC-xpath-datamodel-31-20170321/.
W3C. XPath and XQuery Functions and Operators 3.1. W3C Recommendation 21 March 2017. See https://www.w3.org/TR/2017/REC-xpath-functions-31-20170321/.
W3C. XQuery 1.0 and XPath 2.0 Formal Semantics (Second Edition). W3C Recommendation 14 December 2010. See http://www.w3.org/TR/2010/REC-xquery-semantics-20101214/.

W3C Language Extensions

W3C. XQuery and XPath Full Text 3.0. W3C Recommendation 24 November 2015. See http://www.w3.org/TR/2015/REC-xpath-full-text-30-20151124/.
W3C. XQuery Update Facility 3.0. W3C Working Group Note 24 January 2017. See https://www.w3.org/TR/2017/NOTE-xquery-update-30-20170124/.
W3C. XQuery Scripting Extension 1.0. W3C Working Group Note 18 September 2014. See http://www.w3.org/TR/2014/NOTE-xquery-sx-10-20140918/.

XML Schema

W3C. W3C XML Schema Definition Language (XSD) 1.1 Part 1: Structures. W3C Recommendation 5 April 2012. See http://www.w3.org/TR/2012/REC-xmlschema11-1-20120405/.
W3C. W3C XML Schema Definition Language (XSD) 1.1 Part 2: Datatypes. W3C Recommendation 5 April 2012. See http://www.w3.org/TR/2012/REC-xmlschema11-2-20120405/.

XML Infoset

W3C. XML Information Set (Second Edition). W3C Recommendation 4 February 2004. See http://www.w3.org/TR/2004/REC-xml-infoset-20040204.

A.2 XPath NG Proposals

EXPath. Proposal for Annotation Sequence Types. EXPath Proposal. See https://github.com/expath/xpath-ng/pull/10.
EXPath. Proposal for Restricted Sequences. EXPath Proposal. See https://github.com/expath/xpath-ng/pull/11.

XQuery IntelliJ Plugin 1.9 Data and Processing Model

Abstract

Table of Contents

1 Introduction

1.1 PSI Tree and Data Model Construction

2 Basics

2.1 Type System

2.1.1 Part 1: Items

2.1.2 Part 2: Simple and Complex Types

2.1.3 Part 3: Atomic Types

2.1.4 Part 4: Sequences

2.1.5 Sequence Types

3 Type Manipulation

3.1 Upper and Lower Bounds

3.1.1 Minimum of two bounds

3.1.2 Maximum of two bounds

3.1.3 Sum of two bounds

3.2 Aggregate Types

3.2.1 Item Type Union

3.2.2 Sequence Type Union

3.3.3 Sequence Type Addition

4 Data Model

4.1 Literals

4.2 EQNames and Wildcards

4.2.1 Accepts Namespace Types

4.3 Annotations

5 Operation Tree

5.1 Expressions

5.1.1 Concatenating Expressions

5.1.2 Try/Catch Expressions

5.1.3 Context Item Expressions

5.1.4 Lookup Expressions

5.1.5 Sequence Type Expressions

5.1.6 FLWOR Expressions

5.1.6.1 For Clauses

5.1.6.2 Let Clauses

5.2 Path Steps

5.2.1 Abbreviated Syntax

5.3 Namespace Declarations

5.4 Annotated PSI Nodes

5.5 Variables

5.5.1 Variable References

5.5.2 Variable Declarations

5.5.3 Variable Bindings

5.5.3.1 Parameters

5.5.3.2 Assignable Variables

5.5.3.3 Collection Bindings

5.6 Functions

5.7 Constructable Items

5.7.1 Maps

5.7.2 Arrays

5.7.3 Node Constructors

5.7.3.1 Document Nodes

5.7.3.2 Element Nodes

5.7.3.3 Attribute Nodes

5.7.3.4 Namespace Nodes

5.7.3.5 Processing Instruction Nodes

5.7.3.6 Comment Nodes

5.7.3.7 Text Nodes

6 Accessors

6.1 namespace-attributes Accessor

6.1.1 Nodes

6.1.2 namespace attributes Property

A References

A.1 W3C References

A.2 XPath NG Proposals