The KDL Query Language is a small language specially tailored for querying KDL documents to extract nodes and even specific data. It is loosely based on CSS selectors for familiarity and ease of use. Think of it as CSS Selectors or XPath, but for KDL!
This document describes KQL next
. It is unreleased.
Selectors use selection operators to filter nodes that will be returned by an
API using KQL. The main differences between this and CSS selectors are the
lack of *
(use []
instead), the specific syntax for descendants and siblings, and the specific syntax for
matchers (the stuff between [
and ]
), which is similar, but not identical to CSS.
a > b
: Selects anyb
element that is a direct child of ana
element.a >> b
: Selects anyb
element that is a descendant of ana
element.a >> b || a >> c
: Selects allb
andc
elements that are descendants of ana
element. Any selector may be on either side of the||
. Multiple||
are supported.a + b
: Selects anyb
element that is placed immediately after a siblinga
element.a ++ b
: Selects anyb
element that follows ana
element as a sibling, either immediately or later.[accessor()]
: Selects any element, filtered by an accessor. (accessor()
is a placeholder, not an actual accessor)a[accessor()]
: Selects anya
element, filtered by an accessor.[]
: Selects any element.
Matchers are used to filter nodes by their various attributes (such as values,
properties, node names, etc). With the exception of top()
and ()
, they are all
used inside a []
selector. Some matchers are unary, but most of them involve
binary operators.
The top()
matcher can only be used as the first matcher of a selector. This means
that it cannot be the right operand of the >
, >>
, +
, or ++
operators. As ||
combines selectors, the top()
can appear just after it. For instance,
a > b || top() > b
is valid, but a > top()
is not.
top()
: Returns all toplevel children of the current document.top() > []
: Equivalent totop()
on its own.(foo)
: Selects any element whose type annotation isfoo
.()
: Selects any element with any type annotation.[val()]
: Selects any element with a value.[val(1)]
: Selects any element with a second value.[prop(foo)]
: Selects any element with a property namedfoo
.[prop]
: Selects any element with a property namedprop
.
Attribute matchers support certain binary operators:
[val() = 1]
: Selects any element whose first value is 1.[prop(name) = 1]
: Selects any element with a propertyname
whose value is 1.[name = 1]
: Equivalent to the above.[name() = hi]
: Selects any element whose node name is "hi". Equivalent to justhi
, but more useful when using string operators.[tag() = hi]
: Selects any element whose tag is "hi". Equivalent to just(hi)
, but more useful when using string operators.[val() != 1]
: Selects any element whose first value exists, and is not 1.
The following operators work with any val()
or prop()
values.
If the value is not of the same type, the operator will always fail ("1" is
never coerced to 1, and there is no "universal" ordering across all types.):
[val() > 1]
: Selects any element whose first value is greater than 1.[val() >= 1]
: Selects any element whose first value is greater than or equal to 1.[val() < 1]
: Selects any element whose first value is less than 1.[val() <= 1]
: Selects any element whose first value is less than or equal to 1.
The following operators work only with string val()
, prop()
, tag()
, or name()
values.
If the value is not a string, the matcher will always fail:
[val() ^= foo]
: Selects any element whose first value starts with "foo".[val() $= foo]
: Selects any element whose first value ends with "foo".[val() *= foo]
: Selects any element whose first value contains "foo".
The following operators work only with val()
or prop()
values. If the value
is not one of those, the matcher will always fail:
[val() = (foo)]
: Selects any element whose type annotation isfoo
.
Given this document:
package {
name foo
version "1.0.0"
dependencies platform=windows {
winapi "1.0.0" path="./crates/my-winapi-fork"
}
dependencies {
miette "2.0.0" dev=#true integrity=(sri)sha512-deadbeef
}
}
Then the following queries are valid:
package >> name
- -> fetches the
name
node itself
- -> fetches the
top() > package >> name
- -> fetches the
name
node, guaranteeing thatpackage
is in the document root.
- -> fetches the
dependencies
- -> deep-fetches both
dependencies
nodes
- -> deep-fetches both
dependencies[platform]
- -> fetches any dependencies nodes with a
platform
prop (just the one, in this case)
- -> fetches any dependencies nodes with a
dependencies[prop(platform)]
- -> Identical to the above. Plain identifiers are equivalent to
prop(<identifier>)
.
- -> Identical to the above. Plain identifiers are equivalent to
dependencies > []
- -> fetches all direct-child nodes of any
dependencies
nodes in the document. In this case, it will fetch bothmiette
andwinapi
nodes.
- -> fetches all direct-child nodes of any
Rules that are not defined in this grammar are prefixed with $
, see the KDL
grammar for
what they expand to.
query-str := $bom? query
query := selector q-ws* "||" q-ws* query | selector
selector := filter q-ws* selector-operator q-ws* selector-subsequent | filter
selector-subsequent := matchers q-ws* selector-operator q-ws* selector-subsequent | matchers
selector-operator := ">>" | ">" | "++" | "+"
filter := "top(" q-ws* ")" | matchers
matchers := type-matcher $string? accessor-matcher* | $string accessor-matcher* | accessor-matcher+
type-matcher := "(" q-ws* ")" | $type
accessor-matcher := "[" q-ws* (comparison | accessor)? q-ws* "]"
comparison := accessor q-ws* matcher-operator q-ws* ($type | $string | $number | $keyword)
accessor := "val(" q-ws* $integer q-ws* ")" | "prop(" q-ws* $string q-ws* ")" | "name(" q-ws* ")" | "tag(" q-ws* ")" | "values(" q-ws* ")" | "props(" q-ws* ")" | $string
matcher-operator := "=" | "!=" | ">" | "<" | ">=" | "<=" | "^=" | "$=" | "*="
q-ws := $plain-node-space