Skip to main content
Skip table of contents

DTCP Schema Guide

This guide provides a short description of each element and attribute defined in the XML schema for the DATACOMPARATOR Pipeline (DTCP). A fuller description of DTCP, along with an example and an explanation of the main concepts can be found in the DTCP User Guide.

Element Index

Element Detail

Elements are listed in document-tree order, top-level elements first, then alphabetically:

Element: dataComparator 1

The root element for defining the overrides to a DataComparator whose defaults are as described in the API documentation.

A Data Comparator instance with default settings is created if no child elements are present.

Contained by

/

Contains
AttributeTypeDescription
idanyNameType A unique identifier for this pipeline configuration - listed as 'configuration id' in the command-line description.
versionstring The DTCP specification version this conforms to - fixed at '1.0'
descriptionstring Short description of this DTCP configuration.
ElementDescription
advancedConfig Configuration options providing low-level control of the comparison, more general configuration options are in 'standardConfig'
extensionPoints Declare the extension points and contained filters to be inserted within the DataComparator pipeline
fullDescription Designed to provide meaningful description and basic help information to the user
pipelineParameters Container for all pipeline parameters
standardConfig Genaral configuration options for the DataComparator - see 'advancedConfig' for further options

Element: advancedConfig 2

Configuration options providing low-level control of the comparison, more general configuration options are in 'standardConfig'

Contained by
Contains
ElementDescription
outputProperties Set Serializer property settings for the built in Saxon Serializer
parserFeatures Set features on the underlying SAX parser used in the pipeline
parserProperties Set properties on the underlying SAX parser used in the pipeline
transformerConfigurationProperties Set configuration option on the Saxon XSLT transformers used in the pipeline

Element: extensionPoints 3

Declare the extension points and contained filters to be inserted within the DataComparator pipeline.

In EBNF the required sequence S of child elements is:

  • S := 'inputPreFlatteningPoint'? IP 'outputExtensionPoints'?
  • IP := 'inputExtensionPoints'? | ( 'inputAExtensionPoints'? 'inputBExtensionPoints'? )

Contained by
Contains
ElementDescription
inputExtensionPoints Extension points for modifying A and B input filters
outputExtensionPoints Extension points for modifying output filter chains

Element: fullDescription 4

Designed to provide meaningful description and basic help information to the user.

It can contain PCDATA content. It should include a description of the Data Comparator configuration defined by the DTCP. How this information is presented to users is a tool-dependent operation, for example a GUI-based tool may provide a pop-up window and show HTML formatted content.

Contained by
Contains
ElementDescription
[any]Any element permitted [mixed content]

Element: pipelineParameters 5

Container for all pipeline parameters.

Pipeline parameters have global scope and are referenced using the 'paremeterRef' attribute. Pipeline parameters have a default value that can be overridden through the API. The maximum number of child elements is not restricted.

Contained by
Contains
ElementDescription
booleanParameter Declare a boolean parameter that may be referenced by 'parameterRef' attributes or as $variables from within XPath expressions
stringParameter Declare a string parameter that may be referenced by 'parameterRef' attributes or as $variables from within XPath expressions

Element: standardConfig 6

Genaral configuration options for the DataComparator - see 'advancedConfig' for further options.

Contained by
Contains
ElementDescription
attributeSplittingConfig Specifies configuration options for attribute splitting
comparisonReport Specifies whether and where to generate comparison report which contains comparison analysis and recommendations to improve comparison result
ignoreChangesConfig Specifies configuration options for Ignore Changes processing
moveDetectionConfig Specifies configuration options for moves
namespaceConfiguration Defines a set of prefix uri pairs for namespaces that can be used within XPath expressions provided to certain configurations such as ignoreChangesConfig or subtreeProcessingMode
resultReadabilityOptions Sets options to change the granularity and ordering of changes in the result in order to improve readability
subtreeProcessingMode Specifies configuration options for subtree processing

Element: outputProperties 7

Set Serializer property settings for the built in Saxon Serializer.

Contained by
Contains
ElementDescription
property Sets the string value of a named property

Element: parserFeatures 8

Set features on the underlying SAX parser used in the pipeline.

For more detail, see setParserFeature in the API documentation.

Contained by
Contains
ElementDescription
feature Sets the boolean value of a named feature

Element: parserProperties 9

Set properties on the underlying SAX parser used in the pipeline.

For more detail, see setParserProperty in the API documentation.

Contained by
Contains
ElementDescription
property Sets the string value of a named property

Element: transformerConfigurationProperties 10

Set configuration option on the Saxon XSLT transformers used in the pipeline.

The maximum number of child elements is not restricted.

Contained by
Contains
ElementDescription
booleanProperty A named boolean property
stringProperty A named string property

Element: inputExtensionPoints 11

Extension points for modifying A and B input filters

Contained by
Contains
ElementDescription
inputPoint Extension point for modifying A and B input filters, after element flattening
inputPreFlatteningPoint Extension point for modifying A and B input filters, before element flattening

Element: outputExtensionPoints 12

Extension points for modifying output filter chains.

Contained by
Contains
ElementDescription
finalPoint Final filter extension point in the DataComparator output pipeline

Element: booleanParameter 13

Declare a boolean parameter that may be referenced by 'parameterRef' attributes or as $variables from within XPath expressions.

Contained by
Contains
AttributeTypeDescription
nameNCName The boolean parameter name
defaultValueboolean The default boolean value - may be overriden externally
ElementDescription
description Short summary of the purpose of the parameter

Element: stringParameter 14

Declare a string parameter that may be referenced by 'parameterRef' attributes or as $variables from within XPath expressions.

Contained by
Contains
AttributeTypeDescription
nameNCName The string parameter name
defaultValuestring The default string value - may be overriden externally
ElementDescription
description Short summary of the purpose of the parameter

Element: attributeSplittingConfig 15

Specifies configuration options for attribute splitting.

These options can be specified on a comparator to configure its behaviour. Attribute Splitting is off by default.

Contained by
Contains
AttributeTypeDescription
defaultMode[enum]

Permitted values / descriptions:


narrativeText
This mode sets the default mode to narrative text.
dataSet
This mode sets the default mode to data set.
dataList
This mode sets the default mode to data list.
ElementDescription
attributeLocations This class is used to represent locations for attributes which will be processed and split
isEnabled Sets whether the comparator should enable the feature or not

Element: comparisonReport 16

Specifies whether and where to generate comparison report which contains comparison analysis and recommendations to improve comparison result.

Contained by
Contains
ElementDescription
generateReport Specifies whether to generate comparison report
reportDirectory Specifies where to generate comparison report

Element: ignoreChangesConfig 17

Specifies configuration options for Ignore Changes processing.

These options can be specified on a comparator to configure its behaviour.

Contained by
Contains
ElementDescription
locations This class is used to represent locations for elements and attributes on which the changes will be ignored

Element: moveDetectionConfig 18

Specifies configuration options for moves.

These configuration options can be specified on a comparator to configure its behaviour.

Contained by
Contains
ElementDescription
isEnabled Sets whether the comparator should enable the feature or not
moveCandidates This class is used to represent move candidates
moveDetectionType Sets the type of move detection, determining how move processing should be executed
showMoveSource Sets whether the comparator should show move source or not

Element: namespaceConfiguration 19

Defines a set of prefix uri pairs for namespaces that can be used within XPath expressions provided to certain configurations such as ignoreChangesConfig or subtreeProcessingMode.

Contained by
Contains
ElementDescription
defaultNamespace This element is used to define the default namespace
userNamespaces This element is used to define user namespaces

Element: resultReadabilityOptions 20

Sets options to change the granularity and ordering of changes in the result in order to improve readability.

Contained by
Contains
ElementDescription
changeGatheringEnabled Sets whether to change the order of consecutive changed items to improve readability
characterByCharacterEnabled Sets whether to enable character by character comparison
elementSplittingEnabled Sets whether modified elements containing text should be split when the amount of unchanged text falls below a given percentage
elementSplittingThreshold Sets the percentage of unchanged text present in a modified element below which the element will be split
mixedContentDetectionScope Set the scope to use for determining if each element in the document is of a mixed-content type
modifiedWhitespaceBehaviour Set the ModifiedWhitespaceBehaviour to use for changes to whitespace
orphanedWordDetectionEnabled States whether or not orphaned word detection is enabled
orphanedWordLengthLimit Sets the maximum number of words to consider for orphaned word detection
orphanedWordMaxPercentage Sets the maximum proportion of the total change size that orphaned words can take while still being considered orphans

Element: subtreeProcessingMode 21

Specifies configuration options for subtree processing.

These options can be specified on a comparator to configure its behaviour.

Contained by
Contains
AttributeTypeDescription
defaultMode[enum]

Permitted values / descriptions:


text
This mode sets the subtree mode to text.
data
This mode sets the subtree mode to data.
ElementDescription
subtrees This class is used to represent subtrees

Element: property 22

Sets the string value of a named property

Contained by
Contains
AttributeTypeDescription
nameanyNameTypeThe parameter name
literalValuestring The literal string value
parameterRefstringName of referenced pipelineParameter
xpathXPath expression returning the required type

Element: feature 23

Sets the boolean value of a named feature.

Contained by
Contains
AttributeTypeDescription
literalValueboolean The literal boolean value for the feature setting.
nameanyURI The fully qualitifed feature name.
parameterRefstringName of referenced pipelineParameter
xpathXPath expression returning the required type

Element: booleanProperty 24

A named boolean property

Contained by
Contains
AttributeTypeDescription
nameanyNameTypeThe parameter name
literalValuebooleanThe literal boolean value
parameterRefstringName of referenced pipelineParameter
xpathXPath expression returning the required type

Element: stringProperty 25

A named string property

Contained by
Contains
AttributeTypeDescription
nameanyNameTypeThe parameter name
literalValuestring The literal string value
parameterRefstringName of referenced pipelineParameter
xpathXPath expression returning the required type

Element: inputPoint 26

Extension point for modifying A and B input filters, after element flattening.

Contained by
Contains
ElementDescription
filter An XSLT or Java XML processing filter to be loaded into the comparator pipeline

Element: inputPreFlatteningPoint 27

Extension point for modifying A and B input filters, before element flattening.

Contained by
Contains
ElementDescription
filter An XSLT or Java XML processing filter to be loaded into the comparator pipeline

Element: finalPoint 28

Final filter extension point in the DataComparator output pipeline.

Contained by
Contains
ElementDescription
filter An XSLT or Java XML processing filter to be loaded into the comparator pipeline

Element: description 29

Short summary of the purpose of the parameter.

Contained by
Contains

Type: xs:string

Element: attributeLocations 30

This class is used to represent locations for attributes which will be processed and split.

Contained by
Contains
ElementDescription
attributeLocation This is used to represent an attribute location XPath, separator, regular expression or output separator

Element: isEnabled 31

Sets whether the comparator should enable the feature or not.

Contained by
Contains
AttributeTypeDescription
literalValuebooleanSet 'true' to enable the feature.
parameterRefstringName of referenced pipelineParameter
xpathXPath expression returning the required type

Element: generateReport 32

Specifies whether to generate comparison report.

Contained by
Contains
AttributeTypeDescription
literalValueboolean Set 'true' to generate comparison report as an additional output.
parameterRefstringName of referenced pipelineParameter
xpathXPath expression returning the required type

Element: reportDirectory 33

Specifies where to generate comparison report.

Contained by
Contains
AttributeTypeDescription
literalValuestring The location of the directory to write the comparison report to.
parameterRefstringName of referenced pipelineParameter
xpathXPath expression returning the required type

Element: locations 34

This class is used to represent locations for elements and attributes on which the changes will be ignored.

Contained by
Contains
ElementDescription
location This is used to represent location XPath and result rule

Element: moveCandidates 35

This class is used to represent move candidates.

Contained by
Contains
ElementDescription
moveCandidate This is used to represent move candidate element XPath and class XPath pairs

Element: moveDetectionType 36

Sets the type of move detection, determining how move processing should be executed.

Contained by
Contains
AttributeTypeDescription
literalValue[enum]

Permitted values / descriptions:


restricted
This mode only detects moves at the roots of added or deleted subtrees. This may offer a performance benefit in some cases.
unrestricted
This mode is used to detect moves anywhere in the tree.
parameterRefstringName of referenced pipelineParameter
xpathXPath expression returning the required type

Element: showMoveSource 37

Sets whether the comparator should show move source or not.

Contained by
Contains
AttributeTypeDescription
literalValuebooleanSet 'true' to show move source.
parameterRefstringName of referenced pipelineParameter
xpathXPath expression returning the required type

Element: defaultNamespace 38

This element is used to define the default namespace.

Contained by
Contains
AttributeTypeDescription
uriSets the uri of the default namespace.

Element: userNamespaces 39

This element is used to define user namespaces.

Contained by
Contains
ElementDescription
userNamespace This element is used to define user namespace

Element: changeGatheringEnabled 40

Sets whether to change the order of consecutive changed items to improve readability.

If the result contains a sequence of elements whose deltaxml:deltaV2 attribute values are mixed up in a sequence of As and Bs, enabling this feature will cause them to be reordered so that they are not mixed.

Contained by
Contains
AttributeTypeDescription
literalValuebooleanSet true to enabled change gathering
parameterRefstringName of referenced pipelineParameter
xpathXPath expression returning the required type

Element: characterByCharacterEnabled 41

Sets whether to enable character by character comparison

Contained by
Contains
AttributeTypeDescription
literalValuebooleanSet true to enable character by character comparison
parameterRefstringName of referenced pipelineParameter
xpathXPath expression returning the required type

Element: elementSplittingEnabled 42

Sets whether modified elements containing text should be split when the amount of unchanged text falls below a given percentage.

Contained by
Contains
AttributeTypeDescription
literalValuebooleanSet true to enable element splitting
parameterRefstringName of referenced pipelineParameter
xpathXPath expression returning the required type

Element: elementSplittingThreshold 43

Sets the percentage of unchanged text present in a modified element below which the element will be split.

Contained by
Contains
AttributeTypeDescription
literalValuePercentage The threshold percentage as in integer (1 to 100)
parameterRefstringName of referenced pipelineParameter
xpathXPath expression returning the required type

Element: mixedContentDetectionScope 44

Set the scope to use for determining if each element in the document is of a mixed-content type.

The mixed content type affects whitespace processing. If DTD or XML Schema validation is used this setting has no effect.

Contained by
Contains
AttributeTypeDescription
literalValue[enum]

Permitted values / descriptions:


document
Determine if an element is mixed-content using information from elements of the same name in the document. Using this scope significantly slows processing for large files.
local
Determine mixed content information for each element in turn, based on the contents of that element alone.
parameterRefstringName of referenced pipelineParameter
xpathXPath expression returning the required type

Element: modifiedWhitespaceBehaviour 45

Set the ModifiedWhitespaceBehaviour to use for changes to whitespace.

Here, both documents must have some whitespace at a given point in order for there to be a change in whitespace. This will then be processed in accordance with the specified behaviour. Whitespace insertions and deletions are not affected by the modified whitespace behaviour.

Contained by
Contains
AttributeTypeDescription
literalValue[enum]

Permitted values / descriptions:


useDefault
The context dependent automatic whitespace setting.
ignore
Ignore differences in whitespace that is not explicitly preserved.
keepA
Similar to 'ignore' except that 'A' document's whitespace is kept (instead of the 'B' document's whitespace).
normalize
Normalize whitespace in inputs before comparison.
show
Display the differences in whitespace where possible
parameterRefstringName of referenced pipelineParameter
xpathXPath expression returning the required type

Element: orphanedWordDetectionEnabled 46

States whether or not orphaned word detection is enabled.

Contained by
Contains
AttributeTypeDescription
literalValueboolean Enable/disable.
parameterRefstringName of referenced pipelineParameter
xpathXPath expression returning the required type

Element: orphanedWordLengthLimit 47

Sets the maximum number of words to consider for orphaned word detection.

Sequences of words longer than the specified length will never be detected as orphaned words, regardless of the amount of changed words around them.

Contained by
Contains
AttributeTypeDescription
literalValueunsignedLong
parameterRefstringName of referenced pipelineParameter
xpathXPath expression returning the required type

Element: orphanedWordMaxPercentage 48

Sets the maximum proportion of the total change size that orphaned words can take while still being considered orphans.

If the percentage value for a possibly orphaned section is less than or equal to this value, then it is classified as orphaned (unless there are more words than the length limit allows). The percentage value for a possibly orphaned section is calculated as follows:

Contained by
Contains
AttributeTypeDescription
literalValuePercentageAn integer value (1 to 100) that is the max percentage of the total change size were a change is considered to be orphaned.
parameterRefstringName of referenced pipelineParameter
xpathXPath expression returning the required type

Element: subtrees 49

This class is used to represent subtrees.

Contained by
Contains
ElementDescription
subtree This is used to represent subtree XPath and type

Element: filter 50

An XSLT or Java XML processing filter to be loaded into the comparator pipeline.

There must be one 'class', 'http', 'resource' or 'file' child element for a filter element as this defines the filter type and how it is to be loaded. Attributes on the filter element may be used to control whether the filter is enabled or disabled.
Child 'parameter' elements may also be added so that parameter values are passed on to matching parameters in the XML filter. Any number of filter elements may be added to an extension point, filters are processed in the pipeline in order of occurrence.

Contained by
Contains
AttributeTypeDescription
ifNCName Enable filter when named boolean pipelineParameter is true.
unlessNCName Disable filter when named boolean pipelineParameter is true.
whenstring Enable filter when XPath expression evaluates true.
ElementDescription
class Load a Java class implementing the SAX XMLFilter interface from the ClassPath
file Load an XSLT filter from the file system
http Load XSLT filter from an identified HTTP resource
parameter A named parameter to supply to a filter - any XPath-item type (including a sequence) can be supplied to an XSLT filter using the xpath attribute
resource Load an XSLT filter as a resource in a jar file

Element: attributeLocation 51

This is used to represent an attribute location XPath, separator, regular expression or output separator.

Contained by
Contains
AttributeTypeDescription
enabledbooleanSet if attribute splitting is enabled or disabled for the attribute at this location
attributeXpathSet the attribute XPath
mode[enum]

Permitted values / descriptions:


narrativeText
This mode sets the default mode to narrative text.
dataSet
This mode sets the default mode to data set.
dataList
This mode sets the default mode to data list.
separatorstringSets the separator used to split the attribute text.
regexstringSets the regular expression used to split the attribute text.
outputTokenSeparatorstringSets the output separator used to separate tokens in the output.

Element: location 52

This is used to represent location XPath and result rule.

Contained by
Contains
AttributeTypeDescription
ignoreXpathSets the node XPath.
resultRule[enum]

Permitted values / descriptions:


BA
Default. This copies new value if it exists, otherwise copy old value.
AB
This copies old value if it exists, otherwise copy new value.
A
This copies old value if it exists, otherwise don’t output.
B
This copies new value if it exists, otherwise don’t output
DELETE
Don’t copy under any circumstances (but process the subtree if present)

Element: moveCandidate 53

This is used to represent move candidate element XPath and class XPath pairs.

Contained by
Contains
AttributeTypeDescription
elemXpathSets the element Xpath.
classXpathSets the class Xpath.

Element: userNamespace 54

This element is used to define user namespace.

Contained by
Contains
AttributeTypeDescription
prefixSets the prefix of the user namespace.
uriSets the uri of the user namespace.

Element: subtree 55

This is used to represent subtree XPath and type.

Contained by
Contains
AttributeTypeDescription
elemXpathSets the element XPath.
mode[enum]

Permitted values / descriptions:


text
This mode sets the subtree mode to text.
data
This mode sets the subtree mode to data.
orderedbooleanSets the boolean where true represents ordered alignment, and false represents unordered alignment.
ElementDescription
childAlignment This class represents child alignment specifiers for the subtree configuration

Element: class 56

Load a Java class implementing the SAX XMLFilter interface from the ClassPath.

Contained by
Contains
AttributeTypeDescription
nameanyNameTypeThe fully qualified name of the class.

Element: file 57

Load an XSLT filter from the file system.

Contained by
Contains
AttributeTypeDescription
pathstring The path of the filter to be loaded, relative paths are resolved according to the setting of the 'relBase' attribute.
relBase[enum]The relBase attribute is used to specify how the relative path to a file is resolved.

Permitted values / descriptions:


current
Resolve using the current working directory, obtained from the Java user.dir system property.
home
Resolve using the user's home directory.
dxp
Resolve using the directory containing the DXP file, when it is loaded from a file.

Element: http 58

Load XSLT filter from an identified HTTP resource.

Contained by
Contains
AttributeTypeDescription
urlanyURI The URL of the HTTP resource.

Element: parameter 59

A named parameter to supply to a filter - any XPath-item type (including a sequence) can be supplied to an XSLT filter using the xpath attribute.

Contained by
Contains
AttributeTypeDescription
nameanyNameTypeThe parameter name
literalValuestring The literal string value
parameterRefstringName of referenced pipelineParameter
xpathXPath expression returning the required type

Element: resource 60

Load an XSLT filter as a resource in a jar file.

Contained by
Contains
AttributeTypeDescription
nameanyNameType The resource name, for example, '/xsl/resource.xsl'

Element: childAlignment 61

This class represents child alignment specifiers for the subtree configuration.

Contained by
Contains
AttributeTypeDescription
childXpathSets the child xpath
keyXpathSets the key xpath specifying the key for the associated child element

This documentation was auto-generated from the XML Schema XSD.

JavaScript errors detected

Please note, these errors can depend on your browser setup.

If this problem persists, please contact our support.