Use the WORD operator to search for a specific data pattern. A location will be returned as a match if the data pattern is found in the location.
Using the WORD operator, you can define a word, a sentence, or any string of characters as the data pattern to search for.
The generic GLASS syntax for WORD is:
WORD '<data pattern>'
For example, search for a specific:
Account
), or
WORD 'Account'
Employee ID
), or
WORD 'Employee ID'
62
), or
WORD '62'
<<<<<
).
WORD '<<<<<'
Matches are not limited by traditional word boundaries (e.g. whitespaces, new lines) and can happen anywhere in a string or data stream, unless pattern boundaries are defined.
You write a simple GLASS expression to search for the term ID:
WORD 'ID'
As no pattern boundaries are defined, all the following lines will be returned as match locations by the GLASS pattern matching engine:
1 | Employee ID: 1234567890 |
2 | IDENTITY CARD |
3 | AMOUNT PAID $ 150.33 |
:
, whitespace
, comma ,
),
use the BOUND operator.The equivalent configuration in GLASS Studio Visual Builder mode is:
The NOCASE (No Case) option determines if the GLASS pattern matching engine treats uppercase and lowercase characters as distinct (case sensitive) or equal (case-insensitive) characters.
By default, GLASS patterns are case sensitive.
This means uppercase and lowercase characters are distinct characters.
For example, lowercase a
and uppercase A
are treated as different
characters by the GLASS pattern matching engine.
The GLASS syntax to define a case-sensitive WORD is:
WORD '<data pattern>'
If the NOCASE
(No Case) option is
specified or checked in
GLASS Studio Visual Builder
mode, uppercase and lowercase characters are equal.
For example, lowercase a
and uppercase A
are treated as the same character
by the GLASS pattern matching engine.
The GLASS syntax to define a case-insensitive WORD is:
WORD NOCASE '<data pattern>'
WORD NOCASE 'ID'
As the NOCASE option is specified, all the following lines will be returned as match locations by the GLASS pattern matching engine:
1 | Employee ID: 1234567890 |
2 | Identity Card |
3 | Amount Paid $ 150.33 |
The equivalent configuration in GLASS Studio Visual Builder mode is:
If the DECOMPOSE (Decompose) option is specified or checked, the GLASS pattern matching engine matches the search pattern both in its original form and normalized (ASCII) form, where applicable.
This is useful when searching for data patterns that contain language-specific
accents or diacritics (e.g. Zürich
).
The GLASS syntax to decompose a WORD is:
WORD DECOMPOSE '<data pattern>'
Your organization receives a subject access request from an individual named José María. You create a search pattern to find all data storage locations across your organization that contain personal information pertaining to José María.
WORD NOCASE DECOMPOSE 'José María'
Using the custom GLASS expression above, all the following lines will be returned as match locations by the GLASS pattern matching engine:
1 | José María |
2 | josé maría |
3 | Jose Maria |
The equivalent configuration in GLASS Studio Visual Builder mode is:
If the DECOMPOSE option is not used in Example 3, Line 3 will not be returned as a match location by the GLASS pattern matching engine.
José Maria
will
not match as the GLASS engine searches for the
WORD pattern where all characters with accents or diacritics are
either in their original form (e.g. é and
í), or in their normalized form (e.g.
e and i).
You can define the following pattern rules for the WORD data pattern:
The BOUND precision rule lets you define the list possible characters that the must be found before (BOUND LEFT), after (BOUND RIGHT), or surrounding (BOUND) a WORD for it to be a match.
See BOUND Rule for more information.
To search for a specific data pattern using the WORD operator in GLASS Studio Visual Builder Mode:
To edit the WORD component:
To duplicate the WORD component:
See Duplicating a Component for more information.
To remove the WORD component along with the corresponding GLASS code:
To change from WORD to another base pattern:
Click on WORD at the top bar and select a different
base pattern.