Glossary
- candidate
Any node (target or not) brought into the ruleset by a dom() or element() call for consideration.
- fnode
A wrapper around a DOM node, holding scores, notes, and types pertaining to it. See Fnodes.
- note
An arbitrary, opaque-to-Fathom piece of data attached to a given type on a fnode. Notes can be consulted by scoring callbacks and are a good place to park expensive-to-recompute information. They are the main way of passing data between rules.
- ruleset
The unordered collection of rules that forms a Fathom program. See Writing Rules for more on the relationships between top-level constructs.
- score
The fuzzy-edged part of fnode state. A floating-point number, typically between 0 and 1, attached to a certain type on a fnode. A score represents the confidence with which a node belongs to that type.
- subscore
A single rule’s contribution to a node’s score for some type. In Fathom’s current incarnation as a series of (single-layer) perceptrons, each rule’s subscore is multiplied by a coefficient, which is derived from training. The weighted subscores are then added together and fed through a sigmoid function to get the final score for a node for a type.
- target
A “right answer” DOM node, one that should be recognized as belonging to some type.
- type
A string-typed category assigned to a fnode. Types are the boolean, hard-edged, enumerated parts of fnode state. They also largely determine inter-rule dependencies and thus which rules get run in response to a query.
- vectorize
To turn a collection of sample HTML pages into vectors of numbers which the trainer then imbibes.
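
The arithmetic described under subscore (weight each rule's subscore by its trained coefficient, sum the results, and pass the sum through a sigmoid) can be sketched roughly as below. This is an illustration under stated assumptions, not Fathom's actual implementation: the function names sigmoid and combineSubscores are hypothetical, the standard logistic function is assumed for the sigmoid, and any additional terms a real trainer might learn are left out.

```javascript
// Illustrative sketch only; these names are hypothetical and not part of Fathom's API.

// Standard logistic sigmoid: maps any real number into the interval (0, 1).
function sigmoid(x) {
    return 1 / (1 + Math.exp(-x));
}

// Combine per-rule subscores into a single score for one node and one type:
// multiply each subscore by its coefficient, add the products, then squash.
function combineSubscores(subscores, coefficients) {
    if (subscores.length !== coefficients.length) {
        throw new Error('Expected exactly one coefficient per subscore.');
    }
    let weightedSum = 0;
    for (let i = 0; i < subscores.length; i++) {
        weightedSum += subscores[i] * coefficients[i];
    }
    return sigmoid(weightedSum);
}
```

With no rules contributing, the weighted sum is 0 and the resulting score is 0.5, that is, no confidence either way. Positive weighted sums push the score toward 1 and negative ones push it toward 0.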