QOSI

QOSI - Quadrillion Open Source Indexer

Glossary

Basic Descriptors

Object

Target

Source

Candidate

Identified

Index

Immutable


Dataset

Dataset

Source Dataset

Target Dataset


Token and Sequence

Token

Syntax Token

String Token

Sequence and Subsequence

Minimum Token Sequence (MTS) and Minimum Token Sequence Unit (MTSU)

Minimum Subsequence (MSs) (or Minimum Token Subsequence (MTSs))

Hash Function


File and Tokenized File

File

Tokenized File

Token Index


Partitioning


Query

Query, Query Request, and Query Result

Candidate Query

Definition: A preliminary, broad search that returns only a list of candidate files from the Source.
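
As an illustration only, a Candidate Query could be sketched as a lookup against an index of hashed token subsequences. This glossary does not specify how QOSI implements the search, so everything in the sketch is an assumption: the function names (`subsequence_hashes`, `build_index`, `candidate_query`), the window size `n`, the `min_shared` threshold, and the choice of SHA-1 as the hash function are all hypothetical.

```python
from collections import defaultdict
from hashlib import sha1


def subsequence_hashes(tokens, n=3):
    """Hash every window of n consecutive tokens (n is an assumed parameter)."""
    return {
        sha1("\x1f".join(tokens[i:i + n]).encode("utf-8")).hexdigest()
        for i in range(len(tokens) - n + 1)
    }


def build_index(source_files, n=3):
    """Map each subsequence hash to the set of source file names containing it."""
    index = defaultdict(set)
    for name, tokens in source_files.items():
        for h in subsequence_hashes(tokens, n):
            index[h].add(name)
    return index


def candidate_query(index, target_tokens, n=3, min_shared=1):
    """Return names of source files sharing at least min_shared hashes with the target."""
    shared = defaultdict(int)
    for h in subsequence_hashes(target_tokens, n):
        for name in index.get(h, ()):
            shared[name] += 1
    return sorted(name for name, count in shared.items() if count >= min_shared)


# Hypothetical tokenized source files (illustrative data, not from QOSI).
source_files = {
    "a.c": ["int", "x", "=", "1", ";"],
    "b.c": ["return", "0", ";"],
}
index = build_index(source_files)
candidates = candidate_query(index, ["int", "x", "=", "2", ";"])
```

The result is only a list of file names. Consistent with the definition, nothing about where or how strongly a candidate matches is returned at this stage.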

Batch Query


Preprocess Phase

Preprocessor

Tokenizer

File Archiver


Index Phase

Candidate Index (The Index)

Indexer (Source File Indexer)

Index Compressor


Query Phase

Querier

Nominator


Comparison Phase

Merger

File Extractor

Comparator

Evaluator

Reporter


Partitioning

Language Family


Parameter

Parameter