$stop_words
$stop_words :
A list of frequently occurring terms for this locale, which should be excluded from certain kinds of queries. This list is also used for language detection.
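As a rough illustration of how such a list can be used to exclude terms from a query, here is a minimal sketch. The sample words, the helper name removeStopWords, and the whitespace-based splitting are all assumptions made for this example; they are not taken from the actual tokenizer code.

```php
<?php
// Illustrative sample only; a real locale defines its own, longer $stop_words.
$stop_words = ['ang', 'ng', 'sa', 'na', 'at'];

/**
 * Hypothetical helper (not part of the documented API): lowercases a query,
 * splits it on whitespace, and drops any term found in the stop-word list.
 */
function removeStopWords(string $query, array $stop_words): array
{
    // strtolower is ASCII-only, which keeps this sketch free of extensions
    $terms = preg_split('/\s+/', strtolower(trim($query)), -1,
        PREG_SPLIT_NO_EMPTY);
    // array_diff keeps the terms that do not appear in $stop_words
    return array_values(array_diff($terms, $stop_words));
}

$terms = removeStopWords('Ang bahay sa bundok', $stop_words);
print_r($terms);
```

With the sample list above, only the content-bearing terms of the query remain for matching.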
Tagalog (spoken in the Philippines) specific tokenization code.
Typically, tokenizer.php either contains a stemmer for the language in question or it specifies how many characters in a char gram