ArangoDB v3.13 is under development and not released yet. This documentation is not final and potentially incomplete.

Searching Full-text with ArangoSearch

Search for tokens in full-text that can occur in any order

Full-text strings can be tokenized by text Analyzers so that each token (usually a word) gets indexed separately. Subsequently, it becomes possible to search for individual tokens. This is also possible with arrays of strings, where each element is one token (or multiple, additionally tokenized by a text Analyzer).

There are two ways to search for tokens:

  • Token search: The tokens can occur in any order. The words you search for merely need to be contained in the source string somehow.
  • Phrase search: The order of the tokens needs to match the query. This allows you to search for partial or full sentences.

Token search is covered below. For phrase search see Phrase and Proximity Search.

Dataset

IMDB movie dataset

View definition

db.imdb_vertices.ensureIndex({ name: "inv-text", type: "inverted", fields: [ { name: "description", analyzer: "text_en" } ] });
db._createView("imdb_alias", "search-alias", { indexes: [ { collection: "imdb_vertices", index: "inv-text" } ] });
{
  "links": {
    "imdb_vertices": {
      "fields": {
        "description": {
          "analyzers": [
            "text_en"
          ]
        }
      }
    }
  }
}

AQL queries

Example: Match one out of multiple tokens

Search for movies with dinosaur or park (or both) in their description.

FOR doc IN imdb_alias
  SEARCH doc.description IN TOKENS("dinosaur park", "text_en")
  RETURN {
    title: doc.title,
    description: doc.description
  }
FOR doc IN imdb
  SEARCH ANALYZER(doc.description IN TOKENS("dinosaur park", "text_en"), "text_en")
  RETURN {
    title: doc.title,
    description: doc.description
  }
titledescription
Alone in the Wilderness… the Aleutian Peninsula, in what is now Lake Clark National Park. …
Madagascar Collection… The plot follows the adventures of story of four Central Park Zoo animals …
Sea Monsters: A Prehistoric AdventureJourney 80 million years back in time to an age when mighty dinosaurs dominated the land …
Land of the Lost… their only ally in a world full of dinosaurs and other fantastic creatures.

Example: Match multiple tokens

Search for movies with both dinosaur and park in their description:

FOR doc IN imdb_alias
  SEARCH TOKENS("dinosaur park", "text_en") ALL == doc.description
  RETURN {
    title: doc.title,
    description: doc.description
  }
FOR doc IN imdb
  SEARCH ANALYZER(TOKENS("dinosaur park", "text_en") ALL == doc.description, "text_en")
  RETURN {
    title: doc.title,
    description: doc.description
  }
titledescription
Jurassic Park Series… the Jurassic Park films. This trilogy of films brings all our childhood fantasies to life with dinosaurs
Jurassic ParkA wealthy entrepreneur secretly creates a theme park featuring living dinosaurs
The Lost World: Jurassic ParkFour years after Jurassic Park’s genetically bred dinosaurs ran amok …
Cesta do pravěku… Four young boys visit a dinosaur exhibit … They then row out onto Central Park Lake …

The results will be the same if you use park dinosaur as search terms, as the tokens can be ordered arbitrarily.

Example: Match a certain number of tokens

Search for movies with two out of the three provided tokens in their description, so dinosaur and scientist, dinosaur and moon, or scientist and moon, in any order:

FOR doc IN imdb_alias
  SEARCH TOKENS("dinosaur scientist moon", "text_en") AT LEAST (2) == doc.description
  RETURN {
    title: doc.title,
    description: doc.description
  }
FOR doc IN imdb
  SEARCH ANALYZER(TOKENS("dinosaur scientist moon", "text_en") AT LEAST (2) == doc.description, "text_en")
  RETURN {
    title: doc.title,
    description: doc.description
  }
titledescription
EarthstormA massive asteroid impact on the moon … a team of scientists turn to one man … to find a solution and secure the moon.
100 Million BCModern scientists discover the deadly consequences of time travel when they retrieve a flesh-eating dinosaur
The Lost WorldProfessor Challenger leads an expedition of scientists … to verify his claim that dinosaurs still live there.
The Lost WorldA scientist discovers dinosaurs on a remote plateau in Mongolia.
By Rocket to the MoonA scientist discovers that there’s gold on the moon
Doctor XA wisecracking New York reporter intrudes on a research scientist’s quest to unmask The Moon Killer.