ArangoDB v3.13 is under development and not released yet. This documentation is not final and potentially incomplete.
Highlights by Version
The most notable features in the Community and Enterprise Edition of ArangoDB, grouped by version
Version 3.13
All Editions
Enterprise Edition
Also see What’s New in 3.13.
Version 3.12
All Editions
Improved memory accounting: Better tracking and observability of memory consumption for ArangoDB deployments and reduced memory usage.
wildcard
Analyzer: AccelerateLIKE
searches with_
and%
wildcards against Views and inverted indexes with n-grams to quickly find candidate matches.Multi-dimensional indexes: An index type to efficiently intersect multiple range queries, like finding all appointments that intersect a time range. Optionally with prefix fields, with support for using it as vertex-centric index in graph traversals.
External versioning: Specify any top-level attribute to compare whether the version number is higher than the currently stored one when updating or replacing documents.
Improved dump performance Create logical backups faster with arangodump thanks to parallel dumping at the shard level, as well as transfer compression and file splitting.
Request and response compression Speed up data transfers and reduce traffic with transparent compression of requests and responses between ArangoDB servers and client tools.
Enterprise Edition
- ArangoSearch WAND optimization: Retrieve search results for the highest-ranking matches from Views faster by defining a list of sort expressions to optimize.
Also see What’s New in 3.12.
Version 3.11
All Editions
Parallel gather: Faster, more memory-efficient processing of cluster queries by combining results on Coordinators in parallel.
Index cache refilling: Automatically repopulate in-memory index caches after writes that affect an edge index or cache-enabled persistent indexes to maximize cache hits and thus query performance.
Enterprise Edition
ArangoSearch column cache: Always cache field normalization values, Geo Analyzer auxiliary data, stored values, primary sort columns, and primary key columns in memory to improve the performance of Views and inverted indexes.
geo_s2
Analyzer: Efficiently index geo-spatial data using different binary formats, tuning the size on disk, the precision, and query performance.
Also see What’s New in 3.11.
Version 3.10
All Editions
Native ARM Support: Packages for the ARM architecture are now available, including native support for Apple silicon.
Computed Values: Persistent document attributes that are generated when documents are created or modified, using an AQL expression.
Inverted indexes: A new, eventually consistent index type that can accelerate a broad range of queries, providing similar search capabilities as
arangosearch
Views, but defined per collection and simpler to use.search-alias
Views: Add inverted indexes tosearch-alias
Views for searching multiple collections at once, with ranking and search highlighting capabilities, as a lightweight alternative toarangosearch
Views.Persistent indexes: An optional In-memory Cache for faster lookups and Stored Values to let persistent indexes cover additional attributes of projections.
AQL Graph Traversals: All Shortest Paths allows you to query for all paths of shortest length between two documents.
Enterprise Edition
EnterpriseGraphs: A new specialized version of SmartGraphs, with an automatic sharding key selection.
Search highlighting: Get the substring positions of matched terms, phrases, or n-grams.
Nested search: Match arrays of objects with all the conditions met by a single sub-object, and define for how many of the elements this must be true.
ArangoSearch: New
minhash
Analyzer for locality-sensitive hashing to approximate the Jaccard similarity, with inverted index andarangosearch
View support that allows you to implement entity resolution.Parallelism for sharded graphs: Parallel execution of AQL traversal queries with many start vertices for all types of sharded graphs, leading to faster results.
Traversal Projections: Optimized data loading for AQL traversal queries if only a few document attributes are accessed.
Read from followers in clusters: Allow dirty reads so that Coordinators can read from any shard replica and not only from the leader, for scaling reads.
Also see What’s New in 3.10.
Version 3.9
All Editions
- ArangoSearch: New Segmentation Analyzer for language-agnostic tokenization of text. A Collation Analyzer to honor the alphabetical order of the specified language in range queries.
Enterprise Edition
- (Disjoint) SmartGraphs using SatelliteCollections: SatelliteCollections can be used in (Disjoint) SmartGraphs to enable more local execution of graph queries.
Also see What’s New in 3.9.
Version 3.8
All Editions
Weighted traversals and k Paths: Two new AQL graph traversal methods to emit paths in order of increasing weights and to enumerate all paths between a source and a target vertex that match a given length.
ArangoSearch: New Pipeline Analyzer that allows you to combine multiple Analyzers, enabling case-insensitive n-gram-based fuzzy search and more. New AQL Analyzer so that you can use an AQL query to pre-process and filter your data for indexing. Support for geo-spatial queries through new Geo Analyzers and ArangoSearch Geo functions. A new Stop words Analyzer that can be used standalone or in an Analyzer pipeline.
A
WINDOW
operation for aggregations over adjacent rows, value ranges or time windows.
Enterprise Edition
- Encryption at Rest utilizes hardware acceleration capabilities of modern CPUs.
Also see What’s New in 3.8.
Version 3.7
All Editions
ArangoSearch: Wildcard and fuzzy search (Levenshtein distance and n-gram based), enhanced phrase and proximity search, improved late document materialization and Views covering queries using their indexes without touching the storage engine, as well as a new SIMD-based index format for faster processing and stemming support for 15 additional languages.
Schema Validation: Enforce a JSON Schema for documents on collection level. Invalid documents can be rejected automatically by the database system, making it easy to maintain data quality.
Insert-Update and Insert-Ignore: New document API operations to upsert documents and to efficiently insert documents while skipping the creation if the document exists already.
AQL: Improved subquery and graph traversal performance, among many optimizations and enhancements.
HTTP/2 support: Better load-balancer and Kubernetes compatibility, improved request throughput.
Enterprise Edition
SatelliteGraphs: Synchronously replicated graphs with local traversal execution.
Disjoint SmartGraphs: Improve traversal execution times for SmartGraphs without edges between vertices with different SmartGraph attributes.
Traversal parallelization: Optional parallel execution of nested traversals for single servers and OneShard clusters.
Security: Added support for multiple JWT Secrets and the ability to hot-reload them from disk, TLS key and certificate rotation, Encryption at rest key rotation and Server Name Indication (SNI).
Also see What’s New in 3.7.
Version 3.6
All Editions
AQL: Improved query performance thanks to early pruning, subquery splicing, late document materialization, parallelization for certain cluster queries and more. New server-side
maxRuntime
option for queries.ArangoSearch: New Analyzer options for edge n-grams (
text
Analyzer), UTF-8 encoded n-gram input and optional start/end markers (ngram
Analyzer). Support for dynamic expressions using arrays (array comparison operators inSEARCH
queries and theTOKENS()
/PHRASE()
functions accept arrays). Views can benefit from the SmartJoins optimization.
Enterprise Edition
- OneShard deployments offer a practicable solution that enables significant performance improvements by massively reducing cluster-internal communication. A database created with OneShard enabled is limited to a single DB-Server node but still replicated synchronously to ensure resilience. This configuration allows running transactions with ACID guarantees on shard leaders.
Also see What’s New in 3.6.
Version 3.5
All Editions
ArangoSearch: The search and ranking engine received an upgrade and now features Configurable Analyzers, Sorted Views and several improvements to the AQL integration.
AQL Graph Traversals: k Shortest Paths allows you to query not just for one shortest path between two documents but multiple, sorted by length or weight. With PRUNE you can stop walking down certain paths early in a graph traversal to improve its efficiency.
Stream Transaction API: Perform multi-document transactions with individual begin and commit / abort commands using the new HTTP endpoints or via a supported driver.
Time-to-Live Indexes: TTL indexes can be used to automatically remove documents in collections for use cases like expiring sessions or automatic purging of statistics or logs.
Index Hints & Named Indexes : Indexes can be given names and an optional AQL inline query option
indexHint
was added to override the internal optimizer decision on which index to utilize.Data Masking: arangodump provides a convenient way to extract production data but mask critical information that should not be visible.
Enterprise Edition
Hot Backups: Create automatic, consistent backups of your cluster without noticeable impact on your production systems. In contrast to arangodump, hot backups are taken on the level of the underlying storage engine and hence both backup and restore are considerably faster.
SmartJoins: Run joins between identically sharded collections with performance close to that of a local join operation.
Advanced Data Masking: There are additional data masking functions available in the Enterprise Edition, such as for substituting email addresses and phone numbers with similar looking pseudo-data.
Also see What’s New in 3.5.
Version 3.4
All Editions
ArangoSearch: Search and similarity ranking engine integrated natively into ArangoDB and AQL. ArangoSearch combines Boolean retrieval capabilities with generalized ranking algorithms (BM25, TFDIF). Support of e.g. relevance-based searching, phrase and prefix-matching, complex boolean searches and query time relevance tuning. Search can be combined with all supported data models in a single query. Many specialized language Analyzers are already included for e.g. English, German, French, Chinese, Spanish and many other language.
GeoJSON Support and S2 Geo Index: ArangoDB now supports all geo primitives. (Multi-)Point, (Multi-)LineStrings, (Multi-)Polygons or intersections can be defined and queried for. The Google S2 geo index is optimized for RocksDB and enables efficient querying. Geo query results are automatically visualized with an OpenStreetMap integration within the Query Editor of the web interface.
Query Profiler: Enables the analysis of queries and adds additional information for the user to identify optimization potentials more easily. The profiler can be accessed via arangosh with
db._profileQuery(...)
or via the Profile button in the Query Editor of the web interface.Streaming Cursors: Cursors requested with the stream option on make queries calculate results on the fly and make them available for the client in a streaming fashion, as soon as possible.
RocksDB as Default Storage Engine: With ArangoDB 3.4 the default storage engine for fresh installations will switch from MMFiles to RocksDB. Many optimizations have been made to RocksDB since the first release in 3.2. For 3.4 we optimized the binary storage format for improved insertion, implemented “optional caching”, reduced the replication catch-up time and much more.
Also see What’s New in 3.4.
Version 3.3
Enterprise Edition
Datacenter-to-Datacenter Replication: Replicate the entire structure and content of an ArangoDB cluster asynchronously to another cluster in a different datacenter with ArangoSync. Multi-datacenter support means you can fallback to a replica of your cluster in case of a disaster in one datacenter.
Encrypted Backups: arangodump can create backups encrypted with a secret key using AES256 block cipher.
All Editions
Server-level Replication: In addition to per-database replication, there is now an additional
globalApplier
. Start the global replication on the Follower once and all current and future databases will be replicated from the Leader to the Follower automatically.Asynchronous Failover: Make a single server instance resilient with a second server instance, one as Leader and the other as asynchronously replicating Follower, with automatic failover to the Follower if the Leader goes down.
Also see What’s New in 3.3.
Version 3.2
All Editions
RocksDB Storage Engine: You can now use as much data in ArangoDB as you can fit on your disk. Plus, you can enjoy performance boosts on writes by having only document-level locks.
Pregel We implemented distributed graph processing with Pregel to discover hidden patterns, identify communities and perform in-depth analytics of large graph data sets.
Fault-Tolerant Foxx: The Foxx management internals have been rewritten from the ground up to make sure multi-coordinator cluster setups always keep their services in sync and new Coordinators are fully initialized even when all existing Coordinators are unavailable.
Enterprise Edition
LDAP integration: Users and permissions can be managed from outside ArangoDB with an LDAP server in different authentication configurations.
Encryption at Rest: Let the server persist your sensitive data strongly encrypted to protect it even if the physical storage medium gets stolen.
SatelliteCollections: Faster join operations when working with sharded datasets by synchronously replicating selected collections to all DB-Servers in a cluster, so that joins can be executed locally.
Also see What’s New in 3.2.
Version 3.1
All Editions
Vertex-centric indexes: AQL traversal queries can utilize secondary edge collection indexes for better performance against graphs with supernodes.
VelocyPack over HTTP : In addition to JSON, the binary storage format VelocyPack can now also be used in transport over the HTTP protocol.
Enterprise Edition
SmartGraphs: Scale with graphs to a cluster and stay performant. With SmartGraphs you can use the “smartness” of your application layer to shard your graph efficiently to your machines and let traversals run locally.
Encryption Control: Choose your level of SSL encryption
Auditing: Keep a detailed log of all the important things that happened in ArangoDB.
Also see What’s New in 3.1.
Version 3.0
self-organizing cluster with synchronous replication, master/master setup, shared nothing architecture, cluster management Agency.
Deeply integrated, native AQL graph traversal
VelocyPack as new internal binary storage format as well as for intermediate AQL values.
Persistent indexes via RocksDB suitable for sorting and range queries.
Foxx 3.0: overhauled JS framework for data-centric microservices
Significantly improved Web Interface
Also see What’s New in 3.0.