ArangoDB Datasets
arango_datasets
is a Python package for loading sample datasets into ArangoDB
You can use the arango_datasets
package in conjunction with the python-arango
driver to load example data into your ArangoDB deployments. The data is hosted
on AWS S3. There are a number of existing datasets already available and you can
view them by calling the list_datasets()
method as shown below.
Install
To install the Python package, you can use the pip
command to directly install
it from PyPi :
pip install arango-datasets
You can find the source code repository of the module on GitHub: https://github.com/arangoml/arangodb_datasets
Usage
Once you have installed the arango_datasets
package, you can use it to
download and import datasets into your deployment with arango_datasets.Datasets
.
The Datasets
constructor requires a valid python-arango
database object as input. It defines the target deployment, database, and
credentials to load a dataset.
from arango import ArangoClient
db = ArangoClient(hosts='http://localhost:8529').db("dbName", username="root", password="")
Pass the database object to the Datasets
constructor:
from arango_datasets import Datasets
datasets = Datasets(db)
List the available datasets:
print(datasets.list_datasets())
List more information about a particular dataset:
print(datasets.dataset_info("IMDB_X"))
Import the dataset:
datasets.load("IMDB_X")