The OpenEQA dataset includes questions, answers, and episode histories
The question-answer pairs are in open-eqa-v0.json and the instructions below describe how to download the episode histories.
Episode histories are sourced form HM3D and ScanNet. Follow the instructions below to download the episode histories from both sources.
For HM3D episode histories, we provide two options: (1) directly download the RGB frames only. This is hosted by a third party repo (not by Meta), and may not be future proof, but can be faster or (2) extract the RGB, depth, and camera pose information from HM3D using the Habitat simulator.
The RGB frames for the HM3D episode histories are available in this third party location (12 Gb). You can use the following commands to download and extract the data:
wget -O open-eqa-hm3d-frames-v0.tgz <link above>
md5sum open-eqa-hm3d-frames-v0.tgz # 286aa5d2fda99f4ed1567ae212998370
mkdir -p data/frames
tar -xzf open-eqa-hm3d-frames-v0.tgz -C data/frames
rm open-eqa-hm3d-frames-v0.tgz
Afterwards, your top-level directory structure should look like this:
|- data
|- frames
|- hm3d-v0
|- 000-hm3d-BFRyYbPCCPE
|- ...
| - openeqa
| - ...
Step 1: Install the Habitat simulator (version 2.5) by following the instructions here.
For example, on a headless server use:
conda install habitat-sim==0.2.5 headless -c conda-forge -c aihabitat
Step 2: Download the HM3D validation data by first following the instructions here to get access to the dataset.
Then, use your api token-id and token-secret with this command:
python -m habitat_sim.utils.datasets_download --username <api-token-id> --password <api-token-secret> --uids hm3d_val_v0.2
Step 3: Download the agent state information for the episode histories form here.
wget -O open-eqa-hm3d-states-v0.tgz "https://www.dropbox.com/scl/fi/wg1uj1gvr4tkcz9aq3tzb/open-eqa-hm3d-states-v0.tgz?rlkey=i69chnpib8ui4cfabxa3iy9oj"
md5sum open-eqa-hm3d-states-v0.tgz # 0db1a723573a5b9f5139e0df2ae960e0
mkdir -p data/frames
tar -xzf open-eqa-hm3d-states-v0.tgz -C data/frames
rm open-eqa-hm3d-states-v0.tgz
Afterwards, your directory structure should look like this:
|- data
|- frames
|- hm3d-v0
|- scene_datasets
|- hm3d
| - openeqa
| - ...
Step 4: Extract frames from HM3D scenes.
You can either only extract RGB frames or extract RGB, depth, camera intrinsics, and camera pose information.
Format | Size | Extraction Time |
---|---|---|
RGB-only | 12 Gb | ~1 hrs |
RGB-D + Intrinsics + Pose | 16Gb Gb | ~1.5 hrs |
To extract only the RGB frames, run:
python data/hm3d/extract-frames.py --rgb-only
To extract the RGB, depth, camera intrinsics, and camera pose information, run:
python data/hm3d/extract-frames.py
Afterwards, your top-level directory structure should look like this:
|- data
|- frames
|- hm3d-v0
|- 000-hm3d-BFRyYbPCCPE
|- ...
|- ...
| - openeqa
| - ...
Step 1: Download ScanNet by following the instructions here.
Place the data in data/raw/scannet
. Afterwards, your top-level directory should look like this:
|- data
|- raw
|- scannet
|- scans
|- <scanId>
|- <scanId>.sens
|- ...
|- scans_test
|- <scanId>
|- <scanId>.sens
|- ...
|- openeqa
|- ...
Step 2: Extract episode histories
You can either only extract RGB frames or extract RGB, depth, camera intrinsics, and camera pose information.
Format | Size | Extraction Time |
---|---|---|
RGB-only | 62 Gb | ~8 hrs |
RGB-D + Intrinsics + Pose | 70 Gb | ~10 hrs |
To extract only the RGB frames, run:
python data/scannet/extract-frames.py --rgb-only
To extract the RGB, depth, camera intrinsics, and camera pose information, run:
python data/scannet/extract-frames.py
Afterwards, your directory structure should look like this:
|- data
|- frames
|- scannet-v0
|- 002-scannet-scene0709_00
|- ...
|- raw
| - openeqa
| - ...