Dockerfile assumes the following structure in the ./data
folder
tree ./data
data/
├── groundingdino_swint_ogc.pth
├── GroundingDINO_SwinT_OGC.py
└── huggingface
└── hub
├── models--bert-base-uncasee
| | <huggingface llava model>
├── models--llava-hf--vip-llava-7b-hf
| | <huggingface llava model>
└── version.txt
The Grounding Dino and Bert models can be found in this github release. The Llava model is too large to host there. If you don't have a local version you can copy over, the model will be downloaded at runtime. If you need to do that, I reccomend saving the model to host, and then rebuilding the docker file, as downloading Llava can take a long time.
To build the image, you'll need access to the following repos. They will be cloned during the build.
The build script assumes a github key (currently id_github
), change this for your needs
The build the image
sh build.sh
The following will run the autonomy stack on the jackal
- Jackal autonomy
roslaunch infrastructure_jackal jackal_autonomy.launch
- vision
roslaunch open_vocab_vision_ros vison.launch
- planning
roslaunch llm_planning_ros planner.launch
planner.launch
doesn't not load a prior map. Replace with your needs.
Warning This records a lot of data.
- [optional] recording
rosrun infrastructure_jackal record_jackal_data.sh
rossservice call /llm_planning_ros/task "task: 'YOUR TASK'"