Recent Books
Data Curation
Projects
Descriptions of funded projects/DARPA programs, etc.
Datasets
Descriptions of datasets and how to work with them.
Physio Data Collection
This information is for ML4AI researchers collecting human subjects data in the Lang Lab.
Teaching
Lab Manual
Public
Documentation for the general public.
Recently Updated Pages
Onboarding Checklist
You will have received an email titled "Set up your ML4AI Lab account" with a secure link. Click...
Managing SSH keys with Kanidm
For lab members who joined before 2026-05-22. orca's logins are now managed centrally through Ka...
LiteLLM
The lab runs a LiteLLM proxy that gives you access to large language models running on the lab's ...
Monitoring
Overview The lab uses a self-hosted monitoring stack to track CPU, GPU, memory, disk, network,...
Next-Generation Teams
Good teamwork enables teams to perform beyond the sum of their parts. The next generation of team...
Compute and Storage
Compute The ML4AI lab has the following compute VMs: VM Name CPU RAM GPUs ...
PosgreSQL on Debian
There is an instance of PostgreSQL running on the orca VM. Here are some pointers on working with...
Exporting Postgres DB to SQLite DB
This is a step-by-step procedure for exporting all tables and data from a Postgres database to a ...
Accessing the ToMCAT Dataset
Overview This guide provides instructions on how to access the ToMCAT database using either the p...
Populating or updating the ToMCAT database
This procedure will show how to setup, initialize, and configure the virtual environment needed t...