Database
PlasmidNet uses a local SQLite database built from PLSDB and NCBI data.
Data Sources
Source |
Plasmids |
Description |
|---|---|---|
72,556 |
Curated plasmid database (version 2024_05_31_v2) |
|
109,806 |
Additional complete plasmids not in PLSDB |
|
Total |
182,362 |
Tables
Table |
Rows |
Description |
|---|---|---|
|
72,556 |
Plasmid metadata (accession, length, GC, topology, dates) |
|
7,439 |
Host taxonomy (kingdom to strain) |
|
72,556 |
MOB typing (Inc groups, mobility, relaxase, MPF, pMLST) |
|
600,896 |
AMR gene annotations from PLSDB + NCBI (gene, drug class, positions) |
|
60,619 |
PlasmidFinder typing results |
|
333,648 |
MOB suite detailed markers |
|
613,902 |
PGAP annotations (TA, phage, transposase) |
|
23,522 |
Geographic locations with host category |
|
109,806 |
NCBI plasmids not in PLSDB |
Building the Database
# 1. Download PLSDB CSVs from Figshare
mkdir -p data/raw
# Files are downloaded automatically by build_db.py
# 2. Build SQLite database
python build_db.py
# 3. (Optional) Harvest additional plasmids from NCBI
python ncbi_harvest.py
# 4. Rebuild with NCBI data
python build_db.py
Updating
To update when PLSDB releases a new version:
Download new CSVs from Figshare
Place in
data/raw/Run
python build_db.pyRun
python ncbi_harvest.pyto catch new NCBI plasmidsDelete
data/correlations_cache.jsonto rebuild correlations
Database Size
SQLite file: ~200 MB
Startup memory: ~107 MB (correlations cached to JSON)
Heroku compatible: Built during deploy via
bin/post_compile