You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This [SQLite3 loadable extension](https://www.sqlite.org/loadext.html) adds features to the [ubiquitous](https://www.sqlite.org/mostdeployed.html) embedded RDBMS supporting applications in genome bioinformatics:
6
6
@@ -10,8 +10,32 @@ This [SQLite3 loadable extension](https://www.sqlite.org/loadext.html) adds feat
10
10
11
11
Notice: this project is not associated with the SQLite developers.
12
12
13
+
### Use cases
14
+
15
+
The extension makes SQLite an efficient foundation for:
16
+
17
+
1. Integrative genomics data warehouse
18
+
19
+
* BED, GFF/GTF, FASTA, FASTQ, SAM, VCF, ...
20
+
21
+
* One file, zero administration, portable between platforms and languages
3. Transactional storage engine for API services, incremental reanalysis, real-time basecalling & metagenomics, ...
26
+
27
+
4. Experimental new data models, before dedicated storage format & tooling are warranted, if ever.
28
+
29
+
### Contraindications
30
+
31
+
1. Huge numerical arrays: see [HDF5](https://www.hdfgroup.org/solutions/hdf5/), [Zarr](https://zarr.readthedocs.io/en/stable/), [Parquet](https://parquet.apache.org/), [Arrow](https://arrow.apache.org/). <small>SQLite's [BLOB I/O](https://www.sqlite.org/c3ref/blob_open.html) leaves the door open for mash-ups!</small>
32
+
33
+
2. Parallel SQL analytics / OLAP: see [Spark](https://spark.apache.org/), [DuckDB](https://duckdb.org/), many commercial products. <small>Some bases can be covered with a sharding harness for a pool of threads with their own SQLite connections...</small>
34
+
35
+
3. Streaming: SQLite I/O, while often highly sequential in practice, relies on randomly seeking throughout the database file.
0 commit comments