Conecta - Overview.
This is the documentation for Conecta,
a python library designed to load data from SQL databases to arrow
with maximum speed
and memory efficiency by leveraging zero-copy and true concurrency.
Getting started
The fastest way to get started is to run:
And start loading data:
from conecta import read_sql
table = read_sql(
"postgres://user:password@localhost:5400/database",
queries=["select * from lineitem"],
partition_on="l_orderkey",
partition_num=4
)
Documentation overview:
This documentation follows Diátaxis
I recommend that you start on the how-to guide Load data
Features
- Connection pooling
- Real parallel multithreading
- Zero-copy mindset, data is only copied once
- Rich datatypes