To Nha Notes | Aug. 9, 2024, 1:43 p.m.
DuckDB can query data stored in S3 directly through its httpfs extension, without downloading the files first, which makes it convenient for analyzing large datasets. Here’s a quick guide:
Connect to DuckDB: Launch DuckDB from your terminal by running:
duckdb
Create Secrets Configuration: Securely store your S3 credentials in DuckDB using:
CREATE SECRET secret1 (
    TYPE S3,
    KEY_ID 'your-access-key-id',
    SECRET 'your-secret-access-key',
    REGION 'your-region'
);
Query S3 Data: Once the secret is stored, DuckDB applies it automatically whenever a query reads from an s3:// URI.
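The original note does not include the query itself, so the following is only a minimal sketch of what this step might look like. The bucket name and object paths are hypothetical placeholders, and it assumes the files are Parquet or CSV; adjust both to match your data.

```sql
-- Hypothetical bucket and path: replace with your own S3 location.
-- The S3 secret created in the previous step is used automatically.
SELECT *
FROM read_parquet('s3://your-bucket/path/to/data.parquet')
LIMIT 10;

-- A CSV file can be read the same way (again, a placeholder path).
SELECT count(*) AS row_count
FROM read_csv('s3://your-bucket/path/to/data.csv');
```

If DuckDB reports that the httpfs extension is missing, running INSTALL httpfs; followed by LOAD httpfs; before the query should resolve it.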
https://duckdb.org/docs/extensions/httpfs/s3api
https://github.com/davidgasquez/awesome-duckdb?tab=readme-ov-file