Skip to content

Instantly share code, notes, and snippets.

@platypii
platypii / webpack-hyparquet-demo.js
Last active January 2, 2025 18:49
Bundle Hyparquet with Webpack
import { asyncBufferFromUrl, parquetRead } from 'hyparquet'
// Load parquet data from a url using hyparquet
const url = 'https://hyperparam-public.s3.amazonaws.com/bunnies.parquet'
async function main() {
const file = await asyncBufferFromUrl({ url })
await parquetRead({
file,
onComplete: (data) => console.log(data),
})
@platypii
platypii / lambda-hyparquet.js
Last active January 2, 2025 18:50
Hyparquet Lambda function to generate parquet file metadata when a file is uploaded to S3
import { S3Client, GetObjectCommand } from '@aws-sdk/client-s3'
const s3 = new S3Client()
import { parquetMetadata, toJson } from 'hyparquet'
export async function handler(event) {
console.log('Event received:', JSON.stringify(event, null, 2))
// The event structure can contain multiple records if multiple files are uploaded at once.
for (const record of event.Records) {
const bucket = record.s3.bucket.name