This document curates my professional activities during my 5 years and 10 months at AWS (from August 2019 to May 2025), including blog posts, presentation decks, YouTube videos, and open-source GitHub projects. The content primarily focuses on cloud computing with AWS, covering topics such as Generative AI, LLMs, RAG, Data Analytics, Machine Learning, MLOps, and System Architecture.
- How Selectstar handles AI Red teaming API stream processing using Amazon API Gateway WebSocket / 셀렉트스타의 Amazon API Gateway WebSocket 을 활용한 AI Red teaming API 스트림 처리 방법 (2024-05-29) 👥
- Hosting Langfuse with AWS CDK Python using Amazon ECS and AWS Fargate / Amazon ECS와 AWS Fargate를 사용하여 AWS CDK Python으로 Langfuse 호스팅하기 (2024-05-22)
- Klleon's Case Study: Reducing Digital Human Generative Model Inference Costs by 50% with AWS Inferentia / 클레온의 AWS Inferentia를 이용한 디지털 휴먼 생성 모델 추론 비용 50% 절감 사례 (2024-05-21) 👥
- Journey to Implementing a Medical Q&A Chatbot with Amazon Bedrock / Amazon Bedrock 으로 만든 의료 전문 Q&A 챗봇 구현 여정 (2024-05-17) 👥
- Deploying the LLaVA-NeXT-Video Multi-modal Inference Model on Amazon SageMaker / 이미지 비디오 Multi-modal 추론 모델, LLaVA-NeXT-Video 모델을 Amazon SageMaker에 배포하기 (2024-05-13)
- Automating OpenAI Whisper Model Deployment on Amazon SageMaker Endpoint with AWS CDK / AWS CDK를 활용한 OpenAI Whisper 모델 Amazon SageMaker Endpoint 배포 자동화 (2024-05-09)
- Nota AI's Guide to Easily Deploying Transformer Models on AWS Inferentia and Trainium / Nota AI가 제안하는 Transformer 모델을 AWS Inferentia과Trainium에 손쉽게 배포하는 방법 (2024-05-08) 👥
- LinqAlpha's Company Screener Agent for Hedge Funds using Amazon Bedrock and Amazon OpenSearch / LinqAlpha 의 Amazon Bedrock과 Amazon OpenSearch 를 활용한 헤지펀드 투자사를 위한 Company Screener Agent (2024-05-07) 👥
- Musicow's Easy Guide to Building a Zero-ETL CDC Pipeline between Amazon RDS and Amazon Redshift / 뮤직카우의 Amazon RDS와 Amazon Redshift 간 CDC 파이프라인 Zero-ETL로 쉽게 구축하기 (2024-05-23) 👥
- Building a SaaS Metering System with AWS Analytics Services / AWS 분석 서비스를 활용하여 SaaS 미터링 시스템 구축하기 (2024-05-16) 👥
- Building a Real-time CDC Data Collection and Analysis Pipeline with AWS DMS / AWS DMS를 이용한 CDC 데이터 실시간 수집 및 분석 데이터 파이프라인 구축하기 (2024-05-14) 👥
- AWS Customer Story: Dable Utilizes AWS Big Data Solutions for Real-time Recommendation Service / AWS 고객사례: Dable, 실시간 추천 서비스에 AWS 빅데이터 솔루션 활용 (2017-01-01) 👥
- Develop an automatic review image inspection service with Amazon SageMaker (2022-01-10) 👥
- Amazon SageMaker기반 무신사 상품 후기 이미지 자동 검수 서비스 개발 사례 (2022-01-18) 👥
- DeepSeek on AWS (2025-03-19)
- RAG from Concept to Implementation - Knowledge Bases for Amazon Bedrock (2024-04-19)
- RAG Architecture: From Concept to Implementation (2024-03-21)
- Generative AI on AWS - Deep Dive (2023-06-27)
- JumpStart to Build Generative AI with Amazon SageMaker (2023-03-31)
- Building Transactional Data Lake using Amazon DataFirehose and Apache Iceberg (2025-02-19)
- Building a Modern Transactional Data Lake for Real-time CDC Data Processing / 실시간 CDC 데이터 처리! Modern Transactional Data Lake 구축하기 (2023-05-08)
- Modern Transactional Data Lake using Apache Iceberg in Amazon S3 (2023-04-26)
- Streaming Data Processing in Real-time: Amazon Kinesis Data Streams vs. MSK (2023-04-20)
- Analytics 101 - Build BI System from Scratch (2023-04-12)
- Real-time Analytics on AWS (2022-06-23)
- Introduction to Amazon Athena (2022-06-23)
- Amazon Athena Essential Tips (2022-05-24)
- AWS Analytics Immersion Day - Build BI System from Scratch (2022-04-22)
- Choose Right Stream Storage: Amazon Kinesis Data Streams vs MSK (2022-04-21)
- Amazon Fraud Detector (2023-03-28)
- Amazon SageMaker Model Deployment Strategies (2022-07-27)
- Amazon SageMaker Canvas - a Visual, No-Code, AutoML tool for Business Analysts (2022-07-13)
- Getting started with AI/ML on AWS (2022-06-14)
- Octember - Social Graph Based People Recommender with Amazon Neptune and Textract (2022-06-07)
- Learn ML by building Computer Vision Application with Amazon Rekognition and SageMaker (2022-05-04)
- Amazon Personalize Recipes Deep Dive (2022-05-04)
- Building a Data Analysis System for Recommendation Systems / 추천 시스템을 위한 데이터 분석 시스템 구축하기 (2022-04-26)
- Amazon Personalize - Principles and Case Studies of Recommendation Systems / Amazon Personalize - 추천 시스템의 원리와 구축 사례 (2022-04-26)
- Building a Deep Learning-Based Image Search Service with Amazon SageMaker / Amazon SageMaker로 딥 러닝 기반 이미지 검색 서비스 만들기 (2022-04-19)
- Catching Up on Machine Learning Concepts in an Hour / 1시간 만에 머신 러닝 개념 따라잡기 (2022-04-19)
- End-to-End Machine Learning with Amazon SageMaker (2022-04-19)
- Amazon Bedrock과 SageMaker를 활용한 DeepSeek R1 모델 배포 및 운영 방법 (2025-07-14)
- RAG 개념부터 구현까지 - Knowledge Bases for Amazon Bedrock (2024-07-10)
- RAG 아키텍처 – 개념부터 구현까지 (2024-05-28)
- LLM 앱 디버깅 툴, Langfuse를 Amazon ECS에 배포하는 방법 (2024-09-04)
- 클릭 몇 번만으로 ChatGPT 같은 생성 AI (Generative AI) 모델 만들기 (2023-05-12)
- 트웰브랩스의 AWS 기반 초거대 영상 이해/언어 모델 - 개발부터 서비스 구축까지 - AWS Industry Week 2024 (2025-01-10)
- 초거대 영상 이해 모델 스타트업 트웰브랩스의 AI 인프라 고도화 여정 - AWS Summit Korea 2025 (2025-08-13)
- Amazon Data Firehose와 Apache Iceberg를 이용한 Transactional Data Lake 구축하기 (2025-02-19)
- (데모) Amazon Data Firehose와 Apache Iceberg를 이용한 Transactional Data Lake 구축하기 (2025-02-19)
- Modern Transactional Data Lake (2023-11-21)
- 실시간 CDC 데이터 처리! Modern Transactional Data Lake 구축하기 - AWS Summit Seoul 2023 (2023-09-04)
- 데이터 분석 실시간으로 처리하기: Kinesis Data Streams vs MSK - AWS Summit Korea 2022 (2022-09-07)
- Tappytoon 데이터 분석 파이프라인 구축기 (2021-12-02)
- AWS 서비스를 이용하여 실시간 분석 시스템 구축하기 (2021-12-02)
- Amazon Athena에 대해 알아보기 (2021-11-22)
- Amazon Kinesis Data Streams와 MSK를 비교해 보기 (2021-11-22)
- AWS 데이터 분석 시스템 구축 | Part 1. 개념 및 워크 플로우 (2021-11-22)
- AWS 데이터 분석 시스템 구축 | Part 2. 데모로 확인하기 (2021-11-22)
- GI VITA는 어떻게 MLOps를 구축했을까? (2022-09-30)
- 코딩 없이 머신 러닝 모델 학습하는 방법, Amazon SageMaker Canvas (2022-09-30)
- 테이블 하나로 시작해서 AI 프로덕트까지,무신사의 AI Transformation 여정 - AWS Summit Korea 2022 (2022-09-06)
- 부정 거래 탐지를 위한 Amazon Fraud Detector 서비스의 활용 방법 (2022-07-08)
- Amazon Rekognition을 이용한 이미지 분석 및 검색 서비스 만들기 (2022-05-03)
- Amazon Rekognition Custom Labels를 이용한 나만의 이미지 분석 모델 만들기 (2022-05-03)
- Amazon SageMaker로 딥 러닝 기반 이미지 검색 서비스 만들기 | 개념부터 구현까지 완전 정복 (2022-04-06)
- Amazon SageMaker로 딥 러닝 기반 이미지 검색 서비스 만들기 | 데모로 확인하기 (2022-04-06)
- 스타트업을 위한 AWS의 AI/ML 서비스 활용 방법 및 도입 전략 (2021-12-02)
- Amazon Textact와 Amazon Neptune을 이용한 인맥 추천 서비스 만들기 (2021-12-02)
- Catching Up on Machine Learning Concepts in an Hour / 1시간 만에 머신 러닝 개념 따라 잡기 (2021-11-22)
- End-to-End Machine Learning with Amazon SageMaker / Amazon SageMaker 통해 머신러닝 시작하기 (2021-11-22)
- 추천 시스템의 원리와 구축 사례 (2021-11-22)
- 추천 서비스를 위한 데이터 분석 시스템 구축하기 (2021-11-22)
- (Demo) Amazon Personalize로 추천 시스템 구축하기 (2021-11-05)
- deploy-langfuse-on-ecs-with-fargate: Provides AWS CDK code to deploy Langfuse, an open-source LLM engineering platform, on Amazon ECS with AWS Fargate.
- ocr-with-amazon-bedrock: This project is an OCR (Optical Character Recognition) web application that uses Amazon Bedrock for its character recognition capabilities. The application is deployed on Amazon ECS Fargate and the infrastructure is provisioned using the AWS CDK.
- rag-with-amazon-bedrock-and-kendra: A Question Answering application built with a Retrieval Augmented Generation (RAG) approach, using Amazon Bedrock and Amazon Kendra as the knowledge base.
- rag-with-amazon-bedrock-and-documentdb: A RAG-based Question Answering application using Large Language Models (LLMs) powered by Amazon Bedrock, with Amazon DocumentDB serving as the vector store.
- rag-with-amazon-bedrock-and-memorydb: A RAG-based Question Answering application using LLMs powered by Amazon Bedrock, with Amazon MemoryDB for Redis as the vector store.
- rag-with-amazon-bedrock-and-opensearch-serverless: A RAG-based Question Answering application using LLMs powered by Amazon Bedrock, with Amazon OpenSearch Serverless as the vector store.
- rag-with-amazon-bedrock-and-opensearch: A RAG-based Question Answering application using LLMs powered by Amazon Bedrock, with Amazon OpenSearch Service as the vector store.
- rag-with-amazon-bedrock-and-postgresql-using-pgvector: A RAG-based Question Answering application using LLMs powered by Amazon Bedrock, with Amazon Aurora PostgreSQL and pgvector as the vector store.
- rag-with-amazon-documentdb-and-sagemaker: A RAG-based Question Answering application using LLMs with Amazon SageMaker for model hosting and Amazon DocumentDB as the knowledge base.
- rag-with-amazon-kendra-and-sagemaker: A RAG-based Question Answering application using LLMs with Amazon SageMaker for model hosting and Amazon Kendra as the knowledge base.
- rag-with-amazon-memorydb-and-sagemaker: A RAG-based Question Answering application using LLMs with Amazon SageMaker for model hosting and Amazon MemoryDB for Redis as the vector store.
- rag-with-amazon-opensearch-and-sagemaker: A RAG-based Question Answering application using LLMs with Amazon SageMaker for model hosting and Amazon OpenSearch Service as the knowledge base.
- rag-with-amazon-opensearch-serverless-and-sagemaker: A RAG-based Question Answering application using LLMs with Amazon SageMaker and Amazon OpenSearch Serverless as the knowledge base.
- rag-with-amazon-postgresql-using-pgvector-and-sagemaker: A RAG-based Question Answering application using LLMs with Amazon SageMaker and Amazon Aurora PostgreSQL equipped with the pgvector extension for vector storage.
- rag-with-knowledge-bases-for-amazon-bedrock-using-aurora-postgresql: A RAG-based Question Answering application using LLMs powered by Amazon Bedrock, with Knowledge Bases for Amazon Bedrock and Amazon Aurora PostgreSQL as the vector store.
- rag-with-knowledge-bases-for-amazon-bedrock-using-L1-cdk-constructs: A RAG-based Question Answering application using LLMs powered by Amazon Bedrock, with Knowledge Bases for Amazon Bedrock created using L1 CDK constructs.
- rag-with-knowledge-bases-for-amazon-bedrock: A RAG-based Question Answering application using LLMs powered by Amazon Bedrock, with Knowledge Bases for Amazon Bedrock as the knowledge base.
- mcp-tutorial: A workshop for building and deploying Model Context Protocol (MCP) servers on local and AWS environments, and integrating them with AI applications. 👥
- amazon-emr-with-delta-lake: A guide to explore Delta Lake features on Amazon EMR, demonstrating how to read from and write to Delta tables.
- aws-analytics-immersion-day: A workshop to implement a Business Intelligence (BI) system using various AWS Analytics Services like Kinesis, S3, Athena, OpenSearch, and QuickSight.
- aws-athena-cqrs-pattern: An implementation of the CQRS (Command and Query Responsibility Segregation) pattern using Amazon Athena, where query execution results are delivered via email.
- aws-dms-cdc-data-pipeline: Implements an end-to-end data pipeline to replicate transactional data from MySQL to Amazon OpenSearch Service via Kinesis using AWS Database Migration Service (DMS).
- aws-dms-serverless-mysql-to-s3-migration: A data pipeline project that uses AWS DMS Serverless to migrate data from an Aurora MySQL database to Amazon S3.
- aws-dms-serverless-to-kinesis-data-pipeline: A data pipeline project that uses AWS DMS Serverless to stream data from an Aurora MySQL database to Amazon Kinesis Data Streams.
- aws-glue-streaming-etl-with-apache-iceberg: Creates a streaming ETL job in AWS Glue to build an updatable data lake on Amazon S3 using the Apache Iceberg format.
- aws-glue-streaming-etl-with-delta-lake: Creates a streaming ETL job in AWS Glue to build an updatable data lake on Amazon S3 using the Delta Lake format.
- aws-glue-streaming-ingestion-from-kafka-to-apache-iceberg: A collection of projects demonstrating how to ingest streaming data from Amazon MSK (Managed and Serverless) into Apache Iceberg tables on S3 using AWS Glue Streaming.
- aws-msk-cdc-data-pipeline-with-debezium: Implements a Change Data Capture (CDC) pipeline to replicate data from MySQL to S3 using Amazon MSK, MSK Connect, and Debezium.
- aws-msk-serverless-cdc-data-pipeline-with-debezium: Implements a CDC pipeline using Amazon MSK Serverless, MSK Connect, and Debezium to replicate data from MySQL to S3.
- aws-opensearch-ingestion-tutorials: A collection of example projects for Amazon OpenSearch Ingestion, showing data ingestion into both OpenSearch domains and serverless collections.
- opensearch-serverless-common-usage-patterns: A set of example projects for common Amazon OpenSearch Serverless use cases, including search, time-series analysis, and vector search.
- redshift-streaming-ingestion-patterns: A collection of CDK projects demonstrating how to load data from streaming services like Kinesis and MSK into Amazon Redshift.
- streaming-count-sketches-with-hyperloglog-in-amazon-memorydb: A project demonstrating how to efficiently count unique items in a data stream using the HyperLogLog algorithm in Amazon MemoryDB for Redis.
- streaming-data-pipeline-from-kafka-to-s3-using-aws-kinesis-firehose: Example projects for managed data delivery from Amazon MSK to Amazon S3 using Amazon Kinesis Data Firehose.
- transactional-datalake-using-amazon-datafirehose-iceberg: Implements a transactional data lake by ingesting CDC data from MySQL to S3 in Apache Iceberg format, using AWS DMS and Amazon Kinesis Data Firehose.
- transactional-datalake-using-amazon-msk-and-apache-iceberg-on-aws-glue: Implements a transactional data lake by ingesting CDC data from MySQL to S3 in Apache Iceberg format, using Amazon MSK Connect (with Debezium) and AWS Glue Streaming.
- transactional-datalake-using-amazon-msk-serverless-and-apache-iceberg-on-aws-glue: Implements a transactional data lake using Amazon MSK Serverless, MSK Connect (with Debezium), and AWS Glue Streaming to ingest CDC data from MySQL to S3 in Apache Iceberg format.
- transactional-datalake-using-apache-iceberg-on-aws-glue: Implements a transactional data lake by ingesting CDC data from MySQL to S3 in Apache Iceberg format, using AWS DMS and AWS Glue Streaming.
- web-analytics-on-aws: A simple web analytics system implemented with Amazon API Gateway, Kinesis Data Streams, Kinesis Data Firehose, S3, and Athena.
- zero-etl-architecture-patterns: A collection of CDK scripts for creating Amazon RDS zero-ETL integrations with Amazon Redshift Serverless to enable near real-time analytics.
- llava-next-video-model-on-sagemaker-endpoint: A CDK project to host the LLaVA-NeXT-Video model on an Amazon SageMaker endpoint for real-time and asynchronous inference.
- llava-on-aws-sagemaker: A CDK project to host the LLaVA model on an Amazon SageMaker real-time inference endpoint.
- video-llava-on-aws-sagemaker: A CDK project to host the Video-LLaVA model on an Amazon SageMaker real-time inference endpoint.
- mlflow-ec2-sagemaker: A CDK project to provision an MLflow server on an Amazon EC2 instance.
- mlflow-ecs-sagemaker: A CDK project to provision an MLflow server on Amazon ECS with AWS Fargate.
- whisper-model-hosting-on-sagemaker-endpoint: Provides CDK projects to host the OpenAI Whisper model on Amazon SageMaker for real-time and asynchronous inference.
- sagemaker-inference-component: A CDK project to deploy multiple foundation models to the same SageMaker endpoint using inference components.
- deepseek-on-sagemaker: Provides a set of CDK projects to host DeepSeek models on Amazon SageMaker.
- janus-pro-7b: A CDK project to host the Janus-Pro-7B model on an Amazon SageMaker real-time inference endpoint.
- lgai-exaone-on-sagemaker/exaone-deep-7_8b-sglang: A CDK project to host LG's EXAONE model on Amazon SageMaker.
- qwen-vl-on-sagemaker-endpoint/qwen2_5-vl-32b-sglang: A CDK project to host the Qwen-VL model on an Amazon SageMaker endpoint.
- scale-to-zero-sagemaker-endpoint: Demonstrates how to deploy a SageMaker endpoint that can scale down to zero instances to reduce costs.
- aws-realtime-image-analysis: A project for automatic image tagging and visualization using Amazon Rekognition, Lambda, Kinesis, and OpenSearch Service.
- social-graph-based-people-recommender-using-amazon-neptune-and-textract: A people recommendation service based on a social graph built with Amazon Neptune, using Amazon Textract to perform OCR on business cards to build the graph.
- saas-metering-system-on-aws: A demo project for a SaaS metering system that enables usage-based billing models by accurately tracking API usage.
- my-aws-cdk-examples: A comprehensive repository of
131AWS CDK Python examples with architecture diagrams for a wide range of frequently used AWS services.
👥: Co-authored