Business Problem
About Ledgebrook
Ledgebrook receives various submission and policy documents via email, which need to be classified as loss run or non-loss run, processed, and stored efficiently for underwriting and claims processing. goML automated the document processing workflow, eliminating manual efforts.
Solution
goML developed an automated document classification and storage system leveraging AWS cloud services:
1.
Document Ingestion & Storage: Documents received via email were automatically stored in an AWS S3 bucket, ensuring scalability and security. The pipeline starts by ingesting files from S3.
4.
Database Integration: Processed document details were stored in a PostgreSQL RDS database, while the documents themselves were vectorized and stored in OpenSearch.
2.
AI-Based Classification: AWS Textract was used to extract text from the files, and AWS Bedrock was leveraged to classify documents into loss run and non-loss run categories.
5.
Search & Retrieval: Indexed metadata in OpenSearch Serverless allowed underwriters to quickly retrieve documents based on classification.
3.
Token-Based Document Grouping: A unique aiDocumentSessionToken was assigned to each group of related files, enabling seamless tracking.