Commit Graph

18 Commits

Author SHA1 Message Date
Nils-Johan Gynther ea006e7fbe fix: increase retry delay for Mistral 429 to 3s*attempt
Test Suite / test (24.15.0) (push) Has been cancelled
2026-05-03 22:15:28 +02:00
Nils-Johan Gynther fa9bd141e0 fix: use require() for pdf-parse and pdfjs-dist legacy build to fix Node 24 compat
Test Suite / test (24.15.0) (push) Has been cancelled
2026-05-03 22:10:53 +02:00
Nils-Johan Gynther 6e9c588ae3 chore: Update Node.js version to 24.15.0 in CI and Dockerfile for consistency
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 21:48:49 +02:00
Nils-Johan Gynther 96833f0eea fix: improve pdf parsing with pdfjs-dist fallback
Test Suite / test (24.x) (push) Has been cancelled
2026-05-03 20:11:30 +02:00
Nils-Johan Gynther 19ef7a4ea5 feat: Integrate pdfjs-dist for improved PDF parsing fallback
Test Suite / test (24.x) (push) Has been cancelled
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 20:09:45 +02:00
Nils-Johan Gynther 99343f74af feat: Enhance PDF parsing with fallback to pdf-lib for improved handling of complex PDFs
Test Suite / test (24.x) (push) Has been cancelled
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 20:03:10 +02:00
Nils-Johan Gynther 8f9e1bac1e Refactor code structure for improved readability and maintainability
Test Suite / test (24.x) (push) Has been cancelled
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 19:55:37 +02:00
Nils-Johan Gynther c8bfde346c feat: Add function to ignore non-item lines in receipt parsing
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 18:18:14 +02:00
Nils-Johan Gynther a0bedcc9a9 feat: Update multipack handling in quantity rules and extraction logic
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 16:16:13 +02:00
Nils-Johan Gynther 504cf1c947 feat: Exclude monetary and weight tokens from name extraction in receipt parsing
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 14:11:25 +02:00
Nils-Johan Gynther f270f8510e feat: Enhance receipt parsing with additional preprocessing functions and improved AI response handling
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 11:19:58 +02:00
Nils-Johan Gynther fa27c4d4de feat: Enhance receipt parsing with rule-based line interpretation and AI fallback for complex cases
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 10:47:54 +02:00
Nils-Johan Gynther 3a4d73da18 feat: Enhance quantity rules for receipt parsing and update prompt structure 2026-05-03 07:36:27 +02:00
Nils-Johan Gynther fe67804bfc refactor: Update import paths and remove redundant DocumentServiceModule 2026-04-30 20:08:11 +02:00
Nils-Johan Gynther a5f00cee9e feat: Add receipt parsing functionality with image and PDF support, including Tesseract OCR integration 2026-04-30 20:00:25 +02:00
Nils-Johan Gynther 409d0b6b06 Refactor and enhance document and recipe import functionality
- Removed the QuickImport module and service to streamline the codebase.
- Introduced DocumentImport module, service, and controller for handling PDF uploads and conversions to Markdown.
- Added PdfParser for parsing PDF files and extracting text.
- Implemented DocumentParser as an abstract class for future document parsers.
- Created a new GenericRecipeParser and IcaRecipeParser for handling recipe data from various sources.
- Developed QuickImportService to manage recipe scraping from URLs.
- Enhanced error handling and validation for file uploads and input processing.
- Updated project documentation with a structured plan for future development and improvements.
2026-04-23 15:54:14 +02:00
Nils-Johan Gynther e18bf79395 feat: Implement PDF document import functionality with Markdown conversion
- Added DocumentImportModule, DocumentImportController, and DocumentImportService for handling PDF uploads.
- Integrated pdf-parse for extracting text from PDF files.
- Created PdfParser for parsing PDF documents and converting them to Markdown format.
- Updated frontend to support file uploads via drag-and-drop and file input for PDF documents.
- Modified API routes to handle document import requests.
- Enhanced error handling for unsupported file types and file size limits.
- Updated README to reflect new features and usage instructions.
2026-04-12 18:57:40 +02:00
Nils-Johan Gynther 1608eb4d70 Initial microservice-importer setup with NestJS backend and Next.js frontend 2026-04-12 16:58:23 +02:00