Commit Graph

25 Commits

Author SHA1 Message Date
Nils-Johan Gynther 96833f0eea fix: improve pdf parsing with pdfjs-dist fallback
Test Suite / test (24.x) (push) Has been cancelled
2026-05-03 20:11:30 +02:00
Nils-Johan Gynther 19ef7a4ea5 feat: Integrate pdfjs-dist for improved PDF parsing fallback
Test Suite / test (24.x) (push) Has been cancelled
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 20:09:45 +02:00
Nils-Johan Gynther 99343f74af feat: Enhance PDF parsing with fallback to pdf-lib for improved handling of complex PDFs
Test Suite / test (24.x) (push) Has been cancelled
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 20:03:10 +02:00
Nils-Johan Gynther 723730fd2f feat: Update documentation with new receipt categorization rules and test infrastructure details
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 19:58:04 +02:00
Nils-Johan Gynther 8f9e1bac1e Refactor code structure for improved readability and maintainability
Test Suite / test (24.x) (push) Has been cancelled
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 19:55:37 +02:00
Nils-Johan Gynther 7bc785e8e0 ci: add automated test workflow
Test Suite / test (24.x) (push) Has been cancelled
2026-05-03 19:55:09 +02:00
Nils-Johan Gynther c8bfde346c feat: Add function to ignore non-item lines in receipt parsing
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 18:18:14 +02:00
Nils-Johan Gynther a5f6ebe6ff feat: Update documentation with recent improvements and status for microservice importer
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 17:03:25 +02:00
Nils-Johan Gynther a0bedcc9a9 feat: Update multipack handling in quantity rules and extraction logic
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 16:16:13 +02:00
Nils-Johan Gynther 504cf1c947 feat: Exclude monetary and weight tokens from name extraction in receipt parsing
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 14:11:25 +02:00
Nils-Johan Gynther f270f8510e feat: Enhance receipt parsing with additional preprocessing functions and improved AI response handling
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 11:19:58 +02:00
Nils-Johan Gynther fa27c4d4de feat: Enhance receipt parsing with rule-based line interpretation and AI fallback for complex cases
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 10:47:54 +02:00
Nils-Johan Gynther 5879712a7f feat: Update README and next steps with Mistral AI model details and parsing rules for receipts
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 10:02:31 +02:00
Nils-Johan Gynther 841d098405 feat: Optimize AI usage in import function with structured workflow and caching strategies
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 09:54:44 +02:00
Nils-Johan Gynther 3a4d73da18 feat: Enhance quantity rules for receipt parsing and update prompt structure 2026-05-03 07:36:27 +02:00
Nils-Johan Gynther 1bb0eedd4f Update to .md-files 2026-05-01 00:27:30 +02:00
Nils-Johan Gynther fe67804bfc refactor: Update import paths and remove redundant DocumentServiceModule 2026-04-30 20:08:11 +02:00
Nils-Johan Gynther a5f00cee9e feat: Add receipt parsing functionality with image and PDF support, including Tesseract OCR integration 2026-04-30 20:00:25 +02:00
Nils-Johan Gynther a8ef23c77b feat: Update long-term goal for receipt import to reflect current backend implementation and future scalability considerations
Co-authored-by: Copilot <copilot@github.com>
2026-04-30 13:25:22 +02:00
Nils-Johan Gynther f6c9c3949c feat: Add long-term goal for structured receipt import with parser implementation
Co-authored-by: Copilot <copilot@github.com>
2026-04-30 12:48:25 +02:00
Nils-Johan Gynther 9db794a3be docs: Update README to emphasize cross-platform coding practices 2026-04-24 15:20:15 +02:00
Nils-Johan Gynther 409d0b6b06 Refactor and enhance document and recipe import functionality
- Removed the QuickImport module and service to streamline the codebase.
- Introduced DocumentImport module, service, and controller for handling PDF uploads and conversions to Markdown.
- Added PdfParser for parsing PDF files and extracting text.
- Implemented DocumentParser as an abstract class for future document parsers.
- Created a new GenericRecipeParser and IcaRecipeParser for handling recipe data from various sources.
- Developed QuickImportService to manage recipe scraping from URLs.
- Enhanced error handling and validation for file uploads and input processing.
- Updated project documentation with a structured plan for future development and improvements.
2026-04-23 15:54:14 +02:00
Nils-Johan Gynther e18bf79395 feat: Implement PDF document import functionality with Markdown conversion
- Added DocumentImportModule, DocumentImportController, and DocumentImportService for handling PDF uploads.
- Integrated pdf-parse for extracting text from PDF files.
- Created PdfParser for parsing PDF documents and converting them to Markdown format.
- Updated frontend to support file uploads via drag-and-drop and file input for PDF documents.
- Modified API routes to handle document import requests.
- Enhanced error handling for unsupported file types and file size limits.
- Updated README to reflect new features and usage instructions.
2026-04-12 18:57:40 +02:00
Nils-Johan Gynther a1a4f9beb3 Update README with comprehensive architecture and API documentation 2026-04-12 17:08:48 +02:00
Nils-Johan Gynther 1608eb4d70 Initial microservice-importer setup with NestJS backend and Next.js frontend 2026-04-12 16:58:23 +02:00