Nils-Johan Gynther
2dc8aa4fb4
perf: skip Mistral AI for PDF lines that lack numeric value (header/footer/junk)
Test Suite / test (24.15.0) (push) Has been cancelled
2026-05-03 22:21:44 +02:00
Nils-Johan Gynther
ea006e7fbe
fix: increase retry delay for Mistral 429 to 3s*attempt
Test Suite / test (24.15.0) (push) Has been cancelled
2026-05-03 22:15:28 +02:00
Nils-Johan Gynther
fa9bd141e0
fix: use require() for pdf-parse and pdfjs-dist legacy build to fix Node 24 compat
Test Suite / test (24.15.0) (push) Has been cancelled
2026-05-03 22:10:53 +02:00
Nils-Johan Gynther
a0ac8b6084
Merge branch 'main' of ssh://gitea.gynther.se:2222/nilsjohan/microservice-importer
Test Suite / test (24.15.0) (push) Has been cancelled
2026-05-03 21:59:41 +02:00
Nils-Johan Gynther
6e9c588ae3
chore: Update Node.js version to 24.15.0 in CI and Dockerfile for consistency
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 21:48:49 +02:00
nilsjohan
bc6702b3e9
chore: pin Node 24.15.0 in Docker and CI
Test Suite / test (24.15.0) (push) Has been cancelled
2026-05-03 21:47:50 +02:00
Nils-Johan Gynther
cef8ee4b25
feat: Add Node.js version parity to next steps for consistent development environment
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 21:37:55 +02:00
Nils-Johan Gynther
96833f0eea
fix: improve pdf parsing with pdfjs-dist fallback
Test Suite / test (24.x) (push) Has been cancelled
2026-05-03 20:11:30 +02:00
Nils-Johan Gynther
19ef7a4ea5
feat: Integrate pdfjs-dist for improved PDF parsing fallback
Test Suite / test (24.x) (push) Has been cancelled
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 20:09:45 +02:00
Nils-Johan Gynther
99343f74af
feat: Enhance PDF parsing with fallback to pdf-lib for improved handling of complex PDFs
Test Suite / test (24.x) (push) Has been cancelled
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 20:03:10 +02:00
Nils-Johan Gynther
723730fd2f
feat: Update documentation with new receipt categorization rules and test infrastructure details
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 19:58:04 +02:00
Nils-Johan Gynther
8f9e1bac1e
Refactor code structure for improved readability and maintainability
Test Suite / test (24.x) (push) Has been cancelled
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 19:55:37 +02:00
Nils-Johan Gynther
7bc785e8e0
ci: add automated test workflow
Test Suite / test (24.x) (push) Has been cancelled
2026-05-03 19:55:09 +02:00
Nils-Johan Gynther
c8bfde346c
feat: Add function to ignore non-item lines in receipt parsing
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 18:18:14 +02:00
Nils-Johan Gynther
a5f6ebe6ff
feat: Update documentation with recent improvements and status for microservice importer
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 17:03:25 +02:00
Nils-Johan Gynther
a0bedcc9a9
feat: Update multipack handling in quantity rules and extraction logic
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 16:16:13 +02:00
Nils-Johan Gynther
504cf1c947
feat: Exclude monetary and weight tokens from name extraction in receipt parsing
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 14:11:25 +02:00
Nils-Johan Gynther
f270f8510e
feat: Enhance receipt parsing with additional preprocessing functions and improved AI response handling
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 11:19:58 +02:00
Nils-Johan Gynther
fa27c4d4de
feat: Enhance receipt parsing with rule-based line interpretation and AI fallback for complex cases
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 10:47:54 +02:00
Nils-Johan Gynther
5879712a7f
feat: Update README and next steps with Mistral AI model details and parsing rules for receipts
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 10:02:31 +02:00
Nils-Johan Gynther
841d098405
feat: Optimize AI usage in import function with structured workflow and caching strategies
Co-authored-by: Copilot <copilot@github.com>
2026-05-03 09:54:44 +02:00
Nils-Johan Gynther
3a4d73da18
feat: Enhance quantity rules for receipt parsing and update prompt structure
2026-05-03 07:36:27 +02:00
Nils-Johan Gynther
1bb0eedd4f
Update to .md-files
2026-05-01 00:27:30 +02:00
Nils-Johan Gynther
fe67804bfc
refactor: Update import paths and remove redundant DocumentServiceModule
2026-04-30 20:08:11 +02:00
Nils-Johan Gynther
a5f00cee9e
feat: Add receipt parsing functionality with image and PDF support, including Tesseract OCR integration
2026-04-30 20:00:25 +02:00
Nils-Johan Gynther
a8ef23c77b
feat: Update long-term goal for receipt import to reflect current backend implementation and future scalability considerations
Co-authored-by: Copilot <copilot@github.com>
2026-04-30 13:25:22 +02:00
Nils-Johan Gynther
f6c9c3949c
feat: Add long-term goal for structured receipt import with parser implementation
Co-authored-by: Copilot <copilot@github.com>
2026-04-30 12:48:25 +02:00
Nils-Johan Gynther
9db794a3be
docs: Update README to emphasize cross-platform coding practices
2026-04-24 15:20:15 +02:00
Nils-Johan Gynther
409d0b6b06
Refactor and enhance document and recipe import functionality
- Removed the QuickImport module and service to streamline the codebase.
- Introduced DocumentImport module, service, and controller for handling PDF uploads and conversions to Markdown.
- Added PdfParser for parsing PDF files and extracting text.
- Implemented DocumentParser as an abstract class for future document parsers.
- Created a new GenericRecipeParser and IcaRecipeParser for handling recipe data from various sources.
- Developed QuickImportService to manage recipe scraping from URLs.
- Enhanced error handling and validation for file uploads and input processing.
- Updated project documentation with a structured plan for future development and improvements.
2026-04-23 15:54:14 +02:00
Nils-Johan Gynther
e18bf79395
feat: Implement PDF document import functionality with Markdown conversion
- Added DocumentImportModule, DocumentImportController, and DocumentImportService for handling PDF uploads.
- Integrated pdf-parse for extracting text from PDF files.
- Created PdfParser for parsing PDF documents and converting them to Markdown format.
- Updated frontend to support file uploads via drag-and-drop and file input for PDF documents.
- Modified API routes to handle document import requests.
- Enhanced error handling for unsupported file types and file size limits.
- Updated README to reflect new features and usage instructions.
2026-04-12 18:57:40 +02:00
Nils-Johan Gynther
a1a4f9beb3
Update README with comprehensive architecture and API documentation
2026-04-12 17:08:48 +02:00
Nils-Johan Gynther
1608eb4d70
Initial microservice-importer setup with NestJS backend and Next.js frontend
2026-04-12 16:58:23 +02:00