Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

✨ feat(font & formula): More accurate recognition #75

Merged
merged 11 commits into from
Feb 10, 2025
Merged

✨ feat(font & formula): More accurate recognition #75

merged 11 commits into from
Feb 10, 2025

Conversation

awwaawwa
Copy link
Member

No description provided.

- Introduce optional ascent and descent float attributes in document IL schema
- Update schema definitions across Python, RNC, RNG, and XSD formats
- Provides additional font metric information for more detailed typography representation
- ensure ascent and descent attributes are correctly assigned from font object
- fix potential rendering issues due to incorrect font attribute mapping
- Enhance font subsetting configuration handling
- Add conditional checks for font subsetting and mono processing
- Ensure more flexible font processing during PDF generation
- Add comprehensive font metrics for multiple font variants
- Implement dynamic font metrics assignment for different font types
- Utilize functools.lru_cache for optimizing font character length calculations
…ing boxes

- Add descent calculation for both vertical and horizontal text orientations
- Adjust character bounding boxes by incorporating font descent metrics
- Enhance font rendering accuracy by correctly applying descent offsets
- Add flake8 configuration in pyproject.toml
- Set specific ignore rules for code style exceptions
- Configure max line length to 88 characters
…graphs

- Implement comprehensive descent removal for characters across different text compositions
- Add support for handling descent in vertical and horizontal text orientations
- Enhance paragraph box adjustment based on most common descent values
- Optimize font retrieval using lru_cache for improved performance
- copy page_font_map to page_xobj_font_map for each xobj
- ensure all fonts are available in xobj context
- Modify line width from 0.5 to 1 for improved visibility
- Minor adjustment to graphic rendering parameters
- Modify get_layout method to use _get_layout as a helper method
- Implement more robust layout detection with multiple position modes
- Prioritize 'isolate_formula' layout when available
- Simplify layout selection logic with fallback mechanisms
@awwaawwa awwaawwa merged commit a5e7c0b into main Feb 10, 2025
10 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant