Skip to content

Conversation

Rahban1
Copy link
Contributor

@Rahban1 Rahban1 commented Jun 26, 2025

I have updated the tokenizer and improved on the custom trimmer. This aim to close #1457 and close #2114

@mortenpi mortenpi added Type: Enhancement Format: HTML Related to the default HTML output labels Jun 29, 2025
@Rahban1
Copy link
Contributor Author

Rahban1 commented Jul 16, 2025

These are the benchmarks results after these changes in the tokenizer :
Screenshot 2025-07-16 at 3 26 33 PM

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Format: HTML Related to the default HTML output Type: Enhancement
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Search can't find e.g. __init__ Search often gives insufficient results
3 participants