Conversation

@BedirT BedirT commented Aug 2, 2025

Summary

  • enable alternating two-color token highlight via new Token View option
  • reduce model selection to model families and add Gemini family
  • document new feature in README and changelog
  • bump version to 1.3.0

Testing

  • npm run lint
  • npm test (fails: the command runs but produces no output)

@BedirT BedirT requested a review from Copilot August 2, 2025 21:22
@BedirT BedirT linked an issue Aug 2, 2025 that may be closed by this pull request
Copilot AI left a comment

Pull Request Overview

This PR adds a token view feature for visual token highlighting and simplifies model selection by grouping models into families rather than individual models. It also adds support for Google's Gemini model family.

  • Introduces a Token View feature that highlights tokens with alternating colors for better visualization
  • Simplifies model selection by organizing models into families (OpenAI, Anthropic, Google) instead of individual models
  • Adds support for Google Gemini model family with appropriate tokenizer mapping

Reviewed Changes

Copilot reviewed 4 out of 5 changed files in this pull request and generated 5 comments.

File             | Description
src/extension.js | Implements token highlighting functionality and reorganizes model selection from individual models to families
package.json     | Updates version to 1.3.0
README.md        | Documents new Token View feature and model family selection changes
CHANGELOG.md     | Records version 1.3.0 changes

'gpt-4o': 'gpt-4o',
'gpt-4': 'gpt-4',
'gpt-3.5': 'gpt-3.5-turbo',
'claude-4*': 'claude-3.7*',
Copilot AI Aug 2, 2025

The mapping for 'claude-4*' points to 'claude-3.7*' instead of a corresponding Claude-4 model. This appears to be incorrect and could cause confusion about which model is actually being used for tokenization.

Suggested change
'claude-4*': 'claude-3.7*',
'claude-4*': 'claude-4*',
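
The wildcard keys in the mapping above imply a pattern-matching resolution step. A minimal sketch of how such a family pattern could resolve to a tokenizer key — the helper name `resolveTokenizerModel` and the lookup logic are illustrative assumptions, not the extension's actual implementation:

```javascript
// Illustrative sketch: resolve a model-family pattern such as 'claude-4*'
// to a concrete tokenizer key. Exact matches win; otherwise wildcard
// prefixes are tried. Table entries mirror the diff above.
const FAMILY_TO_TOKENIZER = {
  'gpt-4o': 'gpt-4o',
  'gpt-4': 'gpt-4',
  'gpt-3.5': 'gpt-3.5-turbo',
  'claude-4*': 'claude-4*',
};

function resolveTokenizerModel(model) {
  if (FAMILY_TO_TOKENIZER[model]) return FAMILY_TO_TOKENIZER[model];
  for (const [pattern, target] of Object.entries(FAMILY_TO_TOKENIZER)) {
    if (pattern.endsWith('*') && model.startsWith(pattern.slice(0, -1))) {
      return target;
    }
  }
  return null; // unknown family; caller falls back to a default
}
```

Checking exact matches before wildcards keeps 'gpt-4o' from being swallowed by a hypothetical 'gpt-4*' pattern.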

'claude-4*': 'claude-3.7*',
'claude-3.7*': 'claude-3.7*',
'claude-3.5*': 'claude-3.5*',
'gemini*': 'cl100k_base'
Copilot AI Aug 2, 2025

The Gemini model family is mapped to 'cl100k_base' which is an OpenAI tokenizer, not a Google tokenizer. This will produce incorrect token counts for Gemini models and should use an appropriate Google tokenizer or approximation method.
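
Since no local Gemini tokenizer ships with the extension, one possible fallback — an assumption for illustration, not what this PR implements — is a character-based heuristic (roughly 4 characters per token for English-like text), clearly labeled as approximate in the UI:

```javascript
// Illustrative fallback only: approximate Gemini token counts with a
// character heuristic (~4 characters per token for English-like text).
// This is an assumption for the sketch, not Google's tokenizer; exact
// counts require Google's countTokens API.
function approximateGeminiTokens(text) {
  if (!text) return 0;
  return Math.ceil(text.length / 4);
}
```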

currentModel = model;

if (currentProvider === 'openai') {
if (currentProvider === 'openai' || currentProvider === 'google') {
Copilot AI Aug 2, 2025

Google models are being treated the same as OpenAI models for tokenization, but they should use different tokenizers. Google models will incorrectly use OpenAI's encoding_for_model function instead of an appropriate Google tokenizer.
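
Rather than merging the branches, a per-provider dispatch would keep the strategies separate. A hypothetical sketch — the strategy names and structure are placeholders, not the extension's code:

```javascript
// Hypothetical dispatch sketch: choose a tokenization strategy per
// provider instead of funnelling Google through OpenAI's
// encoding_for_model. Strategy names here are illustrative.
function tokenizerStrategy(provider) {
  switch (provider) {
    case 'openai':
      return 'tiktoken';       // OpenAI models: tiktoken encodings
    case 'anthropic':
      return 'anthropic';      // Claude models: @anthropic-ai/tokenizer
    case 'google':
    default:
      return 'approximation';  // no local Gemini tokenizer available
  }
}
```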

Comment on lines +139 to +143
const { getTokenizer } = require('@anthropic-ai/tokenizer');
const tok = getTokenizer();
tokens = Array.from(tok.encode(text, 'all'));
decode = t => Buffer.from(tok.decode_single_token_bytes(t)).toString();
tok.free();
Copilot AI Aug 2, 2025

The Anthropic tokenizer is being required and initialized on every token highlighting call. This should be moved to module-level or cached to avoid repeated imports and initializations.

Suggested change
const { getTokenizer } = require('@anthropic-ai/tokenizer');
const tok = getTokenizer();
tokens = Array.from(tok.encode(text, 'all'));
decode = t => Buffer.from(tok.decode_single_token_bytes(t)).toString();
tok.free();
if (!anthropicTokenizer) {
    const { getTokenizer } = require('@anthropic-ai/tokenizer');
    anthropicTokenizer = getTokenizer();
}
tokens = Array.from(anthropicTokenizer.encode(text, 'all'));
decode = t => Buffer.from(anthropicTokenizer.decode_single_token_bytes(t)).toString();
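
The caching the review suggests is a standard lazy-initialization pattern. A self-contained sketch with a stub factory standing in for `getTokenizer()` — the counter is instrumentation for the sketch only, not part of the fix:

```javascript
// Generic lazy-initialization sketch of the suggested caching.
// `createTokenizer` is a stand-in stub; in the extension it would be
// getTokenizer() from @anthropic-ai/tokenizer, created at most once.
let cachedTokenizer = null;
let initCount = 0; // instrumentation for the sketch only

function createTokenizer() {
  initCount += 1;
  // Stub tokenizer: encodes a string as its character codes.
  return { encode: text => Array.from(text, c => c.charCodeAt(0)) };
}

function getCachedTokenizer() {
  if (!cachedTokenizer) {
    cachedTokenizer = createTokenizer();
  }
  return cachedTokenizer;
}
```

Repeated calls reuse the same instance, so the expensive construction runs once per session instead of once per highlight pass.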

const tok = getTokenizer();
tokens = Array.from(tok.encode(text, 'all'));
decode = t => Buffer.from(tok.decode_single_token_bytes(t)).toString();
tok.free();
Copilot AI Aug 2, 2025

The tokenizer is freed immediately after use, but the decode function created on line 142 still references it. This will cause errors when the decode function is called later in the loop.

Suggested change
tok.free();
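
The hazard being flagged — a closure outliving the freed tokenizer — can be demonstrated with a stub. This sketch is illustrative (the stub mimics a WASM-backed tokenizer that throws once freed); the fix is simply to call `free()` only after every decode has run:

```javascript
// Sketch of the use-after-free hazard: the decode closure still holds
// the tokenizer, so free() must run only after all decode calls.
// makeStubTokenizer mimics a native-backed tokenizer for illustration.
function makeStubTokenizer() {
  let freed = false;
  return {
    decode(t) {
      if (freed) throw new Error('tokenizer used after free');
      return String.fromCharCode(t);
    },
    free() { freed = true; },
  };
}

function decodeAll(tokens) {
  const tok = makeStubTokenizer();
  const decode = t => tok.decode(t);
  const out = tokens.map(decode); // all decoding happens here...
  tok.free();                     // ...and free() only afterwards
  return out;
}
```

Calling `tok.free()` before the `map` would make every `decode` throw; deferring it, as above, is safe.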

Development

Successfully merging this pull request may close these issues.

Highlight tokens