ivrit

Python package providing wrappers around ivrit.ai's capabilities.

Installation

pip install ivrit

Usage

Audio Transcription

The ivrit package provides audio transcription functionality using multiple engines.

Basic Usage

import ivrit

# Transcribe a local audio file
model = ivrit.load_model(engine="faster-whisper", model="ivrit-ai/whisper-large-v3-turbo-ct2")
result = model.transcribe(path="audio.mp3")

# With custom device
model = ivrit.load_model(engine="faster-whisper", model="ivrit-ai/whisper-large-v3-turbo-ct2", device="cpu")
result = model.transcribe(path="audio.mp3")

print(result["text"])

Transcribe from URL

# Transcribe audio from a URL
model = ivrit.load_model(engine="faster-whisper", model="ivrit-ai/whisper-large-v3-turbo-ct2")
result = model.transcribe(url="https://example.com/audio.mp3")

print(result["text"])

Streaming Results

# Get results as a stream (generator)
model = ivrit.load_model(engine="faster-whisper", model="base")
for segment in model.transcribe(path="audio.mp3", stream=True, verbose=True):
    print(f"{segment.start:.2f}s - {segment.end:.2f}s: {segment.text}")

# Or use the model directly
model = ivrit.FasterWhisperModel(model="base")
for segment in model.transcribe(path="audio.mp3", stream=True):
    print(f"{segment.start:.2f}s - {segment.end:.2f}s: {segment.text}")

# Access word-level timing
for segment in model.transcribe(path="audio.mp3", stream=True):
    print(f"Segment: {segment.text}")
    for word in segment.extra_data.get('words', []):
        print(f"  {word['start']:.2f}s - {word['end']:.2f}s: '{word['word']}'")

API Reference

`load_model()`

Load a transcription model for the specified engine and model.

Parameters

engine (str): Transcription engine to use. Options: "faster-whisper", "stable-ts"
model (str): Model name for the selected engine
device (str, optional): Device to use for inference. Default: "auto". Options: "auto", "cpu", "cuda", "cuda:0", etc.
model_path (str, optional): Custom path to the model (for faster-whisper)

Returns

TranscriptionModel object that can be used for transcription

Raises

ValueError: If the engine is not supported
ImportError: If required dependencies are not installed

Architecture

The ivrit package uses an object-oriented design with a base TranscriptionModel class and specific implementations for each transcription engine.

Model Classes

TranscriptionModel: Abstract base class for all transcription models
FasterWhisperModel: Implementation for the Faster Whisper engine

Usage Patterns

Pattern 1: Using `load_model()` (Recommended)

# Step 1: Load the model
model = ivrit.load_model(engine="faster-whisper", model="base")

# Step 2: Transcribe audio
result = model.transcribe(path="audio.mp3")

Pattern 2: Direct Model Creation

# Create model directly
model = ivrit.FasterWhisperModel(model="base")

# Use the model
result = model.transcribe(path="audio.mp3")

Multiple Transcriptions

For multiple transcriptions, load the model once and reuse it:

# Load model once
model = ivrit.load_model(engine="faster-whisper", model="base")

# Use for multiple transcriptions
result1 = model.transcribe(path="audio1.mp3")
result2 = model.transcribe(path="audio2.mp3")
result3 = model.transcribe(path="audio3.mp3")

Installation

Basic Installation

pip install ivrit

With Faster Whisper Support

pip install ivrit[faster-whisper]

Supported Engines

faster-whisper

Fast and accurate speech recognition using the Faster Whisper model.

Model Class: FasterWhisperModel

Available Models: base, large, small, medium, large-v2, large-v3

Features:

Word-level timing information
Language detection with confidence scores
Support for custom devices (CPU, CUDA, etc.)
Support for custom model paths
Streaming transcription

Dependencies: faster-whisper>=1.1.1

stable-ts

Stable and reliable transcription using Stable-TS models.

Status: Not yet implemented

Development

Installation for Development

git clone <repository-url>
cd ivrit
pip install -e ".[dev]"

Running Tests

pytest

Code Formatting

black .
isort .

License

MIT License - see LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
ivrit		ivrit
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ivrit

Installation

Usage

Audio Transcription

Basic Usage

Transcribe from URL

Streaming Results

API Reference

`load_model()`

Parameters

Returns

Raises

Architecture

Model Classes

Usage Patterns

Pattern 1: Using `load_model()` (Recommended)

Pattern 2: Direct Model Creation

Multiple Transcriptions

Installation

Basic Installation

With Faster Whisper Support

Supported Engines

faster-whisper

stable-ts

Development

Installation for Development

Running Tests

Code Formatting

License

About

Uh oh!

Releases

Packages

Languages

noamgot/ivrit-py

Folders and files

Latest commit

History

Repository files navigation

ivrit

Installation

Usage

Audio Transcription

Basic Usage

Transcribe from URL

Streaming Results

API Reference

load_model()

Parameters

Returns

Raises

Architecture

Model Classes

Usage Patterns

Pattern 1: Using load_model() (Recommended)

Pattern 2: Direct Model Creation

Multiple Transcriptions

Installation

Basic Installation

With Faster Whisper Support

Supported Engines

faster-whisper

stable-ts

Development

Installation for Development

Running Tests

Code Formatting

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

`load_model()`

Pattern 1: Using `load_model()` (Recommended)

Packages