Desktop-assistant-AI

Desktop-assistant-AI is an AI-powered desktop assistant designed to help users, especially coders, when they are unsure what to do next. It originally intended to take screenshots from PC or screen recording and provide it to the AI for analysis. It is not currently a useful product.

Platform: Windows only

Features

AI-powered help for coding and general desktop tasks
Screenshot capture and context-aware assistance
Integration with OpenAI (ChatGPT) and Whisper for speech recognition
Text-to-speech responses
Secure model loading with progress feedback
Modern PyQt5 GUI

Installation

This project uses a PowerShell script to automate the setup process. It will check for the required Python version (3.11), create a virtual environment, and install all the necessary dependencies.

Download or clone this repository.
Run the setup script:

Open a PowerShell terminal and run the following command:
```
.\setup.ps1
```
The script will guide you through the setup process. If you don't have Python 3.11 installed, it will offer to install it for you from the Microsoft Store.
Run the application:

Once the setup is complete, you can launch the application with:
```
.\run.ps1
```

See also: gotchas.md for troubleshooting common installation issues.

Usage

After installation, launch the assistant using run.ps1. The app will show a loading screen while the Whisper and Coqui TTS models load, then present the main window for interaction.

Developer Setup

To set up a development environment, simply follow the installation instructions above. The setup.ps1 script will create a self-contained virtual environment in the .venv directory, which you can use for development.

Project Structure

src/ — Main source code
resources/ — Images and logos
run.bat, compile.bat — Windows scripts for running and compiling
download_dependencies.sh — Dependency installer for Linux

Security Notes

See The nature of the security vulnerability.md for details on a Powershell script parser vulnerability related to speculative execution in batch scripts. This project is designed with security in mind, but always review scripts before running.

License

MIT License (see LICENSE file if present)

Built With

PyQt5 - The GUI framework used
OpenAI - For ChatGPT and Whisper integration
Coqui TTS - For text-to-speech
PyAudio - For audio I/O
Silero VAD - For voice activity detection

Screenshots

v0.2 - PyQt5 GUI with CoquiTTS.

v0.1 - Command line with pyttsx3

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
resources		resources
shell security vulnerability		shell security vulnerability
src		src
.gitignore		.gitignore
Desktop-assistant-AI.code-workspace		Desktop-assistant-AI.code-workspace
README.md		README.md
compile.sh		compile.sh
compile_test.ps1		compile_test.ps1
download_dependencies.sh		download_dependencies.sh
gotchas.md		gotchas.md
grit_installer		grit_installer
requirements.txt		requirements.txt
run.ps1		run.ps1
setup.ps1		setup.ps1
v0.1_screenshot_20241120_123944.png		v0.1_screenshot_20241120_123944.png
v0.2_Screenshot_2025-08-14_235842.png		v0.2_Screenshot_2025-08-14_235842.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Desktop-assistant-AI

Features

Installation

Usage

Developer Setup

Project Structure

Security Notes

License

Built With

Screenshots

About

Uh oh!

Releases

Packages

Languages

AMDphreak/Desktop-Assistant-AI

Folders and files

Latest commit

History

Repository files navigation

Desktop-assistant-AI

Features

Installation

Usage

Developer Setup

Project Structure

Security Notes

License

Built With

Screenshots

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages