Skip to content
@sunblaze-ucb

sunblaze-ucb

Popular repositories Loading

  1. Intuitor Intuitor Public

    Code for the paper: "Learning to Reason without External Rewards"

    Python 357 41

  2. rl-generalization rl-generalization Public

    Modifiable OpenAI Gym environments for studying generalization in RL

    Python 87 14

  3. cybergym cybergym Public

    CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on real-world vulnerability analysis tasks.

    Python 72 13

  4. dpml-benchmark dpml-benchmark Public

    This repository contains the codes for first large-scale investigation of Differentially Private Convex Optimization algorithms.

    Python 63 18

  5. blackbox-attacks blackbox-attacks Public

    Code used in 'Exploring the Space of Black-box Attacks on Deep Neural Networks' (https://arxiv.org/abs/1712.09491)

    Python 61 13

  6. Virgo Virgo Public

    C++ 59 17

Repositories

Showing 10 of 47 repositories
  • sunblaze-ucb/rl-grok-recipe’s past year of commit activity
    7 0 0 0 Updated Oct 3, 2025
  • awesome-RLVR-boundary Public

    A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Language Models (LLMs).

    sunblaze-ucb/awesome-RLVR-boundary’s past year of commit activity
    45 2 0 0 Updated Oct 3, 2025
  • VMDT Public
    sunblaze-ucb/VMDT’s past year of commit activity
    Python 0 Apache-2.0 0 0 0 Updated Oct 2, 2025
  • AgentSynth Public

    AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents

    sunblaze-ucb/AgentSynth’s past year of commit activity
    Python 32 Apache-2.0 2 2 0 Updated Sep 25, 2025
  • cybergym Public

    CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents on real-world vulnerability analysis tasks.

    sunblaze-ucb/cybergym’s past year of commit activity
    Python 72 Apache-2.0 13 0 1 Updated Sep 24, 2025
  • verina Public

    Verina (Verifiable Code Generation Arena) is a high-quality benchmark enabling a comprehensive and modular evaluation of code, specification, and proof generation as well as their compositions.

    sunblaze-ucb/verina’s past year of commit activity
    Lean 26 Apache-2.0 4 0 0 Updated Sep 21, 2025
  • progent Public
    sunblaze-ucb/progent’s past year of commit activity
    Python 17 8 1 1 Updated Sep 11, 2025
  • mirage-bench Public
    sunblaze-ucb/mirage-bench’s past year of commit activity
    Python 4 Apache-2.0 1 0 0 Updated Aug 22, 2025
  • sunblaze-ucb/llm-code-security’s past year of commit activity
    HTML 1 0 0 0 Updated Aug 12, 2025
  • Intuitor Public

    Code for the paper: "Learning to Reason without External Rewards"

    sunblaze-ucb/Intuitor’s past year of commit activity
    Python 357 41 10 0 Updated Jul 10, 2025

Most used topics

Loading…