
Conversation

FrederickPu

Able to process multiple commands at once using:

{ "cmds": ["theorem womp : 2 + 2 = 4 := by rfl", "#eval 0 = 2"]}

Also added multithreaded parallelism using the `Task` monad, as well as an option to garbage collect command snapshots (this overlaps with #83); see the sketch at the end of this comment.

Batch commands have timeouts to prevent one bad proof from stalling the batch, which overlaps with #92.
Timeouts are in milliseconds and can be set using an option:

{ "cmds": ["theorem womp : 2 + 2 = 4 := by rfl", "#eval 0 = 2"], "timeout": 6000}

Right now, there is no command snapshotting for batch commands.
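
For illustration, here is a minimal Lean sketch of the Task-based idea (not the PR's actual code; `runBatch` and `runCmd` are hypothetical stand-ins for whatever processes a single REPL command):

```lean
-- Sketch only, not the PR's implementation. `runCmd` is a hypothetical
-- stand-in for the function that elaborates one REPL command string.
def runBatch (runCmd : String → IO String) (cmds : List String) :
    IO (List (Except IO.Error String)) := do
  -- spawn one task per command so they can be processed in parallel
  let tasks ← cmds.mapM fun cmd => IO.asTask (runCmd cmd)
  -- `Task.get` blocks until a task is done, so this collects all results in order
  return tasks.map (·.get)
```

A per-command timeout can then be layered on top of each task; see the discussion of #92 further down.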

@FrederickPu
Author

See README.md for more details about all of the available config options for batch commands.

@augustepoiroux
Contributor

Amazing feature, thanks for sharing it!
A few comments:

  • There is a typo: parrallel -> parallel
  • I see you pushed some Python code and text files for testing. I don't have a strong opinion on mixing the REPL with Python code, but maybe not everyone would agree; just telling you in case ^^

@FrederickPu
Author

The Python code is just for benchmarking; I can get rid of it (and generate a corresponding .in file) before we merge.

@vadimkantorov

One useful option could also be something along the lines of "skip processing elements if an earlier example got verified okay". This can be useful for verifying multiple proofs of the same problem produced by whole-proof completion LLMs (e.g. 32, 256, or several thousand roll-outs). Sometimes we are already satisfied once a single proof has been found, and to save time, all further proofs of the given problem can be skipped.

@augustepoiroux
Contributor

augustepoiroux commented Apr 29, 2025

I agree this can be a useful feature. This can be implemented with `IO.waitAny` and then cancelling the other jobs (hoping the jobs will cooperate, though). Never mind, it is not that straightforward, since the first job to finish may fail.
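
For what it's worth, a rough sketch of that idea (only an illustration under assumptions, nothing from this PR): each candidate proof is modelled as a hypothetical `IO Bool` job where `true` means the proof checked, every task is tagged with its index so we can tell which one finished first, and a failed first finisher is dropped so we keep waiting on the rest.

```lean
-- Sketch only. `attempts` are hypothetical `IO Bool` jobs, `true` = proof accepted.
partial def waitFirstSuccess
    (pending : List (Nat × Task (Except IO.Error (Nat × Bool)))) : IO Bool := do
  match pending with
  | [] => return false                          -- every attempt failed
  | (_, t) :: rest =>
    -- block until *some* still-pending task finishes
    let first ← IO.waitAny (t :: rest.map Prod.snd)
    match first with
    | .ok (_, true) =>
      pending.forM fun (_, t') => IO.cancel t'  -- a proof succeeded: cancel the rest
      return true
    | .ok (i, false) =>
      -- the caveat above: the first task to finish was a failed attempt,
      -- so drop it and keep waiting on the others
      waitFirstSuccess (pending.filter fun (j, _) => j != i)
    | .error _ => return false                  -- unreachable: errors are caught below

def firstSuccess (attempts : List (IO Bool)) : IO Bool := do
  let mut tagged : List (Nat × Task (Except IO.Error (Nat × Bool))) := []
  let mut i := 0
  for act in attempts do
    let idx := i
    let t ← IO.asTask do
      -- catch exceptions so a crashed attempt just counts as a failure
      try
        let ok ← act
        pure (idx, ok)
      catch _ =>
        pure (idx, false)
    tagged := (idx, t) :: tagged
    i := i + 1
  waitFirstSuccess tagged
```

Note that `IO.cancel` only sets a cancellation flag, so the running attempts still have to cooperate (e.g. by checking `IO.checkCanceled`) for the cancellation to have any effect.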

@FrederickPu
Author

I think that to do this efficiently you would need to poll each pending task at regular intervals to see if it has yielded a successful result, kind of like I do for the `withTimeout` function.
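
For comparison, a polling loop might look roughly like this (again just a sketch, not the PR's `withTimeout`; the tasks are assumed to be already spawned with `IO.asTask` and to return `true` for a successful proof):

```lean
-- Sketch of the polling approach: check the tasks every few milliseconds and
-- stop as soon as one of them has finished with a successful result.
partial def pollFirstSuccess (tasks : List (Task (Except IO.Error Bool)))
    (intervalMs : UInt32 := 50) : IO Bool := do
  let mut anyPending := false
  for t in tasks do
    if !(← IO.hasFinished t) then
      anyPending := true
    else if let .ok true := t.get then
      -- `t` has finished, so `t.get` returns immediately
      tasks.forM IO.cancel            -- a proof succeeded: cancel whatever is still running
      return true
  if !anyPending then
    return false                      -- all attempts finished, none succeeded
  IO.sleep intervalMs                 -- nothing yet: sleep and poll again
  pollFirstSuccess tasks intervalMs
```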

@augustepoiroux
Contributor

This sounds like a good practical approach. I wonder if there is a more efficient way of doing this, though.
For example, regarding timeouts, I think I prefer the approach in #92: `IO.waitAny` is used to terminate as soon as either the command or the timeout finishes, instead of checking at regular intervals.
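
Roughly, the race-based timeout can be sketched like this (in the spirit of #92, not its actual code):

```lean
-- Sketch: race the command against a sleep task and take whichever finishes first.
def withTimeoutRace (act : IO α) (timeoutMs : UInt32) : IO (Option α) := do
  let job ← IO.asTask act
  let sleeper ← IO.asTask (do IO.sleep timeoutMs)
  -- give both tasks the same result type, then wait for the first one to finish
  let winner ← IO.waitAny [job.map some, sleeper.map (fun _ => none)]
  match winner with
  | some (.ok a)    => return some a   -- the command finished within the timeout
  | some (.error e) => throw e         -- the command itself failed
  | none =>
    -- the timer won: request cancellation (cooperative) and report the timeout
    IO.cancel job
    return none
```

The sleeping task occupies a worker thread for the whole timeout, so in practice one would probably want to spawn it with `Task.Priority.dedicated`; and, as noted above, `IO.cancel` is only cooperative.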

@vadimkantorov

> One useful option could also be something along the lines of "skip processing elements if an earlier example got verified okay".

It could also be good to process the batch in micro-batches (when configured) so that CPU utilization can be maximized. When I ran the REPL, I typically got around 10% utilization of a single CPU core, so bringing that up would be useful.

Another question is multi-core processing. Then maybe proofs should be annotated with a problem name, so that micro-batches from different problems can be processed simultaneously.
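
As an illustration of that last point (names and shapes purely hypothetical, nothing from this PR): group the attempts by problem, give every problem its own coordinator task, and within a problem stop at the first accepted proof, e.g. reusing the `firstSuccess` sketch above.

```lean
-- Sketch: `problems` maps a problem name to its candidate-proof jobs
-- (`IO Bool`, `true` = proof accepted). Each problem runs concurrently.
def runProblems (problems : List (String × List (IO Bool))) :
    IO (List (String × Bool)) := do
  let running ← problems.mapM fun (name, attempts) => do
    let t ← IO.asTask (firstSuccess attempts)
    return (name, t)
  -- wait for every problem's coordinator and record whether a proof was found
  return running.map fun (name, t) =>
    (name, t.get.toOption.getD false)
```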

@augustepoiroux
Contributor

If you look at the README file in this PR, I think @FrederickPu already implemented that through the "buckets" parameter.

@FrederickPu
Author

> If you look at the README file in this PR, I think @FrederickPu already implemented that through the "buckets" parameter.

When running in parallel mode on a VM with 300+ cores I max out at a 7x speedup, so I'm not sure core-level parallelism is being achieved.

@FrederickPu
Author

Should we do thread-level parallelism within each bucket?

@kim-em marked this pull request as draft May 1, 2025 15:17
{ "cmds": ["theorem womp : 2 + 2 = 4 := by rfl", "theorem womp1 : 2 + 4 = 6 := by rfl"]}
```

All the same options from Command can be used and will be applied to each command in the `cmds` array. Additionally, you can specify the parrallelism mode using `mode`

parallel has 1 r.
