add garbage collection option #83

RexWzh · 2025-04-10T20:08:16Z

Add a record option to optimize memory usage. The reference-counting issue is fixed by @ohyeat as discussed here.

kim-em · 2025-04-10T23:59:38Z

What is this meant to do? This needs documentation!

RexWzh · 2025-04-11T05:13:31Z

I just updated the readme. Setting gc = true will discard the environment after execution, useful for memory management.

RexWzh · 2025-04-14T11:07:53Z

The code modifications for unsafe def repl are written by @ohyeat , replacing loops with tail recursion.

I've tested these changes using statements from MiniF2F. Here are the test samples:

{"cmd": "import Mathlib\nopen BigOperators Real Nat Topology Rat"}

-- tactic mode
{"cmd": theorem algebra_amgm_faxinrrp2msqrt2geq2mxm1div2x :\n    ∀ x > 0, 2 - Real.sqrt 2 ≥ 2 - x - 1 / (2 * x) := by sorry", "env": 0}

-- command mode
{"cmd": "theorem algebra_amgm_faxinrrp2msqrt2geq2mxm1div2x :\n    ∀ x > 0, 2 - Real.sqrt 2 ≥ 2 - x - 1 / (2 * x) := by", "env": 0}

-- gc for tactic mode
{"cmd": "theorem algebra_amgm_faxinrrp2msqrt2geq2mxm1div2x :\n    ∀ x > 0, 2 - Real.sqrt 2 ≥ 2 - x - 1 / (2 * x) := by sorry", "env": 0, "gc": true}

-- gc for command mode
{"cmd": "theorem algebra_amgm_faxinrrp2msqrt2geq2mxm1div2x :\n    ∀ x > 0, 2 - Real.sqrt 2 ≥ 2 - x - 1 / (2 * x) := by", "env": 0, "gc": true}

And the test results:

FrederickPu · 2025-04-15T15:11:56Z

we also want to avoid saving the proofSnapshots too

RexWzh · 2025-04-15T15:39:04Z

The proofSnapshots are essential for tactic mode - without them, the proofState id would be invalid when running commands in the format {"tactic": "xxx", "proofState":xxx}.

ADD:

However, the command that creates the proofSnapshot can be garbage-collected. See the green and yellow lines in the diagram above. I will add more tests later.

augustepoiroux · 2025-04-15T15:46:06Z

Just a question of aesthetics. I would prefer the option to be called discardState instead of gc.
Or maybe we do the opposite and call this option recordState or saveState, with true as default value.
Don't know which option is better ^^

Other than that, I think this is a useful feature 👍

RexWzh · 2025-04-15T15:50:22Z

Just a question of aesthetics. I would prefer the option to be called discardState instead of gc. Or maybe we do the opposite and call this option recordState or saveState, with true as default value. Don't know which option is better ^^

Other than that, I think this is a useful feature 👍

You mean discardCmd, right?

As for discardState, I think implementing something like deleteState might be more plausible. We would rarely create a state and discard it immediately.

FrederickPu · 2025-04-15T16:07:37Z

The proofSnapshots are essential for tactic mode - without them, the proofState id would be invalid when running commands in the format {"tactic": "xxx", "proofState":xxx}.

ADD:

However, the command that creates the proofSnapshot can be garbage-collected. See the green and yellow lines in the diagram above. I will add more tests later.

The whole point of garbage collecting a command is that you never interact with its state again. So you won't apply tactics on its proof state

RexWzh · 2025-04-15T17:16:49Z

@augustepoiroux I think using discard would be sufficient since it only applies to command mode.

with true as default value

We'd better set false as default to be compatibile with previous versions. For example, users need to set discard=false when first running import Mathlib :)

augustepoiroux · 2025-04-15T17:23:57Z

Nice 👍 I agree with you about keeping backward compatibiliy. Regarding

with true as default value

The full sentence is

Or maybe we do the opposite and call this option recordState or saveState, with true as default value.

It is just a change of point of view. Instead of discarding commands, we record them. My naming was bad, but the idea was essentially to replace gc by record and inverting all the values. I.e. gc=True <-> record=False. And in this case we would have record=True as the default value, which keeps the current behavior of the REPL ;)

RexWzh · 2025-04-15T17:24:38Z

@FrederickPu

The proofSnapshots are stored in proofStates and can exist independently of their corresponding env commands.

repl/REPL/Main.lean

Lines 61 to 74 in 506a6f3

    
           /-- The monadic state for the Lean REPL. -/ 
        
           structure State where 
        
             /-- 
        
             Environment snapshots after complete declarations. 
        
             The user can run a declaration in a given environment using `{"cmd": "def f := 37", "env": 17}`. 
        
             -/ 
        
             cmdStates : Array CommandSnapshot := #[] 
        
             /-- 
        
             Proof states after individual tactics. 
        
             The user can run a tactic in a given proof state using `{"tactic": "exact 42", "proofState": 5}`. 
        
             Declarations with containing `sorry` record a proof state at each sorry, 
        
             and report the numerical index for the recorded state at each sorry. 
        
             -/ 
        
             proofStates : Array ProofSnapshot := #[]

For verification, you can check the test cases in this PR:

https://github.com/leanprover-community/repl/pull/83/files

FrederickPu · 2025-04-15T17:28:47Z

then there should be an option to discard proof states as well. For example if you are doing proof repair, then you don't every use the tactic thing

RexWzh · 2025-04-17T18:21:46Z

For example if you are doing proof repair, then you don't every use the tactic thing

@FrederickPu I've implemented similar command mode features for tactic mode and am currently testing how these features reduce memory usage.

Also, note that setting record=false will discard states created by tactics, but not states created by sorry. For example, tactics like have : 1=1 := by sorry will still create additional states.

btw, deleting goals could disrupt the proofState id, so we should avoid that approach.

The whole point of garbage collecting a command is that you never interact with its state again.

You can still interact with the state even after garbage collecting the command. This could address the issue you posted earlier.

RexWzh · 2025-05-03T23:02:04Z

@kim-em I think this PR is ready.

Setting record=False enables garbage collection of environment snapshots. This helps reduce memory usage from continuously increasing, although commands like {"cmd":"import xxx"} cannot be garbage collected.

Additionally, it can serve as a health check command. For example:
{"cmd":"#check true", "record":false} can be used to verify if the REPL is alive without modifying the environment snapshots.

Thanks for your time and effort for reviewing these changes.

kim-em · 2025-05-12T04:19:37Z

test/Mathlib/test/record_cmd.expected.out

+{"sorries":
+ [{"proofState": 0,
+   "pos": {"line": 1, "column": 93},
+   "goal": "⊢ (2000 + 2001 + 2002 + 2003 + 2004 + 2005 + 2006) % 7 = 0",
+   "endPos": {"line": 1, "column": 98}}],
+ "messages":
+ [{"severity": "warning",
+   "pos": {"line": 1, "column": 8},
+   "endPos": {"line": 1, "column": 30},
+   "data": "declaration uses 'sorry'"}]}


I don't understand here: if we're generating proofState: 0 here, and it is actually usable, then the environment has been transitively captured, and the record: false wasn't really respected.

Rex said that you can garbage collect the command that created the proof snapshot without garbage collecting the proof snapshot itself. I'm not sure what the use case for this would be tho. Also I think having each proof snapshot connected to a command would make it easier to garbage collect all the snapshots after you are done with tactic mode for a particular problem.

You can check the green(garbage collected) and yellow lines in the diagram above. Here are the relevant commands:

# By default, this consumes about 83 MB [{'cmd': 'import Mathlib'}, {'cmd': 'theorem womp0 (a0 b c : Nat) : (a0 + b) + c = c + a0 + b := by sorry', 'env': 0}, {'cmd': 'theorem womp1 (a1 b c : Nat) : (a1 + b) + c = c + a1 + b := by sorry', 'env': 0}, # ... ] # With record=False, this consumes about 21 MB [{'cmd': 'import Mathlib'}, {'cmd': 'theorem womp0 (a0 b c : Nat) : (a0 + b) + c = c + a0 + b := by sorry', 'env': 0, 'record': False}, {'cmd': 'theorem womp1 (a1 b c : Nat) : (a1 + b) + c = c + a1 + b := by sorry', 'env': 0, 'record': False}, # ... ]

The ProofSnapshot is independent of the cmdSnapshot once created. The structure provides sufficient information for interactions and pickles.

structure ProofSnapshot where coreState : Core.State coreContext : Core.Context metaState : Meta.State metaContext : Meta.Context termState : Term.State termContext : Term.Context tacticState : Tactic.State tacticContext : Tactic.Context rootGoals : List MVarId

and the record: false wasn't really respected

We can view it this way: in Cmd mode, record is intended for cmdSnapshot-gc, while in Tactic mode, it's for proofSnapshot.
And it can be used as default config for tactic-based interactions, as the generated cmdSnapshots are rarely used since they contain sorry.

If someone wishes to obtain the state while discarding both the cmdSnapshot and proofSnapshot, they can use the File mode like:

{"cmd": "example : 1 = 1 := by", "env": 0, "record": false}

And then extract the states from errors.

btw, this feature is beneficial for File-mode-based interactions like itp-interfaces, where most generated cmdStates are wasted due to incomplete errors.

ahh so command snapshots are only necessary to create additional command states. Which rarely happens for a single proof. Like usually u would only want to snapshot import headers and stuff

desaxce

The code looks good, that's a nice feature, I don't see anything preventing the merge apart from refreshing the branch and tests.

desaxce · 2025-07-23T10:47:06Z

test/record_cmd2.expected.out

+   "pos": {"line": 3, "column": 0},
+   "endPos": {"line": 3, "column": 6},
+   "data": "Try this: exact h2 (h1 p)"},
+  {"severity": "info",


Can you please update your fork and update your branch to be on 4.22.0-rc3?
Then you can remove the "Goals accomplished!" below, it's not returned anymore.

Thanks for your review. I have merged the latest code and updated the test script.

=== Test Summary === Failed tests: ✗ record_cmd2 Total: 1 failed ==================

desaxce · 2025-07-25T10:07:32Z

@kim-em Do you think we could merge this development?
The new record parameter helps a lot limiting memory use on a REPL.

augustepoiroux · 2025-07-25T13:45:51Z

Can someone make a summary of what "record": false does exactly in each case?
From what I understood, if the command contains sorries, then the proof states are recorded, and the command state is transitively recorded as well, right?
Is there a way to not record a command state, regardless of whether it contains sorries?

FrederickPu · 2025-07-25T14:59:28Z

In the following example

{ "cmd": "import Mathlib" }
{"env": 0}
{ "cmd": "theorem womp : 2 + 2 = 4 := sorry", "env": 1}
{ "sorries" : [ {"proofState": 0, ....}] }

The proofState=0 is defined entirely in terms of the context env=0 so env=1 can be discarded entirely. I think that was @RexWzh 's idea. However, if the command for env=1 created new declarations I don't think this intuition would hold.

The only state that is transitively saved in the proofSnapshot is the Enviroment through Meta.Context or something like that. Ie all of the parsed information from the Command.json won't be included only the context necessary for creating the syntax of the tactic state for the proofsnapshot.

FrederickPu · 2025-07-25T15:00:21Z

I think it would be nice to have a way of discarding proofsnapshots once they are no longer being used. Ie you've finished with your proof search for a particular problem.

add gc option

4ab23f9

RexWzh mentioned this pull request Apr 10, 2025

LeanRepl terminates trying to run large amounts of commands on a single LeanRepl instance #77

Open

add gc option tests

8e88e86

RexWzh force-pushed the gc-option branch from 75ab6cb to 8e88e86 Compare April 10, 2025 20:49

update readme

cf63ddc

RexWzh force-pushed the gc-option branch from 124926a to cf63ddc Compare April 11, 2025 05:15

RexWzh changed the title ~~add gc option~~ add garbage collection option Apr 11, 2025

RexWzh added 2 commits April 14, 2025 18:24

Merge branch 'master' into gc-option

75643c6

@ohyeat: replace loop by tail recursion

bac41b9

RexWzh force-pushed the gc-option branch from 92bc0f9 to 97e8d6b Compare April 15, 2025 17:07

rename gc to discard

f867ab2

RexWzh force-pushed the gc-option branch from 97e8d6b to f867ab2 Compare April 15, 2025 17:19

RexWzh force-pushed the gc-option branch from 9d5547e to 84fc89d Compare April 15, 2025 17:45

rename discard to record

91abba5

RexWzh force-pushed the gc-option branch from 84fc89d to 91abba5 Compare April 15, 2025 17:54

record for proof states

edc5079

augustepoiroux mentioned this pull request Apr 24, 2025

server.run(lean_interact.Command(…)) starts failing after ~560 issued commands augustepoiroux/LeanInteract#6

Closed

FrederickPu mentioned this pull request Apr 28, 2025

Add batch processing to LeanRepl #93

Draft

RexWzh added 2 commits May 4, 2025 06:46

Merge branch 'master' into gc-option

4979b94

rename tests

b1cb4e7

kim-em reviewed May 12, 2025

View reviewed changes

kim-em added the awaiting-author label May 12, 2025

update test scripts

d374f1b

desaxce approved these changes Jul 23, 2025

View reviewed changes

RexWzh added 3 commits July 24, 2025 03:50

Merge remote-tracking branch 'origin/master' into gc-option

42d2bc7

Merge branch 'fix-tests' into gc-option

a66d357

fix error test

3d71944

add garbage collection option #83

Are you sure you want to change the base?

add garbage collection option #83

Uh oh!

Conversation

RexWzh commented Apr 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kim-em commented Apr 10, 2025

Uh oh!

RexWzh commented Apr 11, 2025

Uh oh!

RexWzh commented Apr 14, 2025

Uh oh!

FrederickPu commented Apr 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

RexWzh commented Apr 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

augustepoiroux commented Apr 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

RexWzh commented Apr 15, 2025

Uh oh!

FrederickPu commented Apr 15, 2025

Uh oh!

RexWzh commented Apr 15, 2025

Uh oh!

augustepoiroux commented Apr 15, 2025

Uh oh!

RexWzh commented Apr 15, 2025

Uh oh!

FrederickPu commented Apr 15, 2025

Uh oh!

RexWzh commented Apr 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

RexWzh commented May 3, 2025

Uh oh!

kim-em May 12, 2025

Choose a reason for hiding this comment

Uh oh!

FrederickPu May 12, 2025

Choose a reason for hiding this comment

Uh oh!

RexWzh May 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

RexWzh May 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

FrederickPu May 13, 2025

Choose a reason for hiding this comment

Uh oh!

desaxce left a comment

Choose a reason for hiding this comment

Uh oh!

desaxce Jul 23, 2025

Choose a reason for hiding this comment

Uh oh!

RexWzh Jul 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

desaxce commented Jul 25, 2025

Uh oh!

augustepoiroux commented Jul 25, 2025

Uh oh!

FrederickPu commented Jul 25, 2025

Uh oh!

FrederickPu commented Jul 25, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

RexWzh commented Apr 10, 2025 •

edited

Loading

FrederickPu commented Apr 15, 2025 •

edited

Loading

RexWzh commented Apr 15, 2025 •

edited

Loading

augustepoiroux commented Apr 15, 2025 •

edited

Loading

RexWzh commented Apr 17, 2025 •

edited

Loading

RexWzh May 12, 2025 •

edited

Loading

RexWzh May 12, 2025 •

edited

Loading

RexWzh Jul 23, 2025 •

edited

Loading