Feature request: include the original problem descriptions in docstrings for the theorem statements #196

eric-wieser · 2024-08-06T18:07:12Z

It's quite hard to spot misformalizations when reading the formalization alone.
It would be great if the $$\LaTeX$$ description could be included in a docstring (/-- doc -/ in lean, I can't speak for other systems) for the theorems, to make mistakes easier to spot and correct.

The text was updated successfully, but these errors were encountered:

LasseBlaauwbroek · 2024-08-06T18:09:59Z

Seconded for Coq: (** latex *).

eric-wieser · 2024-08-06T23:18:29Z

Probably also a good idea to have a script that extracts the docstrings from each language, and checks in CI that they are all consistent.

amit9oct · 2024-08-08T03:32:27Z

Thank you for opening this issue. I agree that something like this will make it easy for the repository maintainers to check the correctness of the formalization itself.

We are planning to add a tool (probably a web page on our main website) that will grab the three formalizations along with informal statements from the GitHub repository and then show them side-by-side. We can also build a local version of this so that the maintainers can see these on their local machines.

The reason for not adding comments in the file is that it will make it hard to maintain the same informal statement in two places, and might create problems when one gets modified but the other does not. If we create a tool for this, then we will not have to deal with any consistency issues which may arise. This will also keep parsing of formalizations simple, for example, if someone wants to just use the formal statement and not use informal statements for their AI tests they will not have to parse and remove the informal statement from the file.

eric-wieser · 2024-08-09T00:44:59Z

The reason for not adding comments in the file is that it will make it hard to maintain the same informal statement in two places, and might create problems when one gets modified but the other does not.

My recommendation here would be:

Write a script for each of Coq, Lean, and Isabelle that extracts the doc comment to a json file, mapping problem names to docstrings. I suggest separate scripts since you might find that it is best to parse the Coq code with Coq, the Lean code with Lean, etc.
Add a test in github's CI that verifies the outputs of these three scripts are identical

CI is a fantastic tool for enforcing consistency.

eric-wieser · 2024-09-23T11:31:47Z

CI is a fantastic tool for enforcing consistency.

#216 adds the docstrings to all the lean files, and uses CI to ensure they are the same as the ones in the json file.

eric-wieser mentioned this issue Sep 23, 2024

Add the docstrings to the Lean files. #216

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature request: include the original problem descriptions in docstrings for the theorem statements #196

Feature request: include the original problem descriptions in docstrings for the theorem statements #196

eric-wieser commented Aug 6, 2024 •

edited

Loading

LasseBlaauwbroek commented Aug 6, 2024

eric-wieser commented Aug 6, 2024

amit9oct commented Aug 8, 2024

eric-wieser commented Aug 9, 2024

eric-wieser commented Sep 23, 2024

Feature request: include the original problem descriptions in docstrings for the theorem statements #196

Feature request: include the original problem descriptions in docstrings for the theorem statements #196

Comments

eric-wieser commented Aug 6, 2024 • edited Loading

LasseBlaauwbroek commented Aug 6, 2024

eric-wieser commented Aug 6, 2024

amit9oct commented Aug 8, 2024

eric-wieser commented Aug 9, 2024

eric-wieser commented Sep 23, 2024

eric-wieser commented Aug 6, 2024 •

edited

Loading