Render MathML in QTI questions generated by Studio #13668

rtibbles · 2025-08-21T22:32:06Z

Summary

When Studio generates QTI questions that contain formulae, we use the MathML semantics element to preserve the original LaTeX of the expression.
Unfortunately, DOMPurify sanitizes out semantics elements because of XSS vulnerabilities associated with their annotation child elements.
We allow the semantics element to be retained here, but don't allow the annotation elements to be preserved - this partly defeats the point of preserving the LaTeX as an annotation, but if we do need it in future, we can do some more precise filtering to allow that through while still disallowing any other encoding.
Adds some basic styling for MathML.
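
As an illustration of the allow/forbid split described above, here is a minimal sketch. `ADD_TAGS` and `FORBID_TAGS` are DOMPurify configuration keys, but the exact config used in this PR is not shown in the thread, so the values below are an assumption. `MathNode` and `filterTree` are hypothetical stand-ins that mimic the effect on a simplified tree so the example runs without a DOM. They are not DOMPurify's implementation, and DOMPurify's handling of a removed element's content depends on further options that this sketch does not model.

```typescript
// Assumed shape of the sanitizer options; the real Kolibri config may differ.
// With DOMPurify, an object like this would be passed as the second argument to sanitize().
const mathmlSanitizeConfig = {
  ADD_TAGS: ['semantics'],                       // keep the MathML wrapper element
  FORBID_TAGS: ['annotation', 'annotation-xml'], // explicitly drop the XSS-prone children
};

// Hypothetical, simplified element tree used only for illustration.
interface MathNode {
  tag: string;
  text?: string;
  children: MathNode[];
}

// Remove forbidden elements together with their subtrees; keep everything else.
function filterTree(node: MathNode, forbidden: readonly string[]): MathNode | null {
  if (forbidden.indexOf(node.tag) !== -1) {
    return null;
  }
  const children: MathNode[] = [];
  for (const child of node.children) {
    const kept = filterTree(child, forbidden);
    if (kept !== null) {
      children.push(kept);
    }
  }
  return { ...node, children };
}

// Stands in for:
// <math><semantics><mi>x</mi><annotation encoding="application/x-tex">x</annotation></semantics></math>
const formula: MathNode = {
  tag: 'math',
  children: [
    {
      tag: 'semantics',
      children: [
        { tag: 'mi', text: 'x', children: [] },
        { tag: 'annotation', text: 'x', children: [] },
      ],
    },
  ],
};

const sanitized = filterTree(formula, mathmlSanitizeConfig.FORBID_TAGS);
```

In this sketch the semantics wrapper and its presentation child survive, while the annotation carrying the LaTeX source is dropped, which is the trade-off the summary describes.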

References

Reviewer guidance

Import a survey with a free response question and formula content into Kolibri and observe that the formula is now properly rendered.

Open question - the alternative here is to just remove the annotation completely from the Studio side - I don't think allowing semantics in this way opens us up to an XSS, but we would have to do some extra work to specifically allow the LaTeX annotation to be retained during sanitization.
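
One way that extra work could look, purely as a hypothetical sketch: keep an annotation child only when its encoding marks it as TeX source and it contains nothing but text. The `AnnotationLike` type, the `isSafeLatexAnnotation` helper, and the choice of `application/x-tex` as the accepted encoding are all assumptions made for illustration. None of this is code from the PR or part of DOMPurify.

```typescript
// Hypothetical, simplified view of an <annotation> element.
interface AnnotationLike {
  tag: string;
  encoding: string | null;   // value of the encoding attribute, if present
  childElementCount: number; // number of element children (0 means text only)
}

// Assumed encoding for LaTeX source; the value Studio emits is not stated in this thread.
const LATEX_ENCODING = 'application/x-tex';

// Accept only text-only annotations with the assumed LaTeX encoding;
// reject annotation-xml, missing encodings, and any other encoding.
function isSafeLatexAnnotation(node: AnnotationLike): boolean {
  return (
    node.tag.toLowerCase() === 'annotation' &&
    node.encoding !== null &&
    node.encoding.trim().toLowerCase() === LATEX_ENCODING &&
    node.childElementCount === 0
  );
}
```

How such a predicate would be wired into the sanitizer is left open here; the point is only that the check can be narrow enough to admit the LaTeX annotation and nothing else.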

github-actions · 2025-08-21T23:05:57Z

Build Artifacts

Asset type	Download link
PEX file	kolibri-0.18.2a0.dev0_git.2.g06bf89d2.pex
Windows Installer (EXE)	kolibri-0.18.2a0.dev0+git.2.g06bf89d2-windows-setup-unsigned.exe
Debian Package	kolibri_0.18.2a0.dev0+git.2.g06bf89d2-0ubuntu1_all.deb
Mac Installer (DMG)	kolibri-0.18.2a0.dev0+git.2.g06bf89d2.dmg
Android Package (APK)	kolibri-0.18.2a0.dev0+git.2.g06bf89d2-0.1.5-debug.apk
Raspberry Pi Image	kolibri-pi-image-0.18.2a0.dev0+git.2.g06bf89d2.zip
TAR file	kolibri-0.18.2a0.dev0+git.2.g06bf89d2.tar.gz
WHL file	kolibri-0.18.2a0.dev0+git.2.g06bf89d2-py2.py3-none-any.whl

rtibbles · 2025-09-05T21:35:22Z

Would be good to get QA verification here - I don't think the code is likely to change much.

pcenov · 2025-09-08T14:29:55Z

@rtibbles which Studio environment should I use to test this one? Publishing a channel with a Survey that has a Free response question is still not working properly on Hotfixes...

nucleogenesis

Overall the updates look reasonable.

I did a cursory search for what 'semantics' refers to as a string passed to sanitize (i.e., does this mean all "semantic" HTML elements in general, or something specific to DOMPurify?)

Open question - the alternative here is to just remove the annotation completely from the Studio side - I don't think allowing semantics in this way opens us up to an XSS, but we would have to do some extra work to specifically allow the LaTeX annotation to be retained during sanitization.

Is this approach something that is more work for likely marginal (if any) security benefit?

@marcellamaki noted that we'd like this to be QA'd soon so I'm approving it - I don't suspect answers to my questions above will result in change requests so should be good to make available to QA

rtibbles · 2025-09-15T19:09:03Z

Is this approach something that is more work for likely marginal (if any) security benefit?

It would basically involve rolling back the additional LaTeX annotation I added on the Studio side (whereby the MathML is rendered in such a way that we keep the source LaTeX as an annotation on the MathML). We might need this to support browsers that lack MathML support, or possibly as a hint to an editor. I do think the current security risk is nil, simply because we are only allowing the MathML semantics element and not the annotation element (which is the vector for the injection attack).

marcellamaki

Similar to #13738, I have tested this with the dev environment sandbox (since we are having issues with hotfixes, which prevent the QA team from testing). MathML is rendering correctly!

Allow semantics tag in MathML, but avoid XSS issues by not allowing a…

9700da5

…ny child annotation elements still. Add some basic styling for MathML.

github-actions bot added the SIZE: very small label Aug 21, 2025

rtibbles added this to the Kolibri 0.18: Planned Patch 3 milestone Aug 22, 2025

rtibbles assigned nucleogenesis Aug 26, 2025

nucleogenesis approved these changes Sep 15, 2025

marcellamaki approved these changes Sep 15, 2025

marcellamaki merged commit c7ede7a into learningequality:release-v0.18.x Sep 15, 2025
53 checks passed

marcellamaki mentioned this pull request Sep 15, 2025

QTI question display MathML formulae are not displayed #13667

Closed
