bpo-37374: Do not escape quotes in minidom inside text segments #14312

mitar · 2019-06-22T21:09:32Z

Addressing the issue: https://bugs.python.org/issue37374


mangrisano · 2019-06-22T21:30:45Z

Hi and thank you for the pull request.
Just one question: did you test the change?

mitar · 2019-06-22T22:39:02Z

Yes, I tested by running:

from xml.etree import ElementTree
text = ElementTree.Element('text')
text.text = 'f&oo"b<a>r'
xml_string = ElementTree.tostring(text)
import xml.dom.minidom as minidom
xml_tree = minidom.parseString(xml_string)
output = xml_tree.toprettyxml(indent='  ')
assert output.splitlines()[1] == xml_string.decode('utf8')
print(output)

which now returns:

<?xml version="1.0" ?>
<text>f&amp;oo"b&lt;a&gt;r</text>

I also ran the whole Python test suite.

mangrisano · 2019-06-22T22:43:33Z

Perhaps you should provide it in the test_xml module.
I'm not a core-dev so feel free to not follow my suggestion. :)

mitar · 2019-06-23T21:09:44Z

You are right. I should add the test.

In fact, this PR makes minidom behave exactly the same as ElementTree.tostring, which does not escape quotes inside text segments either.
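
As a minimal sketch of the ElementTree behaviour referred to here: `ElementTree.tostring` escapes `"` inside attribute values but leaves it untouched inside text. The attribute name `a` and its value are made up for illustration and are not part of the PR.

```python
from xml.etree import ElementTree

# Same text as in the test run above, plus a hypothetical attribute
# whose value contains a double quote.
elem = ElementTree.Element('text', {'a': 'x"y'})
elem.text = 'f&oo"b<a>r'

# tostring() returns bytes by default (us-ascii, no XML declaration).
serialized = ElementTree.tostring(elem)
print(serialized)
# b'<text a="x&quot;y">f&amp;oo"b&lt;a&gt;r</text>'
```

The quote is escaped only where it could terminate the attribute value; in text content it has no special meaning, so it is written as-is.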

mitar · 2021-10-21T22:15:13Z

I have added tests.

scoder

Thank you for the PR. However, I've rejected the ticket, so I have to reject the PR, too.

I'll leave my review comments anyway, in case it gets reconsidered at some later point.

scoder · 2023-08-13T06:33:02Z

Lib/test/test_minidom.py

+    def testQuoteEscape(self):
+        text = ElementTree.Element('text')
+        text.text = 'f&oo"b<a>r'
+        xml_string = ElementTree.tostring(text)


We should not rely on an unrelated library providing the same text string serialisation. Instead, the test should use a plain string to compare against. Otherwise, changes in ElementTree could make this test fail without need.

scoder · 2023-08-13T06:35:52Z

Lib/xml/dom/minidom.py

    def writexml(self, writer, indent="", addindent="", newl=""):
-        _write_data(writer, "%s%s%s" % (indent, self.data, newl))
+        _write_text_data(writer, "%s%s%s" % (indent, self.data, newl))


I haven't checked deeply, but my gut feeling would prefer a special case for attribute values, not for text content. Can't say if that's similarly easy to achieve.
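
One way to picture that alternative is a single escaping helper with a flag that turns on quote escaping only for attribute values. This is a standalone sketch, not minidom's actual code; the function name `escape_data` and its `attr` parameter are assumptions made for illustration.

```python
def escape_data(text, attr=False):
    """Escape text for XML output; quotes are escaped only for attributes."""
    # "&" must be replaced first so the entities added below are not
    # escaped a second time.
    text = text.replace("&", "&amp;")
    text = text.replace("<", "&lt;")
    text = text.replace(">", "&gt;")
    if attr:
        # Only attribute values are delimited by double quotes.
        text = text.replace('"', "&quot;")
    return text


print(escape_data('f&oo"b<a>r'))             # text content
print(escape_data('f&oo"b<a>r', attr=True))  # attribute value
```

Text content keeps the default path and the attribute writer opts in to the extra replacement, which is the direction the review comment leans towards.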

the-knights-who-say-ni added the CLA signed label Jun 22, 2019

bedevere-bot added the awaiting review label Jun 22, 2019

mitar added 3 commits October 21, 2021 23:30

bpo-37374: Do not escape quotes in minidom inside text segments.

3b73f3c

Adding NEWS entry.

60ad399

Adding tests.

393923f

mitar force-pushed the fix-issue-37374 branch from df9c8b1 to 393923f Compare October 21, 2021 22:14

ezio-melotti removed the CLA signed label Jul 13, 2022

mitar mannequin mentioned this pull request Aug 12, 2023

Minidom does not have to escape quote inside text segments #81555

Closed

scoder reviewed Aug 13, 2023


scoder closed this Aug 13, 2023
