Don't use XSLT in report generator #909

JukkaL · 2015-10-11T22:13:08Z

XSLT is not a widely used technology and pretty complex (and I don't want to learn it). We shouldn't rely on it in the report generator. Some reasonable alternatives include straight XML manipulation or JSON.

o11c · 2015-10-12T02:28:51Z

The thought behind my use of XSLT is: the primary report is now the machine-readable XML file. It is up to the user to decide how they want to view that - say, combine a python-cov report with a mypy report in a single report. For that motivating example, it is impossible for mypy to produce a meaningful HTML file, so the fact that XML files can be viewed directly with XSLT is a huge bonus (of course, nothing requires it, but I think it's very valuable that we ship an example XSLT file).

Plus (ignoring a little of the txt hackery which I did just to prove how flexible XSLT is, and some percentage calculations which we could precompute if we really had to), all the XSLT we use is pretty simple "get this part of the XML file, put it here in the output HTML file".
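For comparison, the "get this part of the XML file, put it here in the output HTML file" step can also be done with straight XML manipulation from the standard library. This is only a minimal sketch: the `<report>`/`<file>` element names and the `precise`/`imprecise` attributes are hypothetical stand-ins, not mypy's actual report schema, and `report_to_html` is an illustrative helper rather than anything in mypy.

```python
import xml.etree.ElementTree as ET
from html import escape

# Hypothetical report layout; mypy's real XML schema may differ.
SAMPLE_REPORT = """\
<report>
  <file name="example.py" precise="8" imprecise="2"/>
  <file name="util.py" precise="5" imprecise="0"/>
</report>
"""


def report_to_html(xml_text: str) -> str:
    """Render one HTML table row per <file> element in the report."""
    root = ET.fromstring(xml_text)
    rows = []
    for file_node in root.iter("file"):
        precise = int(file_node.get("precise", "0"))
        imprecise = int(file_node.get("imprecise", "0"))
        total = precise + imprecise
        # The percentage is computed here instead of inside a stylesheet.
        percent = 100.0 * imprecise / total if total else 0.0
        name = escape(file_node.get("name", ""))
        rows.append("<tr><td>{}</td><td>{:.1f}%</td></tr>".format(name, percent))
    return "<table>\n" + "\n".join(rows) + "\n</table>"


html_output = report_to_html(SAMPLE_REPORT)
print(html_output)
```

The trade-off is that the HTML view lives in Python code rather than in a stylesheet that a browser can apply to the XML file directly.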

Besides, I'm way more active than you are anyway.

It's true that XSLT got less popular with the rise of languages like Markdown, but those are generally not machine-readable with arbitrary attributes like XML is.

And XSLT actually works quite similarly to CSS, which lots of people know. (Possibly some CSS guru could even display the XML files directly without XSLT, but that doesn't mean it's a good idea).

JukkaL · 2015-10-12T03:12:56Z

I agree that technically XSLT does the job. It's just that I don't like it :-P

Let's wait for more feedback -- maybe people are happy with it. First we should document how the report generator works, though.

JukkaL · 2015-10-12T03:15:23Z

Added #912 for documenting the report generator.

refi64 · 2015-10-12T13:56:30Z

Is there a reason you can't just use JSON? It's much easier (and faster) to parse.

o11c · 2015-10-12T17:01:15Z

If you're parsing it yourself, you're doing it wrong, use a library. You should treat XML (or JSON) as an opaque format that is only readable by special tools.

JSON has a number of problems (limited unicode support, no integers, no standard schema tools, very poor tooling in general) that XML does not have.

refi64 · 2015-10-12T17:10:01Z

> If you're parsing it yourself, you're doing it wrong, use a library.

Exactly. I know of several languages that have built-in JSON parsers but not XML parsers. Writing an efficient XML parser is hard.

> limited unicode support

...yeah, you got me on this one. ;)

> no integers

Huh? Aren't normal numbers good enough?

> no standard schema tools

Not quite sure what this means...

> very poor tooling in general

Does JSON even need that much tooling to begin with, though?

o11c · 2015-10-12T18:16:46Z

> > If you're parsing it yourself, you're doing it wrong, use a library.
>
> Exactly. I know of several languages that have built-in JSON parsers but not XML parsers. Writing an efficient XML parser is hard.

Languages, just like end developers, shouldn't be writing their own JSON or XML parsers, but linking to a C implementation. JSON is merely a more tempting target.

> > no integers
>
> Huh? Aren't normal numbers good enough?

No. Floats are evil.

> > no standard schema tools
>
> Not quite sure what this means...

When mypy produces a report, it verifies that the XML nodes follow a certain structure. Besides preventing bugs in the generator, this allows tools that consume mypy XML files to automatically verify that they really are handling everything correctly.

While there are a number of JSON schema tools, none is anywhere near even a de-facto standard.

> > very poor tooling in general
>
> Does JSON even need that much tooling to begin with, though?

A data language is useless without tooling.

Show me a way to combine a python-cov report with a mypy report for the same file. It's easily possible with XSLT.

refi64 · 2015-10-12T18:37:15Z

> Languages, just like end developers, shouldn't be writing their own JSON or XML parsers, but linking to a C implementation. JSON is merely a more tempting target.

...a C implementation which is almost always more painful. Not even Expat supports the entirety of XML. The parsers that support everything are huge and slow as heck.

> No. Floats are evil.

Why? When doing arithmetic operations with integers stored as floats, the precision is perfectly fine.
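As a rough illustration of where the two positions meet: integers stored as IEEE 754 doubles are exact only up to 2**53, and past that point distinct integers start to collide. The sketch below uses only the standard library; the variable names are illustrative. Note that Python's own `json` module parses integer literals as arbitrary-precision `int`, so the loss only shows up in consumers that map every JSON number to a double.

```python
import json

LIMIT = 2 ** 53  # above this, not every integer has its own double

# Below the limit, float arithmetic on whole numbers is exact.
exact_below_limit = float(LIMIT - 1) + 1.0 == float(LIMIT)

# At the limit, adding 1.0 rounds back to the same double.
collides_at_limit = float(LIMIT) + 1.0 == float(LIMIT)

# Python's json module keeps integer literals as int, so nothing is lost here.
parsed = json.loads("9007199254740993")  # the literal is 2**53 + 1
parsed_is_exact = parsed == LIMIT + 1

# A consumer that converts the same literal to a double loses the last digit.
as_double = float("9007199254740993")
double_lost_precision = as_double == float(LIMIT)

print(exact_below_limit, collides_at_limit, parsed_is_exact, double_lost_precision)
```

For counts of lines or expressions in a type-check report, values this large are unlikely, so in practice the concern is about the format's guarantees rather than about this particular data.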

o11c · 2015-10-12T18:53:01Z

> ...a C implementation which is almost always more painful. Not even Expat supports the entirety of XML. The parsers that support everything are huge and slow as heck.

lxml, which I used in mypy, is a great example of how to write a non-painful wrapper over a C library.

And libxml2 compares very favorably for performance, see http://lxml.de/performance.html

refi64 · 2015-10-12T19:00:24Z

You forgot the part where installing lxml on Windows is horribly broken.

jhance · 2015-10-13T00:48:51Z

Using XSLT requires me to write XML therefore I will never use it

o11c · 2015-10-13T01:16:14Z

> Using XSLT requires me to write XML therefore I will never use it

That's what I said about XML until I used XSLT. In my opinion, XSLT tooling alone forgives all the human-unfriendliness from the rest of XML.

JukkaL · 2015-10-14T05:12:24Z

It seems like we have a consensus forming... XSLT is losing, unless it can gather some more support.

ilinum · 2017-07-12T17:46:17Z

I think we can use jinja2 instead of XSLT. It is more common than XSLT and so should be easier to use.

gvanrossum · 2017-11-02T02:47:40Z

Do we still want to get rid of xslt? Then I think the jinja2 idea is a fine one. Would we completely lose the lxml dependency in favor of jinja2?

JukkaL · 2017-11-02T10:55:35Z

It would still be nice to get rid of the lxml dependency and I like the idea of using jinja2 instead. Jinja2 is much better known than XSLT in the Python community and it's also easier to learn, in my opinion.

ilinum · 2017-11-02T15:35:24Z

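I think we can get rid of xslt if we switch to jinja2.

Coverage.py has their own template renderer <https://bitbucket.org/ned/coveragepy/src/9cf891202041c5cbafc19e8c2d5467db77100066/coverage/templite.py?at=default&fileviewer=file-view-default> that uses Django/Jinja2-like syntax. I wonder why they chose that over standard jinja2.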

gvanrossum · 2017-11-02T23:43:12Z

I think the comment about the aosa 500 lines book <http://aosabook.org/en/500L/a-template-engine.html> says it all -- it was an educational project. Also, it takes away an external dependency, but that's not important for us. Let's go with actual jinja2.


JukkaL added the "refactoring" label (Changing mypy's internals) Oct 11, 2015

JukkaL mentioned this issue Oct 11, 2015

Implement XML-based reports #713

Closed

JukkaL mentioned this issue Nov 29, 2015

Declare dependency on lxml #1001

Closed

gvanrossum mentioned this issue Jan 13, 2016

Type check coverage reports need more love #1123

Open

ilinum mentioned this issue Jul 17, 2017

Support Python 3.4 on Windows? #3722

Closed

JukkaL added the priority-1-normal label May 18, 2018

sirosen mentioned this issue Jul 6, 2020

Add an extra which specifies the lxml dependency #9105

Closed
