6718 Commits

Author SHA1 Message Date
Nick Wellnhofer
de918d45e1 Release v2.13.5 v2.13.5 2024-11-12 18:19:24 +01:00
Nick Wellnhofer
a3afc31fd7 xmlIO: Fix reading from non-regular files like pipes
Commit 7e14c05d removed unnecessary copying of uncompressed input
through zlib or xzlib. This broke input from non-regular files like
pipes which can't be reopened. Try to detect such files by checking
whether they're seekable and always pipe them through zlib or xzlib.

Also remove seemingly unnecessary calls to gzread and gzrewind to
support unseekable files.

Fixes https://gitlab.gnome.org/GNOME/libxslt/-/issues/124.
2024-11-12 16:36:39 +01:00
Nick Wellnhofer
f228c6f320 xpath: Fix parsing of non-ASCII names
Fix a long-standing issue where QNames starting with a non-ASCII
character would be rejected. This became more visible after "streaming"
XPath evaluation was disabled since the latter handled non-ASCII names
correctly.

Fixes #818.
2024-11-12 16:36:24 +01:00
Nick Wellnhofer
bcda764930 xmlreader: Fix return value of xmlTextReaderReadString
Return NULL if the node has no children or the children were already
deleted to match the 2.12 behavior.

Fixes #817.
2024-11-12 16:36:13 +01:00
Nick Wellnhofer
5cce7af791 parser: Fix loading of parameter entities in external DTDs
Regressed with commit 12f0bb94.

Fixes #816.
2024-11-12 16:35:58 +01:00
Nick Wellnhofer
4334cbb4e3 parser: Fix downstream code that swaps DTDs
Downstream code like the nginx xslt module can change the document's DTD
pointers in a SAX callback. If an entity from a separate DTD is parsed
lazily, its content must not reference the current document.

Regressed with commit d025cfbb.

Fixes #815.
2024-11-12 16:35:38 +01:00
Nick Wellnhofer
929297749c parser: Fix detection of duplicate attributes
We really need a second scan if more than one namespace clash was
detected.
2024-11-12 16:32:33 +01:00
Nick Wellnhofer
5a2a243b1b string: Fix va_copy fallback
Fix va_copy fallback reworked in 5cffba83.

Should fix #812.
2024-10-23 14:20:56 +02:00
Nick Wellnhofer
60d3056c97 Release v2.13.4 v2.13.4 2024-09-18 15:10:15 +02:00
Nick Wellnhofer
7c4fef2412 xpath: Make recursion check work with xmlXPathCompile
The check for maximum recursion depth required a parser context with an
xmlXPathContext which xmlXPathCompile didn't provide.

All other code should already set up or require an xmlXPathContext.
2024-09-18 15:03:05 +02:00
Nick Wellnhofer
d4e4f6f2e0 include: Check whether _MSC_VER is defined
Should fix #795.
2024-09-04 16:30:43 +02:00
Nick Wellnhofer
65db397463 parser: Make unsupported encodings an error in declarations
This was changed in 45157261, but in encoding declarations, unsupported
encodings should raise a fatal error.

Fixes #794.
2024-09-04 16:30:43 +02:00
Nick Wellnhofer
4224a3fb27 python: Fix SAX driver with character streams
This apparently broke with Python 3.5 which introduced character
streams.

Fixes #790.
2024-09-02 19:04:06 +02:00
triallax
da9f892f20 io: don't set the executable bit when creating files
Issue seems to have been introduced in
0bef93bf24def68c448af0e71844b942e0ed93ec.
2024-08-27 12:55:31 +02:00
Nick Wellnhofer
208a4d026c xmlcatalog: Improved fix for #699 2024-08-19 20:51:39 +02:00
Nick Wellnhofer
4e3ce5c009 Revert "catalog: Fetch XML catalog before dumping"
This reverts commit 723b4de04015c5acccd3cda5dd60db7d00702064.
2024-08-19 20:51:29 +02:00
Nick Wellnhofer
b93d4e7087 io: Add missing calls to xmlInitParser
This is required after c9a46a91.

Should fix #782.
2024-08-13 15:36:21 +02:00
Nick Wellnhofer
3a95572408 tree: Restore return value of xmlNodeListGetString with NULL list
When passing a NULL list to xmlNodeListGetString or
xmlNodeListGetRawString, return NULL instead of "" to match the old
behavior.

Fixes #783.
2024-08-13 15:36:16 +02:00
Nick Wellnhofer
239e25f613 parser: Report at least one fatal error 2024-08-05 16:47:21 +02:00
Nick Wellnhofer
d470dedf74 parser: Fix error handling after reaching limit
Mark document as non-wellformed and stop parser even if error limit was
reached.

Regressed in abd74186.
2024-08-05 16:47:12 +02:00
Nick Wellnhofer
3a648d11a3 parser: Make xmlParseChunk return an error if parser was stopped
This regressed after enhancing the disableSAX member in 2.13.

Should fix #777.
2024-07-25 17:27:43 +02:00
Nick Wellnhofer
3b1742b839 Release v2.13.3 v2.13.3 2024-07-24 15:16:52 +02:00
Nick Wellnhofer
de28e6ed3a [CVE-2024-40896] Fix XXE protection in downstream code
Some users set an entity's children manually in the getEntity SAX
callback to restrict entity expansion. This stopped working after
renaming the "checked" member of xmlEntity, making at least one
downstream project and its dependants susceptible to XXE attacks.

See #761.
2024-07-24 14:34:13 +02:00
Nick Wellnhofer
6ae65368ad autotools: Use AC_CHECK_DECL to check for getentropy
Should fix build issue on iOS.
2024-07-22 19:30:29 +02:00
Nick Wellnhofer
ed8b4264f6 xinclude: Set error handler when parsing text 2024-07-18 20:56:31 +02:00
Nick Wellnhofer
a0330b53c8 xinclude: Fix fallback for text includes
Fixes #772.
2024-07-18 19:45:59 +02:00
Nick Wellnhofer
8699ba234b io: Don't call getcwd in xmlParserGetDirectory
The "directory" value isn't used internally. Calling getcwd is
unnecessary and can cause problems in sandboxed environments.

Fixes #770.
2024-07-18 03:40:37 +02:00
Nick Wellnhofer
bf43e8a888 io: Fix return value of xmlFileRead
This broke in commit 6d27c54.

Fixes #766.
2024-07-09 17:53:49 +02:00
Nick Wellnhofer
e30cb632e7 parser: Fix error return of xmlParseBalancedChunkMemory
Only return an error code if the chunk is not well-formed to match the
2.12 behavior. Return 0 on non-fatal errors like invalid namespaces.

Fixes #765.
2024-07-08 13:32:58 +02:00
Nick Wellnhofer
dd5adf54c9 Undeprecate xmlKeepBlanksDefault 2024-07-06 20:25:27 +02:00
Nick Wellnhofer
4b3f860e54 Release v2.13.2 v2.13.2 2024-07-04 17:16:44 +02:00
Nick Wellnhofer
c7f8781c48 tree: Fix handling of empty strings in xmlNodeParseContent
We shouldn't create an empty text node to match the old behavior.

Fixes #759.
2024-07-03 16:18:06 +02:00
Nick Wellnhofer
4a0d74dcc3 SAX2: Reenable 'directory' as base URI fallback
Apparently, some users overwrite this member manually to set a base URI
for memory streams.

Fixes #753.
2024-07-03 12:19:07 +02:00
Nick Wellnhofer
2f6766dba6 valid: Restore ID lookup
Revert a change from d025cfbb and don't overwrite ID table entries, so
that the first attribute will be returned if there are duplicate IDs.

This requires two other changes:

- Attributes in entity content are never added to the ID table. This
  seems reasonable.

- Remove the optimization to skip ID lookup when copying and the target
  document has an empty ID table. This also seems more correct since the
  document could have ID declarations nevertheless or we could be
  copying xml:ids into the document for the first time.

Fixes #757.
2024-07-03 12:19:00 +02:00
Nick Wellnhofer
fc3c432b74 Clarify xpointer() extension removal 2024-07-03 12:18:37 +02:00
Nick Wellnhofer
046f61c698 parser: Reenable ctxt->directory
Unused internally, but used in downstream code.

Should fix #753.
2024-07-02 22:29:25 +02:00
Nick Wellnhofer
3c85a0e969 tests: Clarify licence of test/intsubset2.xml 2024-06-29 15:11:25 +02:00
Nick Wellnhofer
b3579eabd5 encoding: Restore old lookup order in xmlOpenCharEncodingHandler
When looking up encodings with xmlLookupCharEncodingHandler, the
returned handler can have a different name than requested
(capitalization, internal aliases). This should eventually be fixed.
For now we revert part of commit 5b893fa9, start the lookup with
xmlFindHandler and add an explicit check for UTF-8.

Should fix the encoding name issue mentioned in #749.
2024-06-27 13:22:49 +02:00
Nick Wellnhofer
411eeed447 uri: Only set file scheme for special Windows paths
Fixes 2ce70cde.

Also fix a test case.
2024-06-26 14:40:12 +02:00
Nick Wellnhofer
977c076ef6 uri: Handle filesystem paths in xmlBuildRelativeURISafe
This mainly fixes issues on Windows but should also fix a few general
corner cases.

Should fix #745.
2024-06-26 14:39:50 +02:00
Nick Wellnhofer
6fa2573140 uri: Enable Windows paths on Cygwin 2024-06-26 14:39:35 +02:00
Nick Wellnhofer
dc8c7d1ce8 encoding: Make xmlFindCharEncodingHandler return UTF-8 handler
xmlFindCharEncodingHandler must always return a handler.

Remove UTF-8 handler from default handler list.
2024-06-24 20:54:44 +02:00
Nick Wellnhofer
7759765c6c encoding: Fix encoding lookup with xmlOpenCharEncodingHandler
Make xmlOpenCharEncodingHandler call xmlParseCharEncoding first so we
prefer our own handlers for names like "UTF8". Only UTF-16 needs an
exception.

Make callers check the return value. For UTF-8, a NULL encoding doesn't
mean an error.

Remove unnecessary UTF-8 check from htmlFindOutputEncoder. Don't try to
look up ASCII handler since the HTML handler is always available.

Fix return code of xmlParseCharEncoding.

Should fix #744.
2024-06-22 22:41:10 +02:00
Nick Wellnhofer
5ff37946dd include: Define ATTRIBUTE_UNUSED for clang
Silences warnings under clang on Windows.
2024-06-21 14:54:04 +02:00
Nick Wellnhofer
aaa24ca6be uri: Fix xmlBuildURI with NULL base
Don't try to parse URI if base is NULL. Fixes functions like xmlParseDTD
with certain filenames.

Should fix #742.
2024-06-21 14:52:15 +02:00
Nick Wellnhofer
48dba1e21f Release v2.13.1 v2.13.1 2024-06-19 12:13:15 +02:00
Nick Wellnhofer
fc57977497 doc: Regenerate documentation 2024-06-19 12:09:53 +02:00
Nick Wellnhofer
14523148d1 doc: Hide internal macro 2024-06-19 12:09:22 +02:00
Nick Wellnhofer
def06f376e parser: Selectively reenable reading from "-"
Make filename "-" mean stdin for legacy SAX1 functions and xmlReadFile.
This should hopefully fix most command line utilities.

See #737.
2024-06-17 18:17:15 +02:00
Nick Wellnhofer
e1c702414b tests: Add XInclude test for issue #733 2024-06-17 17:44:38 +02:00