libxml2

c/libxml2

mirror of https://gitlab.gnome.org/GNOME/libxml2 synced 2025-03-28 21:33:13 +00:00

Author	SHA1	Message	Date
Nick Wellnhofer	b349225952	include: Change some return types from int to enum This also affects some new functions from 2.13.	2025-03-14 02:31:01 +01:00
Nick Wellnhofer	fd1b939168	include: Convert some macros to enums	2025-03-14 00:35:40 +01:00
Nick Wellnhofer	a78843be5e	xmllint: Support compressed input from stdin Another regression related to reading from stdin. Making a "-" filename read from stdin was deeply baked into the core IO code but is inherently insecure. I really want to reenable this dangerous feature as sparingly as possible. This now enables compressed input when using the "Fd" API functions which wasn't supported before. But XML_PARSE_NO_UNZIP will be inverted later. Allow compressed stdin in xmlReadFile to support xmlstarlet and older versions of xsltproc. So far, these are the only known command-line tools that rely on "-" meaning stdin.	2025-01-28 23:20:37 +01:00
Nick Wellnhofer	a530ff125d	io: Always consume encoding handler when creating output buffers Also free encoding handler in error case. Remove xmlAllocOutputBufferInternal which was identical to xmlAllocOutputBuffer.	2024-07-29 14:25:39 +02:00
Nick Wellnhofer	a6f54f055b	io: Fine-tune initial IO buffer size	2024-07-16 17:42:10 +02:00
Nick Wellnhofer	1432949d3c	io: Pass input flags to xmlParserInputBufferCreateUrl	2024-06-12 16:14:15 +02:00
Nick Wellnhofer	b5890cb425	io: Remove xmlParserInputBufferCreateFilenameSafe	2024-06-12 16:14:15 +02:00
Nick Wellnhofer	1b1e8b3c12	io: Stop invoking generic error handler for IO errors	2024-06-12 16:14:15 +02:00
Nick Wellnhofer	e314109ad1	save: Don't write directly to internal buffer Make sure that OOM errors are reported.	2024-02-16 16:14:05 +01:00
Nick Wellnhofer	7e0bbbc143	parser: New input API Provide a new set of functions to create xmlParserInputs. These can be used for the document entity or from external entity loaders. - Don't require xmlParserInputBuffer. - All functions take a base URI. - All functions take an encoding as string. - xmlNewInputURL also takes a public ID. - xmlNewInputMemory takes a size_t. - Optimization hints for memory buffers. Improve documentation. Only call xmlInitParser before allocating a new parser context. Call xmlCtxtUseOptions as early as possible.	2023-12-29 01:22:13 +01:00
Nick Wellnhofer	a26934105e	io: Move some code from xmlIO.c to parserInternals.c Move everything related to parser contexts to parserInternals.c.	2023-12-25 23:38:40 +01:00
Nick Wellnhofer	c9a46a91fe	io: Rework initialization	2023-12-21 15:02:24 +01:00
Nick Wellnhofer	23345a1cb1	io: Report IO errors through xmlCtxtErrIO This is also a new public API function to be used in external entity loaders.	2023-12-21 15:02:24 +01:00
Nick Wellnhofer	7e511f35f1	io: Pass error codes from xmlFileOpenReal to xmlNewInputFromFile This allows reporting to the parser context why opening a file failed, which improves error messages. Now we can also remove the stat call before opening a file.	2023-12-21 15:02:24 +01:00
Nick Wellnhofer	f19a95108a	parser: Report malloc failures Fix many places where malloc failures aren't reported. Make xmlErrMemory public. This is useful for custom external entity loaders. Introduce new API function xmlSwitchEncodingName. Change how we store whether the parser is stopped. This used to be signaled by setting ctxt->instate to XML_PARSER_EOF which was misdesigned and error-prone. Set ctxt->disableSAX to 2 instead and introduce a macro PARSER_STOPPED. Also stop removing parser inputs in xmlHaltParser. This allows removing many checks of ctxt->instate. Introduce xmlErrParser to handle errors if a parser context is available.	2023-12-11 22:13:05 +01:00
Nick Wellnhofer	834b8123ef	parser: Stream data when reading from memory Don't create a copy of the whole input buffer. Read the data chunk by chunk to save memory. Historically, it was probably envisioned to read data from memory without additional copying. This doesn't work reliably with the current design of the XML parser which requires a terminating null byte at the end of input buffers. This led to xmlReadMemory interfaces, which expect pointer and size arguments, being changed to make a zero-terminated copy of the input buffer. Interfaces based on xmlReadDoc, which actually expect a zero-terminated string and would make zero-copy operation work, were then simplified to rely on xmlReadMemory, resulting in an unnecessary copy. To avoid temporarily copying possibly gigabytes of memory, we now stream in-memory input just like content read from files in a chunk-by-chunk fashion (using a somewhat outdated INPUT_CHUNK size of 250 bytes). As a side effect, we also avoid another copy of the whole input when handling non-UTF-8 data which was made possible by some earlier commits. Interfaces expecting zero-terminated strings now make use of strnlen which unfortunately isn't part of the standard C library and only mandated since POSIX 2008.	2023-08-08 15:21:28 +02:00
Nick Wellnhofer	ccb6d54409	Hide internal functions These functions were never declared in public headers, so it should be safe to hide them. Fixes #139.	2022-11-27 02:20:53 +01:00
Nick Wellnhofer	46cd7d224e	io: Remove xmlInputReadCallbackNop In some cases, for example when using encoders, the read callback was set to NULL, in other cases it was set to xmlInputReadCallbackNop. xmlGROW only tested for xmlInputReadCallbackNop, resulting in errors when parsing large encoded content from memory. Always use a NULL callback for memory buffers to avoid ambiguities. Fixes #262.	2022-11-20 21:12:18 +01:00
Nick Wellnhofer	0f568c0b73	Consolidate private header files Private functions were previously declared - in header files in the root directory - in public headers guarded with IN_LIBXML - in libxml.h - redundantly in source files that used them. Consolidate all private header files in include/private.	2022-08-26 02:11:56 +02:00

19 Commits
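
Commit 834b8123ef describes streaming in-memory input chunk by chunk instead of copying the whole buffer. The following is a minimal, self-contained sketch of that general idea, not libxml2's actual implementation: MemCursor, mem_cursor_read and count_chunks are hypothetical names invented for illustration, and CHUNK_SIZE simply reuses the 250-byte INPUT_CHUNK figure quoted in the commit message.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Assumption: reuses the 250-byte INPUT_CHUNK figure from the commit message. */
#define CHUNK_SIZE 250

/* Hypothetical cursor over a caller-owned buffer; not a libxml2 type. */
typedef struct {
    const char *cur;   /* next unread byte */
    size_t remaining;  /* bytes not yet handed out */
} MemCursor;

/* Copy at most len bytes into out and advance the cursor.
 * Returns the number of bytes copied, 0 once the buffer is exhausted. */
size_t mem_cursor_read(MemCursor *mc, char *out, size_t len) {
    size_t n;

    assert(mc != NULL && out != NULL);
    n = len < mc->remaining ? len : mc->remaining;
    if (n > 0)
        memcpy(out, mc->cur, n);
    mc->cur += n;
    mc->remaining -= n;
    return n;
}

/* Drain data in CHUNK_SIZE pieces without duplicating the whole input.
 * Returns the number of non-empty chunks and stores the byte total in *total. */
size_t count_chunks(const char *data, size_t size, size_t *total) {
    MemCursor mc = { data, size };
    char chunk[CHUNK_SIZE];
    size_t chunks = 0;
    size_t n;

    *total = 0;
    while ((n = mem_cursor_read(&mc, chunk, sizeof(chunk))) > 0) {
        /* A real consumer would parse or convert the chunk here. */
        *total += n;
        chunks++;
    }
    return chunks;
}
```

Only one chunk-sized scratch buffer is live at any time, so peak extra memory stays constant regardless of input size. The source buffer needs no terminating null byte because the cursor tracks the remaining length explicitly.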