libxml2

mirror of https://gitlab.gnome.org/GNOME/libxml2.git synced 2025-10-28 23:14:57 +03:00

Author	SHA1	Message	Date
Nick Wellnhofer	6f4b452742	parser: Stop using ctxt->linenumbers I think this was used to avoid setting the `line` member before it was added (20+ years ago).	2025-05-16 18:03:12 +02:00
Nick Wellnhofer	46f05ea4d5	html: Rework meta charset handling Don't use encoding from meta tags when serializing. Only use the value in `doc->encoding`, matching the XML serializer. This is the actual encoding used when parsing. Stop modifying the input document by setting meta tags before serializing. Meta tags are now injected during serialization. Add full support for <meta charset=""> which is also used when adding meta tags. Align with HTML5 and implement the "algorithm for extracting a character encoding from a meta element". Only modify the encoding substring in Content-Type meta tags. Only switch encoding once when parsing. Fix htmlSaveFileFormat with a NULL encoding not to declare a misleading UTF-8 charset. Fixes #909.	2025-05-11 20:29:25 +02:00
Nick Wellnhofer	6896f478d4	Revert "valid: Remove duplicate error messages when streaming" This reverts commit `cd220b93d8`. This commit broke the xmstarlet tests.	2025-04-18 17:24:45 +02:00
Maks Verver	4d24aa22ac	python: Add a test to reproduce bug #889	2025-04-13 13:50:19 +02:00
Nick Wellnhofer	4135ceea75	meson: Run Python tests	2025-03-14 03:27:31 +01:00
Nick Wellnhofer	cd220b93d8	valid: Remove duplicate error messages when streaming	2024-12-28 11:55:24 +01:00
Yegor Yefremov	513949293d	python/tests: fix typos Typos were found with codespell.	2024-10-15 11:11:38 +02:00
Nick Wellnhofer	e179f3ec0e	html: Stop reporting syntax errors It doesn't make much sense to keep the old syntax error handling which doesn't conform to HTML5. Handling HTML5 parser errors is rather involved and not essential for parsers.	2024-10-06 20:04:00 +02:00
Nick Wellnhofer	ec0881099b	parser: Upgrade XML_IO_NETWORK_ATTEMPT to error Fixes XML::LibXML test suite.	2024-07-04 15:47:20 +02:00
Nick Wellnhofer	2608baaf92	parser: Make failure to load main document a warning Revert the change that made failures to load the main document an error. This fixes the --path option of xmllint and xsltproc. Should fix #733.	2024-06-14 20:06:07 +02:00
Nick Wellnhofer	1b1e8b3c12	io: Stop invoking generic error handler for IO errors	2024-06-12 16:14:15 +02:00
Nick Wellnhofer	bd7cafdbce	meson: Add some TODO comments	2024-05-20 23:59:55 +02:00
Nick Wellnhofer	fdc5ff3657	parser: Always throw entity errors if external DTD is loaded When parsing with XML_PARSE_DTDLOAD, missing entities are always an error. Also consolidate behavior when validating. See `b717abdd`.	2024-05-03 11:52:54 +02:00
Nick Wellnhofer	bffef46c4c	doc: Don't install example code	2024-04-28 22:58:06 +02:00
Nick Wellnhofer	b717abdd09	parser: Consolidate error handling for undeclared entities Always use XML_WAR_UNDECLARED_ENTITY with warning error level in documents with external subset or parameter entities. Use XML_ERR_UNDECLARED_ENTITY otherwise.	2024-04-23 18:36:15 +02:00
Vincent Torri	5732ce56f3	meson: Initial commit	2024-04-04 12:23:39 +02:00
Nick Wellnhofer	67e475b78e	http: Improve error message for HTTPS redirects	2024-02-19 11:09:39 +01:00
Nick Wellnhofer	63986c45b9	parser: Report fatal error if document entity couldn't be loaded Only lower error level when loading entities. Fixes #667.	2024-01-22 21:07:41 +01:00
Nick Wellnhofer	e8fb3d639f	parser: Convert some "internal errors" to meaningful codes	2024-01-02 19:48:23 +01:00
Nick Wellnhofer	e45a4d7115	io: Always forward IO errors to global handler The HTTP module raises errors without context. This won't be fixed, so send them to the global error handler.	2023-12-29 01:22:13 +01:00
Nick Wellnhofer	d944a41515	parser: Fix in-parameter-entity and in-external-dtd checks Use in ctxt->input->entity instead of ctxt->inputNr to determine whether we are inside a parameter entity. Stop using ctxt->external to check whether we're in an external DTD. This is signaled by ctxt->inSubset == 2.	2023-12-29 01:19:56 +01:00
Nick Wellnhofer	60841beba6	parser: Make XML_IO_NETWORK_ATTEMPT behave as before Always reported to generic error, not to parser context for backward compatibility. Several downstream test suites rely on this behavior.	2023-12-25 23:38:40 +01:00
Nick Wellnhofer	7e511f35f1	io: Pass error codes from xmlFileOpenReal to xmlNewInputFromFile This allows to report the reason why opening a file failed to the parser context and improve error messages. Now we can also remove the stat call before opening a file.	2023-12-21 15:02:24 +01:00
Nick Wellnhofer	c5a8aef2f6	error: Refactor error reporting Introduce xmlStrVASPrintf, trying to handle buggy snprintf implementations. Introduce xmlSetError to set errors atomically. Introduce xmlUpdateError to set an error, fixing up node, file and line. Introduce helper function xmlRaiseMemoryError. Make legacy error handlers call xmlReportError, avoiding checks in xmlVRaiseError. Remove fragile support for getting file and line info from XInclude nodes.	2023-12-21 02:46:27 +01:00
Nick Wellnhofer	aca16fb3d4	tree: Report malloc failures Fix many places where malloc failures aren't reported. Make some API function return an error code. Changing the return type from void to int is technically an ABI break but should be safe on most platforms. - xmlNodeSetContent - xmlNodeSetContentLen - xmlNodeAddContent - xmlNodeAddContentLen - xmlNodeSetBase Introduce new API functions that return a separate error code if a memory allocation fails. - xmlNodeGetAttrValue - xmlNodeGetBaseSafe - xmlGetNsListSafe Introduce private functions xmlTreeEnsureXMLDecl and xmlSplitQName4. Don't report low-level errors to the global error handler. Fix tree Introduce xmlGetNsListSafe Fix tree	2023-12-11 22:13:05 +01:00
Nick Wellnhofer	56944c517f	python: Make sure to distribute new files Add pyproject.toml and tests/setup_test.py to Makefile.am.	2023-11-04 19:32:07 +01:00
Nick Wellnhofer	fc26934eb0	memory: Fix memory debugging with Windows threads On Windows, malloc hooks can be called after the final call to xmlCleanupParser in various tests. This means that xmlMemMutex can still be accessed if memory debugging is enabled, so the mutex should not be cleaned. This also means that tests may report spurious memory leaks on Windows. The old implementation avoided the issue by keeping track of all global state objects in a doubly linked list, so they could be cleaned during xmlCleanupParser. But as far as I can tell all memory will be freed eventually, so this is mostly an issue with our test suite.	2023-09-21 23:29:18 +02:00
Nick Wellnhofer	6c4ea468b2	python: Fix tests Revert part of commit `138213ac`.	2023-09-21 21:31:52 +02:00
Nick Wellnhofer	89ee0369d2	python: Fix potential crash in tests/thread2.py Memory debugging must be initialized.	2023-09-21 15:19:42 +02:00
Nick Wellnhofer	bbd918b2e7	parser: Fix detection of null bytes Also suppress misleading extra errors. Fixes #122.	2023-08-29 18:43:10 +02:00
Nick Wellnhofer	138213acdf	python: Fix tests on MinGW Add the directory containing libxml2.dll with os.add_dll_directory to make tests work on MinGW. This has changed in Python 3.8 but for some reason, the issue only turned up with Python 3.11 on MinGW. Contrary to documentation, copying libxml2.dll into the directory containing the .pyd file doesn't work.	2023-08-15 12:55:35 +02:00
Nick Wellnhofer	886bf4e63b	Stop calling xmlMemoryDump This was used to check for memory leaks but could potentially create a .memdump file. These days, there are better ways to check for memory leaks.	2023-04-30 15:48:41 +02:00
David Kilzer	cb1b8b8516	xmlValidatePopElement() can return invalid value (-1) Covered by: test/VC/ElementValid5 This only affects XML Reader API with LIBXML_REGEXP_ENABLED and LIBXML_VALID_ENABLED turned on. * result/VC/ElementValid5.rdr: - Update result to add missing error message. * python/tests/reader2.py: * result/VC/ElementValid6.rdr: * result/VC/ElementValid7.rdr: * result/valid/781333.xml.err.rdr: - Update result to fix grammar issue. * valid.c: (xmlValidatePopElement): - Check return value of xmlRegExecPushString() to handle -1, and assign 'ret = 0;' to return 0 from xmlValidatePopElement(). This change affects xmlTextReaderValidatePop() from xmlreader.c. - Fix grammar of error message by changing 'child' to 'children'.	2023-04-10 13:21:53 -07:00
Nick Wellnhofer	74aa61e0bd	parser: Halt parser on DTD errors If we try to continue parsing after an error in the internal or external subset, entity expansion accounting gets more complicated. Simply halt the parser. Found with libFuzzer.	2023-01-24 11:32:15 +01:00
Ross Burton	4762c85668	Use python3 not python As per https://peps.python.org/pep-0394/, the python binary can be one of the following options: - Python 2 - Python 3 - Not exist All of the scripts in libxml2 use 'python', which may not exist. As Python 2 reached EOL on the 1st January 2020, it's safe to move the scripts to use python3 explicitly.	2022-12-07 13:21:12 +00:00
Ross Burton	0ac8c15eb4	python/tests/reader2: use absolute paths everywhere The expected errors contain an relative path, but the messages from the parser contain absolute paths. However, due to the tests not actually failing if there was an error this wasn't noticed. Instead of putting relative paths in the expected messages use format() to embed the correct absolute path. Also use os.path.join() consistently when constructing paths to ensure uniformly formatted paths.	2022-12-06 17:27:34 +00:00
Ross Burton	b9ba5e1d90	python/tests/reader2: always exit(1) if a test fails Batch up the errors in the first parse tests and ensure that the last tests exit with an error if they fail. Also remove an unused import.	2022-12-06 17:25:34 +00:00
Nick Wellnhofer	97c0a9cff7	tests: Fix use-after-free in Python tests The nodeset must be freed before the document. Fixes #443.	2022-11-22 17:01:39 +01:00
Nick Wellnhofer	d8f05db8f6	Fix Python tests on macOS	2022-06-20 01:49:38 +02:00
David Seifert	a62b31f43f	Use portable python shebangs * In conda or Gentoo Prefix, we don't want to use the system python and instead rely on PATH lookup.	2022-04-06 19:57:30 +02:00
Nick Wellnhofer	3f74e42bae	Simplify 'make check' targets	2022-04-04 05:41:51 +02:00
David Seifert	0137d9879b	python/tests: open() relative to test scripts	2022-03-30 22:00:50 +02:00
David Seifert	438209f3e1	python/Makefile.am: nest python docs in $(docdir)	2022-03-30 16:51:15 +02:00
Chun-wei Fan	1b7d4e2bcc	tstmem.py: Try importing from libxmlmods.libxml2mod if needed Distutils builds place libxml2mod.pyd under the libxmlmods subdir, so try this directory if 'import libxml2mod' failed.	2022-01-16 15:18:06 +01:00
Nick Wellnhofer	de5b624f10	Fix handling of unexpected EOF in xmlParseContent Readd the XML_ERR_TAG_NOT_FINISHED error on unexpected EOF which was removed in commit `62150ed2`. This commit also introduced a regression for direct users of xmlParseContent. Unclosed tags weren't checked.	2021-05-08 20:47:36 +02:00
Nick Wellnhofer	3e80560d4b	Fix line numbers in error messages for mismatched tags Commit `62150ed2` introduced a small regression in the error messages for mismatched tags. This typically only affected messages after the first mismatch, but with custom SAX handlers all line numbers would be off. This also fixes line numbers in the SAX push parser which were never handled correctly.	2021-05-07 11:48:11 +02:00
Nick Wellnhofer	20c60886e4	Fix typos Resolves #133.	2020-03-08 17:41:53 +01:00
Pieter van Oostrum	8f62ac92b2	Updated Python test reader2.py Added all test cases that have a non-empty error in result/valid/*.xml.err Restructured to make it easier extensible with new test cases Added coding cookie because there is non-ASCII in the error messages	2020-01-02 13:50:10 +01:00
Pieter van Oostrum	8c3e52ebd9	Updated python/tests/tstLastError.py libxml2.registerErrorHandler(None,None): None is not acceptable as first argument failUnlessEqual replaced by assertEqual	2020-01-02 13:49:31 +01:00
Nick Wellnhofer	d188eb921a	Make sure that Python tests exit with error code Closes #108.	2019-10-21 12:45:37 +02:00

1 2 3 4

152 Commits