Nick Wellnhofer
5f8ebc8809
save: Avoid xmlOutputBufferWriteQuotedString
xmlOutputBufferWriteQuotedString should be reserved for things like
system IDs.
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
0d81d6f811
html: Use xmlOutputBufferWrite if possible
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
89fcfe3a29
html: Start to use xmlSerializeText
Avoid temporary copy to speed up serialization.
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
777e2adf77
io: Consolidate escaping code
Use generated table approach of xmlSerializeText for xmlEscapeText.
Move most code to xmlIO.c.
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
cdaf657ffb
html: Don't escape < and > when serializing attribute values
Align with HTML5.
This will break some test suites.
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
e0e0a1f0f5
html: Remove special handling of &{...} when serializing
See https://www.w3.org/TR/html401/appendix/notes.html#h-B.7.1
Align with HTML5.
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
dad1163078
entities: Always replace invalid chars when escaping
The previous refactor painstakingly recreated the different behavior of
separate functions that were merged. It makes
Optimize IS_CHAR check for non-ASCII chars.
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
c8cea39d8a
save: Fix serialization of attribute defaults containing <
Long-standing bug that produced invalid XML.
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
971038e59f
html: Call lower-level escaping functions
Removes the need to pass a document around.
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
63535d3922
tree: Make xmlNodeListGetStringInternal work with escape flags
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
442c1903af
doc: Fix some damage from automated conversions
Add some newlines, fix returns.
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
98a61c9dff
doc: Fix briefs in tree docs
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
4b4bc15acf
doc: Misc fixes to buffer docs
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
ad390a5d14
parser: Set doc properties in endDocument SAX handler
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
c7c4964342
html: Move DTD creation to endDocument SAX callback
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
46f05ea4d5
html: Rework meta charset handling
Don't use encoding from meta tags when serializing. Only use the value
in `doc->encoding`, matching the XML serializer. This is the actual
encoding used when parsing.
Stop modifying the input document by setting meta tags before
serializing. Meta tags are now injected during serialization.
Add full support for <meta charset=""> which is also used when adding
meta tags.
Align with HTML5 and implement the "algorithm for extracting a character
encoding from a meta element". Only modify the encoding substring in
Content-Type meta tags.
Only switch encoding once when parsing.
Fix htmlSaveFileFormat with a NULL encoding not to declare a misleading
UTF-8 charset.
Fixes #909.
2025-05-11 20:29:25 +02:00
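The "algorithm for extracting a character encoding from a meta element" named in the commit above can be illustrated with a minimal sketch. This is not libxml2's code: the function name `sketch_find_meta_charset` and its helpers are hypothetical, and the sketch only follows the steps of the HTML specification as commonly described (find `charset`, skip whitespace, require `=`, then read a quoted or unquoted value). It returns the raw label and omits the later step that maps the label to an encoding.

```c
#include <stddef.h>
#include <string.h>

/* ASCII whitespace as defined by HTML: tab, LF, FF, CR, space. */
static int is_html_space(char c) {
    return c == '\t' || c == '\n' || c == '\f' || c == '\r' || c == ' ';
}

/* ASCII case-insensitive match of the next 7 chars against "charset". */
static int match_charset(const char *s) {
    static const char kw[] = "charset";
    size_t i;

    for (i = 0; i < 7; i++) {
        char c = s[i];
        if (c >= 'A' && c <= 'Z')
            c = (char) (c - 'A' + 'a');
        if (c != kw[i])     /* also stops at the terminating NUL */
            return 0;
    }
    return 1;
}

/*
 * Hypothetical helper, not part of libxml2.
 * Returns a pointer to the encoding label inside `content` and stores
 * its length in *len, or returns NULL if no label is found.
 */
static const char *
sketch_find_meta_charset(const char *content, size_t *len) {
    const char *p = content;

    while (*p != '\0') {
        const char *end;

        if (!match_charset(p)) {
            p++;
            continue;
        }
        p += 7;
        while (is_html_space(*p))
            p++;
        if (*p != '=')
            continue;       /* resume the search at this character */
        p++;
        while (is_html_space(*p))
            p++;

        if (*p == '"' || *p == '\'') {
            /* Quoted value: needs a matching closing quote. */
            end = strchr(p + 1, *p);
            if (end == NULL)
                return NULL;
            *len = (size_t) (end - (p + 1));
            return p + 1;
        }
        if (*p == '\0')
            return NULL;

        /* Unquoted value: runs to whitespace, ';' or end of string. */
        end = p;
        while (*end != '\0' && !is_html_space(*end) && *end != ';')
            end++;
        *len = (size_t) (end - p);
        return p;
    }
    return NULL;
}
```

Because the helper reports the position and length of the label inside the original string, a caller could rewrite only that substring of a Content-Type value, which is the behavior the commit message describes.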
Nick Wellnhofer
9aaa52fe48
tree: Make xmlNodeAddContent work with attributes
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
655ac5f851
html: Add comment regarding hack for XML documents
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
f3a080bc48
html: Ignore U+0000 in body text
Align with HTML5. Fixes #908.
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
a1e83b2401
io: Fix negation of potentially unsigned value
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
b3854fe964
reader: Fix null deref on malloc failure
Short-lived regression from 177067ea.
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
6684eb9350
fuzz: Fix out-of-tree build
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
6bd380ce1c
fuzz: Update README
2025-05-11 20:29:25 +02:00
Nick Wellnhofer
967df734c5
malloc-fail: Handle malloc failure in xmlSchemaCopyValue
Avoid null pointer dereference. Fixes #905.
2025-05-11 20:29:25 +02:00
Pavel Kopylov
4ed7157406
python: fix use-after-free in functions xmlPythonFileReadRaw(), xmlPythonFileRead() with python2.
Fixes #910.
2025-05-09 11:58:01 +02:00
Nick Wellnhofer
38ea8fa9de
doc: Fix varargs
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
9bbffec568
doc: Move brief to top, params to bottom of doc comments
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
7bc7ae9db3
doc: Enable Doxygen autobrief
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
ab13fbfd68
doc: Misc fixes to error docs
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
b1685459a3
doc: Misc fixes to xmlsave docs
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
7d689fabda
doc: Fix doc installation with Autotools
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
7b59e74c5f
doc: Always use case sensitive filenames with Doxygen
Avoid platform-specific behavior.
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
298f70b3d7
doc: Misc fixes to HTML tree docs
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
18d20a68bc
doc: More fine-grained redirects for old pages
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
80b6429fb3
doc: Misc fixes to encoding docs
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
81ac2e27fd
doc: Misc fixes to valid docs
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
714decd6d6
doc: Misc fixes to entities docs
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
f38f3e7b25
doc: Misc fixes to IO documentation
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
e6cfd04994
doc: Misc fixes to tree docs
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
1bf44f09ba
doc: Misc fixes to parser docs
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
b7274fb02f
doc: Misc fixes to HTML parser docs
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
411f30ef2a
doc: Don't document legacy HTML parser macros
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
4a01087585
doc: Move parser option docs to enum
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
0173fac786
gitlab-ci: Only build documentation once per CMake platform
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
a449c5fde3
catalog: Deprecate some functions
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
306b8bf28d
autotools: Remove -DSYSCONFDIR
This is handled in config.h now.
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
075283d49d
xlink: Deprecate remaining public function
This was never finished.
2025-05-06 19:51:38 +02:00
Nick Wellnhofer
05d0f59221
python: Skip __xml thread-local accessors
So we can remove conditional directives for Doxygen.
2025-05-06 19:51:26 +02:00
Nick Wellnhofer
9f496fdb8c
xmllint: Return early on invalid args
At this point, no memory was allocated and xmllintOom wasn't
initialized. Return immediately on invalid args to avoid triggering
false positive unreported OOM errors when fuzzing.
2025-05-03 14:33:06 +02:00
Nick Wellnhofer
488939b6a1
gitlab-ci: Enable documentation in more tests
2025-05-02 23:40:39 +02:00