31114 Commits

Author SHA1 Message Date
Philip Withnall
7868e6dd33 Merge branch 'on-error-stack' into 'main'
Small improvements to g_on_error_stack_trace and g_on_error_query

See merge request GNOME/glib!4375
2024-11-06 12:30:09 +00:00
Philip Withnall
8e46f31a23 Merge branch 'thread-name-length' into 'main'
thread: Force-limit thread name length

See merge request GNOME/glib!4374
2024-11-06 12:15:56 +00:00
Matthias Clasen
904be498c5 Small improvements to g_on_error_query
We can do the [S] option even if g_prgname isn't set, now that
g_on_error_stack_trace(NULL) works.
2024-11-06 11:46:53 +00:00
Matthias Clasen
f6e71c25ed Small improvements to g_on_error_stack_trace
Make the gdb commands match what /usr/bin/gstack uses,
and produce a stacktrace including all threads.
2024-11-06 11:46:53 +00:00
Matthias Clasen
95cdd0f06f Small improvements to g_on_error_stack_trace
Make g_on_error_stack_trace (NULL) work by using /proc/$PID/exe.

This matches what /usr/bin/gstack does.
2024-11-06 11:46:53 +00:00
Matthias Clasen
5b84636e62 thread: Force-limit thread name length
The documentation for glibc's pthread_setname_np states:

    The thread name is a meaningful C language string,
    whose length is restricted to 16 characters,
    including the terminating null byte ('\0').

The documentation for Solaris' pthread_setname_np states:

    The thread name is a string of length 31 bytes or less,
    UTF-8 encoded.

Failing to respect this length limitation may lead to no name being
set, which is confusing, since the thread then shows up under the
binary name in gdb. This was happening for the pango worker thread
with the name "[pango] fontconfig".
2024-11-06 11:44:49 +00:00
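The length limit described in the commit above can be sketched in plain C. This is only an illustration, not GLib's actual change: `truncate_thread_name` and `THREAD_NAME_MAX` are hypothetical names, and the 16-byte limit is taken from the glibc text quoted in the message.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical limit: 16 bytes including the terminating NUL, as quoted
 * above for glibc. Other platforms would need their own value. */
#define THREAD_NAME_MAX 16

/* Copy `name` into `out` (of size `out_size`), truncating so that the
 * result always fits, NUL terminator included. Returns `out`. */
static char *
truncate_thread_name (const char *name, char *out, size_t out_size)
{
  size_t len;

  assert (name != NULL && out != NULL && out_size > 0);

  len = strlen (name);
  if (len > out_size - 1)
    len = out_size - 1;

  memcpy (out, name, len);
  out[len] = '\0';
  return out;
}
```

With a 16-byte buffer, "[pango] fontconfig" (18 characters) would be cut to its first 15 characters. Note that a byte-wise cut like this may split a multi-byte UTF-8 sequence.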
Philip Withnall
9949067a69 Merge branch 'fix-gtask-auto-set-name-doc' into 'main'
gtask: Fix comment for auto task naming via 'g_task_set_source_tag()'

See merge request GNOME/glib!4383
2024-11-06 11:34:09 +00:00
Sid
9e29968183 gtask: Fix comment for auto task naming via 'g_task_set_source_tag()' 2024-11-06 11:34:09 +00:00
Philip Withnall
705832f269 Merge branch 'amolenaar/fix-content-typo' into 'main'
macos: Remove extraneous space from type identifier

See merge request GNOME/glib!4373
2024-11-04 17:02:22 +00:00
Arjan Molenaar
6cb13b153c macos: Remove extraneous space from type identifier 2024-11-04 16:31:36 +00:00
Emmanuele Bassi
7abfef8e06 Merge branch 'wip/pwithnall/fix-unix-mounts-tests-deprecations' into 'main'
tests: Fix calls to deprecated API in unix-mounts tests

See merge request GNOME/glib!4367
2024-11-04 16:27:12 +00:00
Philip Withnall
a33f26ff58 Merge branch 'wip/gvariant-builder-init-unset' into 'main'
gvariant: Introduce G_VARIANT_BUILDER_INIT_UNSET

See merge request GNOME/glib!4377
2024-11-04 16:11:52 +00:00
Sebastian Wick
3f71e403ed gvariant: Introduce G_VARIANT_BUILDER_INIT_UNSET
For g_auto(GVariantBuilder) one needs to initialize it before the
function returns, so it's best to do it when the variable is declared.
G_VARIANT_BUILDER_INIT exists but requires specifying a GVariantType in
the declaration which moves the type away from the usage of the builder
which often results in less readable code. G_VARIANT_BUILDER_INIT also
mentions that it's possible to explicitly zero the variable but this is
hard to find and writing `g_auto(GVariantBuilder) builder = {0,};` is
kind of ugly.

This introduces G_VARIANT_BUILDER_INIT_UNSET which zero initializes the
variable being declared. This gives us documentation and hides the
explicitly zeroing detail:

  g_auto(GVariantBuilder) builder = G_VARIANT_BUILDER_INIT_UNSET ();
2024-11-04 16:43:42 +01:00
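Why an auto-cleaned variable has to be initialized at its declaration can be shown without GLib. The sketch below assumes the GCC/Clang `cleanup` attribute; `Builder`, `BUILDER_INIT_UNSET`, `builder_clear` and `build_or_bail` are hypothetical stand-ins, not the GVariantBuilder API.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical stand-in for a builder type; not GVariantBuilder itself. */
typedef struct
{
  int *items;       /* NULL while the builder is unset */
  size_t n_items;
} Builder;

/* Zero-initialise at the point of declaration. */
#define BUILDER_INIT_UNSET() { NULL, 0 }

static int n_clears = 0;  /* counts cleanup calls, for illustration only */

static void
builder_clear (Builder *b)
{
  assert (b != NULL);
  free (b->items);  /* free (NULL) is a no-op, so an unset builder is safe */
  b->items = NULL;
  b->n_items = 0;
  n_clears++;
}

/* Returns early on bad input; the cleanup handler still runs on `b`. */
static int
build_or_bail (int want_items)
{
  __attribute__((cleanup (builder_clear))) Builder b = BUILDER_INIT_UNSET ();

  if (want_items <= 0)
    return -1;  /* `b` is still unset here, and clearing it is harmless */

  b.items = calloc ((size_t) want_items, sizeof (int));
  if (b.items == NULL)
    return -1;

  b.n_items = (size_t) want_items;
  return (int) b.n_items;
}
```

Without the initializer, the early return would hand uninitialized memory to the cleanup function.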
Sebastian Wick
b10a4507a5 gvariant: Use gi-docgen for the G_VARIANT_BUILDER_INIT documentation 2024-11-04 16:38:05 +01:00
Michael Catanzaro
155810edd9 Merge branch 'cm/fix-gio-tools-leaks' into 'main'
gio: Fix GFileEnumerator leaks in gio tools

See merge request GNOME/glib!4378
2024-11-04 15:29:34 +00:00
correctmost
5b2da7ecb3 gio: Fix GFileEnumerator leaks in gio tools 2024-11-03 12:22:01 -05:00
Michael Catanzaro
3dcf1ea13d Merge branch 'nm-nl' into 'main'
Enable GNetworkMonitorNetlink on FreeBSD

See merge request GNOME/glib!4376
2024-10-29 19:26:59 +00:00
Gleb Popov
02172bda73 Enable GNetworkMonitorNetlink on FreeBSD 2024-10-29 21:18:36 +03:00
Philip Withnall
0828714bd4 Merge branch 'structured-logging-domain-check-without-nul' into 'main'
glib: Don't require GLIB_DOMAIN to be a NUL-terminated string

See merge request GNOME/glib!4350
2024-10-24 20:41:32 +00:00
Philip Withnall
b0943a70e7 Merge branch 'issue_3512' into 'main'
glib/glib-private: Build glib without ASAN sanitizer on AIX.

Closes #3512

See merge request GNOME/glib!4368
2024-10-24 20:21:45 +00:00
Parth Patel
929114fad2 glib/glib-private: Build glib without ASAN sanitizer on AIX.
On AIX, we don't have the lsan symbols related to the sanitizer.
So skip this check on AIX to build glib without the ASAN sanitizer.

Issue: https://gitlab.gnome.org/GNOME/glib/-/issues/3512
2024-10-23 11:10:32 -05:00
Philip Withnall
4c3f06acbf Merge branch 'str-is-ascii-ifunc' into 'main'
gutf8: Add ifunc resolver for g_str_is_ascii() too

Closes #3511

See merge request GNOME/glib!4364
2024-10-22 17:37:32 +00:00
Sebastian Dröge
99bf0c966a glib: Add test for handling of non-NUL terminated strings in default log handler 2024-10-22 19:08:25 +03:00
Sebastian Dröge
9719853507 glib: Make sure GLIB_OLD_LOG_API is a NUL-terminated string
Every usage in GLib ensures this but theoretically external code might
pass something else. As this is only meant to be used internally from
GLib, don't support the other case but at least avoid potential
out-of-bounds reads.
2024-10-22 19:08:25 +03:00
Sebastian Dröge
f221864d6e glib: Don't require GLIB_DOMAIN to be a NUL-terminated string
The length might be passed explicitly in the field instead, and the
string might not have a NUL-terminator as happens for example when
passed from the Rust bindings.

Assuming a NUL terminator in that case might lead to out-of-bounds reads.

Thanks to Sebastian Wiesner for noticing this.
2024-10-22 19:08:25 +03:00
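A length-aware comparison of the kind this commit calls for might look like the following sketch. `Field` is a hypothetical type modelled on a key/value/length triple where a negative length means the value is NUL-terminated; `field_value_equals` is likewise an illustrative name, not a GLib function.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical field type: a negative length means "value is a
 * NUL-terminated string"; otherwise exactly `length` bytes are valid. */
typedef struct
{
  const char *key;
  const void *value;
  ptrdiff_t length;
} Field;

/* Return non-zero if the field's value equals `expected`, never reading
 * past `length` bytes when an explicit length is given. */
static int
field_value_equals (const Field *field, const char *expected)
{
  size_t expected_len;

  assert (field != NULL && expected != NULL);

  if (field->length < 0)
    return strcmp ((const char *) field->value, expected) == 0;

  expected_len = strlen (expected);
  return (size_t) field->length == expected_len &&
         memcmp (field->value, expected, expected_len) == 0;
}
```

The length check comes first, so `memcmp()` only runs when both sides are known to have the same number of valid bytes.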
Michael Catanzaro
2c345bd8e2 Merge branch 'fuzz-fix' into 'main'
fuzzing: Fix buffer overread error in the fuzz test itself

See merge request GNOME/glib!4365
2024-10-22 16:00:36 +00:00
Michael Catanzaro
155d00a61e Merge branch '3470-unicode-16' into 'main'
Update to Unicode 16.0.0 and fix Unicode composition for its new codepoints

Closes #3470

See merge request GNOME/glib!4362
2024-10-22 15:29:42 +00:00
Philip Withnall
c14810e08f tests: Fix calls to deprecated API in unix-mounts tests
This fixes commit aac56f1618 — I missed
this while reviewing it, but the unit tests were partially changed to
call the new APIs, without being fully changed. This caused the build to
succeed on Linux, but fail on macOS due to using a deprecated API.

Actually, a better approach for the unit tests would be to consistently
call the *old* APIs, as they all immediately call the new APIs. Then we
get coverage of both old and new for free, at the cost of putting
`G_GNUC_BEGIN_IGNORE_DEPRECATIONS` at the top of the test file.

Signed-off-by: Philip Withnall <pwithnall@gnome.org>

Helps: #3492
2024-10-22 16:15:18 +01:00
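The alternative suggested in the commit above is to test through the old API, which forwards to the new one, with deprecation warnings suppressed. It can be sketched in plain C as follows. `old_api`, `new_api` and `test_old_api` are hypothetical; the pragmas are the GCC/Clang ones, used here on the assumption that `G_GNUC_BEGIN_IGNORE_DEPRECATIONS` wraps something equivalent on those compilers.

```c
#include <assert.h>

/* New function holding the real implementation. */
static int
new_api (int x)
{
  return x * 2;
}

/* Old, deprecated name: immediately forwards to the new function. */
__attribute__((deprecated)) static int old_api (int x);

static int
old_api (int x)
{
  return new_api (x);
}

/* Suppress deprecation warnings for the test code only. */
#pragma GCC diagnostic push
#pragma GCC diagnostic ignored "-Wdeprecated-declarations"

/* Calling only the old name exercises both functions. */
static void
test_old_api (void)
{
  assert (old_api (21) == 42);
}

#pragma GCC diagnostic pop
```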
Philip Withnall
e795c715b5 Merge branch 'fix-namespace-UnixMountEntry' into 'main'
UnixMountEntry: Deprecate g_unix_mount_* API in favor of g_unix_mount_entry_* API for GUnixMountEntry methods

Closes #3492

See merge request GNOME/glib!4337
2024-10-22 14:48:12 +00:00
Jialu Zhou
aac56f1618 UnixMountEntry: Deprecate g_unix_mount_* API in favor of g_unix_mount_entry_*
This issue arises because the g_unix_mount_* naming convention does not match
the GUnixMountEntry instance type, confusing the introspection generator.

To resolve this, we are deprecating the g_unix_mount_* API functions that take
a GUnixMountEntry parameter and introducing equivalent g_unix_mount_entry_*
functions that correctly associate with the GUnixMountEntry instance. This change
ensures that introspection data correctly treats these as instance methods and
that documentation reflects proper ownership of returned data.

(Some minor tweaks by Philip Withnall.)

Fixes: #3492
2024-10-22 15:31:52 +01:00
Philip Withnall
ad67d4e833 glocalfile: Disable faccessat()-based query_exists on FreeBSD
It appears not to work, and nobody interested in FreeBSD has picked it
up to investigate yet.

Rather than have a completely broken implementation of
`g_file_query_exists()` on FreeBSD, let’s settle for using the old one.
It’s slightly slower than the new one, but has worked fine for people
for years.

This essentially reverts commit 65ad41d8a4
on FreeBSD. This commit can be reverted when a FreeBSD person
investigates what’s going wrong with the `faccessat()`-based
implementation.

Signed-off-by: Philip Withnall <pwithnall@gnome.org>

Helps: #3495
2024-10-22 13:37:55 +01:00
Philip Withnall
0e0421cfe2 fuzzing: Fix buffer overread error in the fuzz test itself
valgrind spotted this one: a read off the end of the `stop_chars` buffer
when `stop_chars_len == -1`, due to the fuzzing test not sticking to the
requirement from `g_data_input_stream_read_upto()` that `stop_chars`
must be nul-terminated if `stop_chars_len < 0`.

This can happen when reading `fuzzing/README.md`, which is done as a
smoketest when the fuzzing tests are run without the fuzzer, as normal
unit tests. In this case, it made smoke.

Signed-off-by: Philip Withnall <pwithnall@gnome.org>
2024-10-22 13:18:59 +01:00
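One common way to satisfy a "must be nul-terminated" requirement when the input is a raw byte buffer is to copy it into a terminated buffer first. The sketch below shows that general technique only; it is not necessarily how the fuzz test was fixed, and `dup_as_c_string` is a hypothetical helper.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>
#include <string.h>

/* Copy `size` raw bytes into a freshly allocated, NUL-terminated buffer so
 * it can be passed to an API that expects a C string. The caller frees the
 * result. Returns NULL on allocation failure. */
static char *
dup_as_c_string (const uint8_t *data, size_t size)
{
  char *copy;

  assert (data != NULL || size == 0);

  copy = malloc (size + 1);
  if (copy == NULL)
    return NULL;

  if (size > 0)
    memcpy (copy, data, size);
  copy[size] = '\0';
  return copy;
}
```

The alternative is to pass the explicit length instead of `-1`, which avoids the copy when the API supports it.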
Philip Withnall
cf982177dd gutf8: Add ifunc resolver for g_str_is_ascii() too
Just like how commit ad572e7780 added an
ifunc resolver for `g_utf8_validate()`, we also need to add one for
`g_str_is_ascii()`, as it also calls into the c-utf8 SIMD validation
code which causes false-positive buffer read overflow warnings from
valgrind and asan.

I thought about just adding the `strlen()` call into `g_str_is_ascii()`
unconditionally, as a simpler fix, but a quick search on
codesearch.debian.net suggests `g_str_is_ascii()` is used quite widely,
so this would have an unacceptable performance impact.

This should fix the valgrind failures on the `search-utils` test seen
here: https://gitlab.gnome.org/GNOME/glib/-/jobs/4423753.

Signed-off-by: Philip Withnall <pwithnall@gnome.org>
2024-10-22 12:46:16 +01:00
Philip Withnall
1755024caf gutf8: Factor out complex type signature into a typedef
As suggested by Michael Catanzaro, this should make the return type of
the resolve function a bit easier for people to parse.

This introduces no functional changes.

Signed-off-by: Philip Withnall <pwithnall@gnome.org>
2024-10-22 12:45:20 +01:00
Philip Withnall
05fb05b49b gutf8: Factor out ifunc attribute checks
It looks like these might get more complex in future, as compilers claim
to support the attribute (`__has_attribute(ifunc)` is true) but then
raise errors at compile time if the target architecture doesn’t support
ifuncs.

For example, see #3511.

This doesn’t fix #3511 (I don’t have time to test on musl right now), but
it should make it easier to update the platform preprocessor conditions
in future.

Signed-off-by: Philip Withnall <pwithnall@gnome.org>

Helps: #3511
2024-10-22 12:42:41 +01:00
Philip Withnall
f9f74efd76 tests: Improve Unicode composition code coverage
This adds various additional tests to cover branches of `gunidecomp.c`
which are not already covered, bringing our branch coverage of that file
up to 100% (if you ignore `g_utf8_normalize()`, which is tested by
`unicode-normalize.c` and I’m counting it separately).

Signed-off-by: Philip Withnall <pwithnall@gnome.org>

Helps: #3470
2024-10-21 19:32:50 +01:00
Philip Withnall
0125c58a05 tests: Test restricted result_len sizes for g_unichar_fully_decompose()
This pushes the code coverage of that function up to 100%.

And it found no bugs!

Signed-off-by: Philip Withnall <pwithnall@gnome.org>
2024-10-21 19:32:43 +01:00
Philip Withnall
84e3a9cde9 tests: Refactor g_unichar_fully_decompose() test to make it extensible
This introduces no functional changes, but allows the test to be easily
extended, in the following commit, to test restricted `result_len`
sizes.

Signed-off-by: Philip Withnall <pwithnall@gnome.org>
2024-10-21 19:32:37 +01:00
Philip Withnall
6eb648de1c tests: Refactor g_unichar_compose() test to fix booleans
`g_assert_false (g_unichar_compose (…) && ch == 0)` will succeed if
`g_unichar_compose()` succeeds and returns a non-zero character (which
it will if it succeeds), so this isn’t really testing what we want it to
test. This regressed in commit ae4eea7a39.

Refactor out the repetitive calls to `g_unichar_compose()` and fix the
boolean checks.

Signed-off-by: Philip Withnall <pwithnall@gnome.org>
2024-10-21 19:32:30 +01:00
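The boolean problem described in the commit above can be reproduced with a stub. In this sketch `compose_stub` is a hypothetical stand-in for `g_unichar_compose()` that composes only the pair (1, 2); the two checkers contrast the weak assertion with a stricter one.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stub: composes only the pair (1, 2), into 42.
 * On failure the output is set to zero. */
static bool
compose_stub (unsigned a, unsigned b, unsigned *ch)
{
  assert (ch != NULL);

  if (a == 1 && b == 2)
    {
      *ch = 42;
      return true;
    }

  *ch = 0;
  return false;
}

/* Weak form, equivalent to asserting that `compose && ch == 0` is false.
 * It passes whenever the conjunction is false, which includes the case
 * where composition unexpectedly succeeds with a non-zero result. */
static bool
weak_check_passes (unsigned a, unsigned b)
{
  unsigned ch = 0;

  return !(compose_stub (a, b, &ch) && ch == 0);
}

/* Strict form: requires both that composition failed and that the
 * output was zeroed. */
static bool
strict_check_passes (unsigned a, unsigned b)
{
  unsigned ch = 0;
  bool composed = compose_stub (a, b, &ch);

  return !composed && ch == 0;
}
```

For the pair (1, 2), which does compose, the weak check still passes while the strict check fails, which is the behaviour a "does not compose" test needs.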
Philip Withnall
5ec1c2422c gunidecomp: Add some explanatory comments to g_utf8_normalize()
Just to make it a bit easier for people to understand the logic in the
implementation in future, because it took me a while.

Signed-off-by: Philip Withnall <pwithnall@gnome.org>
2024-10-21 19:32:23 +01:00
Philip Withnall
32d3bddae4 gen-unicode-tables.pl: Add a new ‘either’ compose table
See the big comment in the code for details. Essentially, this adds a
new compose table specifically for the transitive closure of ‘either’
codepoints — codepoints which appear as the first codepoint in a
composition pair and as the second point in a composition pair
(potentially, but not necessarily, the same pair); or which appear in a
composition pair with an ‘either’ codepoint.

This new compose table has to be symmetrically indexed, as the
`COMPOSE_INDEX` macro doesn’t differentiate based on codepoint position
(first or second). It’s not possible to achieve that with the main
`compose_array` without making it absolutely huge (it’s currently about
150×40 in size and would have to become at least 150×150 in size). In
contrast, the new `compose_either_array` is currently 15×15.

Signed-off-by: Philip Withnall <pwithnall@gnome.org>

Helps: #3470
2024-10-21 19:32:16 +01:00
Philip Withnall
dc2491d224 gen-unicode-tables.pl: Add more error checking
We’re essentially trying to build a minimal perfect hash function, and
`vals` is the map which represents that function. If we redefine a
member of `vals`, the map is no longer a partial function — one input
value (a Unicode codepoint) has two output values (compose table
indices).

So it’s bad if a member of `vals` gets redefined, and we want to be
notified if that happens.

As it happens, some of the new codepoints in Unicode 16.0 cause these
checks to fail. For example, U+16121 Gurung Khema Vowel Sign U
decomposes to U+1611E U+1611E. This causes `vals{U+1611E}` to be defined
to an index from the `first` map, and then redefined to an index from
the `second` map.

The following few commits will fix this, but let’s get the checks in
first.

Signed-off-by: Philip Withnall <pwithnall@gnome.org>
2024-10-21 19:32:10 +01:00
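The check described in the commit above, refusing to let one codepoint map to two different indices, can be sketched in C, although the real script is Perl. `IndexMap` and `index_map_define` are hypothetical, and the index values used with them are made up for illustration.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define MAX_ENTRIES 64

typedef struct
{
  unsigned codepoint;
  unsigned index;
} Entry;

/* Tiny linear map from codepoint to table index. */
typedef struct
{
  Entry entries[MAX_ENTRIES];
  size_t n_entries;
} IndexMap;

/* Define codepoint -> index. Returns false, leaving the map unchanged,
 * if the codepoint already maps to a different index or the map is full.
 * Redefining a codepoint to the same index is accepted. */
static bool
index_map_define (IndexMap *map, unsigned codepoint, unsigned index)
{
  assert (map != NULL);

  for (size_t i = 0; i < map->n_entries; i++)
    if (map->entries[i].codepoint == codepoint)
      return map->entries[i].index == index;

  if (map->n_entries == MAX_ENTRIES)
    return false;

  map->entries[map->n_entries].codepoint = codepoint;
  map->entries[map->n_entries].index = index;
  map->n_entries++;
  return true;
}
```

A codepoint such as U+1611E, which appears in both positions of a pair, would trip this check if the two positions assigned it different indices.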
Philip Withnall
ebd26727a8 gen-unicode-tables.pl: Add some internal documentation
How these big tables of numbers work is perhaps a bit hard to figure
out, so it is useful to document the design decisions involved.

Signed-off-by: Philip Withnall <pwithnall@gnome.org>
2024-10-21 19:32:03 +01:00
Philip Withnall
e2b96c8679 gunicode: Update to Unicode 16.0.0
All changes mechanically generated with:
```
./tools/update-unicode-data.sh ~/Downloads/UCD 16.0.0
```
using the data from https://www.unicode.org/Public/16.0.0/ucd/UCD.zip.

Signed-off-by: Philip Withnall <pwithnall@endlessos.org>

Fixes: #3470
2024-10-21 19:31:56 +01:00
Philip Withnall
07e191012d gunicode: Add new scripts for Unicode 16.0
Manually added from the data in
https://www.unicode.org/Public/16.0.0/ucd/UCD.zip.

The following commit will mechanically update the Unicode tables to use
them.

Signed-off-by: Philip Withnall <pwithnall@gnome.org>

Helps: #3470
2024-10-21 19:31:49 +01:00
Philip Withnall
3065734dce gunicode: Fix comment describing shortcode for Nag Mundari script
So it reflects the `sc` line in `PropertyValueAliases.txt` in the
Unicode database.

Signed-off-by: Philip Withnall <pwithnall@gnome.org>
2024-10-21 19:31:42 +01:00
Philip Withnall
230fdbb373 tools: Add missing license and SPDX header to update-unicode-data.sh
Only one other previous author, and my contribution just now is so
simple as to not be copyrightable.

Signed-off-by: Philip Withnall <pwithnall@gnome.org>

Helps: #1415
2024-10-21 19:31:35 +01:00
Philip Withnall
79e25aaa1b tools: Update the normalisation test data when updating Unicode
Otherwise this easily gets forgotten.

Signed-off-by: Philip Withnall <pwithnall@gnome.org>

Helps: #3470
2024-10-21 19:31:26 +01:00
Philip Withnall
ad51ff8038 gunicode: Switch compose_array table from guint16 to gunichar
The time has finally come when Unicode has specified a codepoint above
U+FFFF which has a decomposition: U+16125 GURUNG KHEMA VOWEL SIGN AI, in
Unicode 16 which the following commits will add support for.

So far, we’ve managed to store the reverse-lookup from decomposed pairs
to their composed form using a 16-bit integer. Now we have to switch to
storing the composed form in a 32-bit `gunichar` as U+16125 won’t fit
otherwise.

This introduces no functional changes, but does double the in-memory
size of the `compose_array` table from 9176 bytes to 19932 bytes.

The code which uses this lookup table, in `gunidecomp.c`, was already
implicitly converting the loaded value to a `gunichar`, so needs no
changes.

When we update to Unicode 16, the new `NormalizationTest.txt` file
contains a test which will check that composed codepoints > U+FFFF work.
Specifically, U+11391 TULU-TIGALARI LETTER AU is tested.

Signed-off-by: Philip Withnall <pwithnall@gnome.org>

Helps: #3470
2024-10-21 19:30:38 +01:00
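The width problem behind the commit above is easy to check in isolation. The sketch uses standard fixed-width types rather than `guint16` and `gunichar`; `fits_in_16_bits` and `store_narrow` are hypothetical helpers.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* U+16125 is the codepoint named in the commit message above. */
#define EXAMPLE_CODEPOINT 0x16125u

static_assert (EXAMPLE_CODEPOINT > UINT16_MAX,
               "U+16125 needs more than 16 bits");

/* True if the codepoint can be stored in a 16-bit table entry. */
static bool
fits_in_16_bits (uint32_t codepoint)
{
  return codepoint <= UINT16_MAX;
}

/* Storing a wider codepoint in 16 bits keeps only the low 16 bits,
 * silently yielding a different codepoint. */
static uint16_t
store_narrow (uint32_t codepoint)
{
  return (uint16_t) (codepoint & 0xFFFFu);
}
```

Narrowing U+16125 this way would yield U+6125, an unrelated character, which is why the table entries have to be 32 bits wide.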
Philip Withnall
7ed7fe2867 Merge branch 'g_array_free-free_seg' into 'main'
free_seg -> free_segment for g_array_free() and g_ptr_array_free()

Closes #3508

See merge request GNOME/glib!4361
2024-10-21 11:49:10 +00:00