synapse

Author	SHA1	Message	Date
Devon Hudson	003fc725db	Merge branch 'develop' into devon/ssext_threads	2025-11-09 12:33:55 -07:00
Devon Hudson	934f99a694	Add wait_for_new_data tests	2025-11-09 12:09:56 -07:00
Devon Hudson	78e8ec6161	Add test for room list filtering	2025-11-09 09:44:52 -07:00
Devon Hudson	a3b34dfafd	Run linter	2025-11-09 09:30:44 -07:00
Devon Hudson	cb82a4a687	Handle user leave/ban rooms to prevent leaking data	2025-11-09 08:45:52 -07:00
Devon Hudson	dedd6e35e6	Rejig thread updates to use room lists	2025-11-08 09:12:37 -07:00
Andrew Ferrazzutti	fcac7e0282	Write union types as `X \| Y` where possible (#19111 ) aka PEP 604, added in Python 3.10	2025-11-06 14:02:33 -06:00
Erik Johnston	6790312831	Fixup logcontexts after replication PR. (#19146 ) Fixes logcontext leaks introduced in #19138.	2025-11-05 15:38:14 +00:00
Erik Johnston	d3ffd04f66	Fix spelling (#19145 ) Fixes up #19138	2025-11-05 14:00:59 +00:00
Erik Johnston	4906771da1	Faster redis replication handling (#19138 ) Spawning a background process comes with a bunch of overhead, so let's try to reduce the number of background processes we need to spawn when handling inbound fed. Currently, we seem to be doing roughly one per command. Instead, lets keep the background process alive for a bit waiting for a new command to come in.	2025-11-05 13:42:04 +00:00
Erik Johnston	5408101d21	Speed up pruning of ratelimiter (#19129 ) I noticed this in some profiling. Basically, we prune the ratelimiters by copying and iterating over every entry every 60 seconds. Instead, let's use a wheel timer to track when we should potentially prune a given key, and then we a) check fewer keys, and b) can run more frequently. Hopefully this should mean we don't have a large pause everytime we prune a ratelimiter with lots of keys. Also fixes a bug where we didn't prune entries that were added via `record_action` and never subsequently updated. This affected the media and joins-per-room ratelimiter.	2025-11-04 12:44:57 +00:00
Andrew Morgan	08f570f5f5	Fix "There is no current event loop in thread" error in tests (#19134 )	2025-11-04 12:32:49 +00:00
Eric Eastwood	e02a6f5e5d	Fix lost logcontext on `HomeServer.shutdown()` (#19108 ) Same fix as https://github.com/element-hq/synapse/pull/19090 Spawning from working on clean tenant deprovisioning in the Synapse Pro for small hosts project (https://github.com/element-hq/synapse-small-hosts/pull/204).	2025-11-03 14:07:10 -06:00
V02460	3595ff921f	Pydantic v2 (#19071 ) Co-authored-by: Andrew Morgan <1342360+anoadragon453@users.noreply.github.com> Co-authored-by: Andrew Morgan <andrew@amorgan.xyz>	2025-10-31 09:22:22 +00:00
Andrew Morgan	300c5558ab	Update `check_dependencies` to support markers (#19110 )	2025-10-30 21:33:29 +00:00
Eric Eastwood	c0b9437ab6	Fix lost logcontext when using `timeout_deferred(...)` (#19090 ) Fix lost logcontext when using `timeout_deferred(...)` and things actually timeout. Fix https://github.com/element-hq/synapse/issues/19087 (our HTTP client times out requests using `timeout_deferred(...)` Fix https://github.com/element-hq/synapse/issues/19066 (`/sync` uses `notifier.wait_for_events()` which uses `timeout_deferred(...)` under the hood) ### When/why did these lost logcontext warnings start happening? ``` synapse.logging.context - 107 - WARNING - sentinel - Expected logging context call_later but found POST-2453 synapse.logging.context - 107 - WARNING - sentinel - Expected logging context call_later was lost ``` In https://github.com/element-hq/synapse/pull/18828, we switched `timeout_deferred(...)` from using `reactor.callLater(...)` to [`clock.call_later(...)`](https://github.com/element-hq/synapse/blob/3b59ac3b69f6a2f73a504699b30313d8dcfe4709/synapse/util/clock.py#L224-L313) under the hood. This meant it started dealing with logcontexts but our `time_it_out()` callback didn't follow our [Synapse logcontext rules](https://github.com/element-hq/synapse/blob/3b59ac3b69f6a2f73a504699b30313d8dcfe4709/docs/log_contexts.md).	2025-10-30 11:49:15 -05:00
Andrew Morgan	349599143e	Move reading of multipart response into `try` body (#19062 )	2025-10-30 15:22:52 +00:00
Eric Eastwood	0417296b9f	Remove logcontext problems caused by awaiting raw `deferLater(...)` (#19058 ) This is a normal problem where we `await` a deferred without wrapping it in `make_deferred_yieldable(...)`. But I've opted to replace the usage of `deferLater` with something more standard for the Synapse codebase. Part of https://github.com/element-hq/synapse/issues/18905 It's unclear why we're only now seeing these failures happen with the changes from https://github.com/element-hq/synapse/pull/19057 Example failures seen in https://github.com/element-hq/synapse/actions/runs/18477454390/job/52645183606?pr=19057 ``` builtins.AssertionError: Expected `looping_call` callback from the reactor to start with the sentinel logcontext but saw task-_resumable_task-0-IBzAmHUoepQfLnEA. In other words, another task shouldn't have leaked their logcontext to us. ```	2025-10-29 10:23:10 -05:00
Shay	f1695ac20e	Add an admin API to get the space hierarchy (#19021 ) It is often useful when investigating a space to get information about that space and it's children. This PR adds an Admin API to return information about a space and it's children, regardless of room membership. Will not fetch information over federation about remote rooms that the server is not participating in.	2025-10-24 15:32:16 -05:00
Andrew Ferrazzutti	fc244bb592	Use type hinting generics in standard collections (#19046 ) aka PEP 585, added in Python 3.9 - https://peps.python.org/pep-0585/ - https://docs.astral.sh/ruff/rules/non-pep585-annotation/	2025-10-22 16:48:19 -05:00
Tulir Asokan	ec7554b768	Stabilize support for MSC4326: Device masquerading for appservices (#19033 ) Note: the code references MSC3202, which is what MSC4326 was split off from. Only MSC4326 was accepted, MSC3202 wasn't yet.	2025-10-13 11:13:07 -05:00
Eric Eastwood	d2c582ef3c	Move unique snowflake homeserver background tasks to `start_background_tasks` (#19037 ) (the standard pattern for this kind of thing)	2025-10-13 10:19:09 -05:00
Tulir Asokan	690b3a4fcc	Allow using MSC4190 features without opt-in (#19031 )	2025-10-13 13:07:11 +00:00
Eric Eastwood	47fb4b43ca	Introduce `RootConfig.validate_config()` which can be subclassed in `HomeServerConfig` to do cross-config class validation (#19027 ) This means we can move the open registration config validation from `setup()` to `HomeServerConfig.validate_config()` (much more sane). Spawning from looking at this area of code in https://github.com/element-hq/synapse/pull/19015	2025-10-09 14:56:22 -05:00
Eric Eastwood	715cc5ee37	Split homeserver creation and setup (#19015 ) ### Background As part of Element's plan to support a light form of vhosting (virtual host) (multiple instances of Synapse in the same Python process), we're currently diving into the details and implications of running multiple instances of Synapse in the same Python process. "Clean tenant provisioning" tracked internally by https://github.com/element-hq/synapse-small-hosts/issues/221 ### Partial startup problem In the context of Synapse Pro for Small Hosts, since the Twisted reactor is already running (from the `multi_synapse` shard process itself), when provisioning a homeserver tenant, the `reactor.callWhenRunning(...)` callbacks will be invoked immediately. This includes the Synapse's [`start`](https://github.com/element-hq/synapse/blob/0615b64bb49684b846110465052642a46fd27028/synapse/app/homeserver.py#L418-L429) callback which sets up everything (including listeners, background tasks, etc). If we encounter an error at this point, we are partially setup but the exception will [bubble back to us](https://github.com/element-hq/synapse-small-hosts/blob/8be122186bf1acb8c0426d84eb3abded25d682b7/multi_synapse/app/shard.py#L114-L121) without us having a handle to the homeserver yet so we can't call `hs.shutdown()` and clean everything up. ### What does this PR do? Structures Synapse so we split creating the homeserver instance from setting everything up. This way we have access to `hs` if anything goes wrong during setup and can subsequently `hs.shutdown()` to clean everything up.	2025-10-09 13:12:10 -05:00
Devon Hudson	4cb0eeabdf	Allow SlidingSyncStreamToken in /relations	2025-10-09 11:28:33 -06:00
fkwp	18f07fdc4c	Add MatrixRTC backend/services discovery endpoint (#18967 ) Co-authored-by: Andrew Morgan <andrew@amorgan.xyz>	2025-10-09 17:15:47 +01:00
Devon Hudson	4d7826b006	Filter events from extension if in timeline	2025-10-08 17:01:40 -06:00
Devon Hudson	ab7e5a2b17	Properly return prev_batch tokens for threads extension	2025-10-08 16:12:46 -06:00
Shay	8f01eb8ee0	Add an Admin API to fetch an event by ID (#18963 ) Adds an endpoint to allow server admins to fetch an event regardless of their membership in the originating room.	2025-10-08 11:38:15 +01:00
Devon Hudson	4c51247cb3	Only return rooms where user is currently joined	2025-10-07 12:49:32 -06:00
Eric Eastwood	7b8831310f	No need to have `version_string` as an argument since it's always the same (#19012 ) Assuming, we're happy with https://github.com/element-hq/synapse/pull/19011, this PR makes sense.	2025-10-07 13:27:24 -05:00
Andrew Morgan	2443760d0d	Update `KeyUploadServlet` to handle case where client sends `device_keys: null` (#19023 )	2025-10-07 16:23:55 +01:00
Till	42bbff8294	Validate the body of requests to `/keys/upload` (#17097 ) Co-authored-by: Andrew Morgan <andrew@amorgan.xyz> Co-authored-by: Andrew Morgan <1342360+anoadragon453@users.noreply.github.com> Co-authored-by: Eric Eastwood <erice@element.io>	2025-10-07 11:27:53 +01:00
Devon Hudson	79ea4bed33	Add thread_root events to threads extension response	2025-10-03 15:57:13 -06:00
Devon Hudson	9ef4ca173e	Add user room filtering for threads extension	2025-10-03 14:01:16 -06:00
Devon Hudson	24b38733df	Don't return empty fields in response	2025-10-02 17:23:30 -06:00
Devon Hudson	4602b56643	Stub in early db queries to get tests going	2025-10-02 17:11:14 -06:00
Eric Eastwood	6835e7be0d	Wrap the Rust HTTP client with `make_deferred_yieldable` (#18903 ) Wrap the Rust HTTP client with `make_deferred_yieldable` so downstream usage doesn't need to use `PreserveLoggingContext()` or `make_deferred_yieldable`. > it seems like we should have some wrapper around it that uses [`make_deferred_yieldable(...)`](https://github.com/element-hq/synapse/blob/40edb10a98ae24c637b7a9cf6a3003bf6fa48b5f/docs/log_contexts.md#where-you-create-a-new-awaitable-make-it-follow-the-rules) to make things right so we don't have to do this in the downstream code. > > -- @MadLittleMods, https://github.com/element-hq/synapse/pull/18357#discussion_r2294941827 Spawning from wanting to [remove `PreserveLoggingContext()` from the codebase](https://github.com/element-hq/synapse/pull/18870) and thinking that we [shouldn't have to pollute all downstream usage with `PreserveLoggingContext()` or `make_deferred_yieldable`](https://github.com/element-hq/synapse/pull/18357#discussion_r2294941827) Part of https://github.com/element-hq/synapse/issues/18905 (Remove `sentinel` logcontext where we log in Synapse)	2025-10-02 13:00:50 -05:00
Eric Eastwood	06a84f4fe0	Revert "Switch to OpenTracing's `ContextVarsScopeManager` (#18849 )" (#19007 ) Revert https://github.com/element-hq/synapse/pull/18849 Go back to our custom `LogContextScopeManager` after trying OpenTracing's `ContextVarsScopeManager`. Fix https://github.com/element-hq/synapse/issues/19004 ### Why revert? For reference, with the normal reactor, `ContextVarsScopeManager` worked just as good as our custom `LogContextScopeManager` as far as I can tell (and even better in some cases). But since Twisted appears to not fully support `ContextVar`'s, it doesn't work as expected in all cases. Compounding things, `ContextVarsScopeManager` was causing errors with the experimental `SYNAPSE_ASYNC_IO_REACTOR` option. Since we're not getting the full benefit that we originally desired, we might as well revert and figure out alternatives for extending the logcontext lifetimes to support the use case we were trying to unlock (c.f. https://github.com/element-hq/synapse/pull/18804). See https://github.com/element-hq/synapse/issues/19004#issuecomment-3358052171 for more info. ### Does this require backporting and patch releases? No. Since `ContextVarsScopeManager` operates just as good with the normal reactor and was only causing actual errors with the experimental `SYNAPSE_ASYNC_IO_REACTOR` option, I don't think this requires us to backport and make patch releases at all. ### Maintain cross-links between main trace and background process work In order to maintain the functionality introduced in https://github.com/element-hq/synapse/pull/18932 (cross-links between the background process trace and currently active trace), we also needed a small change. Previously, when we were using `ContextVarsScopeManager`, it tracked the tracing scope across the logcontext changes without issue. Now that we're using our own custom `LogContextScopeManager` again, we need to capture the active span from the logcontext before we reset to the sentinel context because of the `PreserveLoggingContext()` below. Added some tests to ensure we maintain the `run_as_background` tracing behavior regardless of the tracing scope manager we use.	2025-10-02 11:27:26 -05:00
Devon Hudson	6c460b3eae	Stub in threads extension tests	2025-10-01 10:53:11 -06:00
Andrew Morgan	5fff5a1893	Merge branch 'develop' of github.com:element-hq/synapse into develop	2025-10-01 09:40:38 +01:00
Devon Hudson	396de6544a	Cleanly shutdown SynapseHomeServer object (#18828 ) This PR aims to allow for a clean shutdown of the `SynapseHomeServer` object so that it can be fully deleted and cleaned up by garbage collection without shutting down the entire python process. Fix https://github.com/element-hq/synapse-small-hosts/issues/50 ### Pull Request Checklist <!-- Please read https://element-hq.github.io/synapse/latest/development/contributing_guide.html before submitting your pull request --> * [x] Pull request is based on the develop branch * [x] Pull request includes a [changelog file](https://element-hq.github.io/synapse/latest/development/contributing_guide.html#changelog). The entry should: - Be a short description of your change which makes sense to users. "Fixed a bug that prevented receiving messages from other servers." instead of "Moved X method from `EventStore` to `EventWorkerStore`.". - Use markdown where necessary, mostly for `code blocks`. - End with either a period (.) or an exclamation mark (!). - Start with a capital letter. - Feel free to credit yourself, by adding a sentence "Contributed by @github_username." or "Contributed by [Your Name]." to the end of the entry. * [x] [Code style](https://element-hq.github.io/synapse/latest/code_style.html) is correct (run the [linters](https://element-hq.github.io/synapse/latest/development/contributing_guide.html#run-the-linters)) --------- Co-authored-by: Eric Eastwood <erice@element.io>	2025-10-01 02:42:09 +00:00
Eric Eastwood	5adb08f3c9	Remove `MockClock()` (#18992 ) Spawning from adding some logcontext debug logs in https://github.com/element-hq/synapse/pull/18966 and since we're not logging at the `set_current_context(...)` level (see reasoning there), this removes some usage of `set_current_context(...)`. Specifically, `MockClock.call_later(...)` doesn't handle logcontexts correctly. It uses the calling logcontext as the callback context (wrong, as the logcontext could finish before the callback finishes) and it didn't reset back to the sentinel context before handing back to the reactor. It was like this since it was [introduced 10+ years ago](https://github.com/element-hq/synapse/commit/38da9884e70e8e44bde14c67a7a8a9d49a8b87ac). Instead of fixing the implementation which would just be a copy of our normal `Clock`, we can just remove `MockClock`	2025-09-30 11:27:29 -05:00
Andrew Morgan	2aab171042	Remove unstable prefixes for MSC2732 This MSC was accepted in 2022. We shouldn't need to continue supporting the unstable field names.	2025-09-30 17:10:32 +01:00
Eric Eastwood	5143f93dc9	Fix `server_name` in logging context for multiple Synapse instances in one process (#18868 ) ### Background As part of Element's plan to support a light form of vhosting (virtual host) (multiple instances of Synapse in the same Python process), we're currently diving into the details and implications of running multiple instances of Synapse in the same Python process. "Per-tenant logging" tracked internally by https://github.com/element-hq/synapse-small-hosts/issues/48 ### Prior art Previously, we exposed `server_name` by providing a static logging `MetadataFilter` that injected the values: https://github.com/element-hq/synapse/blob/205d9e4fc4774850f34971469ae500e70119d17a/synapse/config/logger.py#L216 While this can work fine for the normal case of one Synapse instance per Python process, this configures things globally and isn't compatible when we try to start multiple Synapse instances because each subsequent tenant will overwrite the previous tenant. ### What does this PR do? We remove the `MetadataFilter` and replace it by tracking the `server_name` in the `LoggingContext` and expose it with our existing [`LoggingContextFilter`](https://github.com/element-hq/synapse/blob/205d9e4fc4774850f34971469ae500e70119d17a/synapse/logging/context.py#L584-L622) that we already use to expose information about the `request`. This means that the `server_name` value follows wherever we log as expected even when we have multiple Synapse instances running in the same process. ### A note on logcontext Anywhere, Synapse mistakenly uses the `sentinel` logcontext to log something, we won't know which server sent the log. We've been fixing up `sentinel` logcontext usage as tracked by https://github.com/element-hq/synapse/issues/18905 Any further `sentinel` logcontext usage we find in the future can be fixed piecemeal as normal. https://github.com/element-hq/synapse/blob/d2a966f922fdc95bc86f7fe55b7b54a9ab3f25c1/docs/log_contexts.md#L71-L81 ### Testing strategy 1. Adjust your logging config to include `%(server_name)s` in the format ```yaml formatters: precise: format: '%(asctime)s - %(server_name)s - %(name)s - %(lineno)d - %(levelname)s - %(request)s - %(message)s' ``` 1. Start Synapse: `poetry run synapse_homeserver --config-path homeserver.yaml` 1. Make some requests (`curl http://localhost:8008/_matrix/client/versions`, etc) 1. Open the homeserver logs and notice the `server_name` in the logs as expected. `unknown_server_from_sentinel_context` is expected for the `sentinel` logcontext (things outside of Synapse).	2025-09-26 17:10:48 -05:00
Eric Eastwood	2f2b854ac1	Fix logcontext handling in `timeout_deferred` tests (#18974 ) Related to https://github.com/element-hq/synapse/issues/18905 These fixes were split off from https://github.com/element-hq/synapse/pull/18828 where @devonh was seeing some test failures because `timeout_deferred(...)` is being updated to use `Clock` utilities instead of raw `reactor` methods. This test was failing in that branch/PR until we made this new version that handles the logcontexts properly. While the previous version of this test does pass on `develop`, it was using what appears completely wrong assertions, assumptions, and bad patterns to make it happen (see diff comments below) --- Test originally introduced in https://github.com/matrix-org/synapse/pull/4407	2025-09-26 11:10:02 -05:00
Travis Ralston	d2a966f922	Use signature support from policy servers when available (#18934 ) Opening on Kegan's behalf [MSC4284](https://github.com/matrix-org/matrix-spec-proposals/pull/4284) has already been opened accordingly. --------- Co-authored-by: Kegan Dougal <7190048+kegsay@users.noreply.github.com> Co-authored-by: Eric Eastwood <erice@element.io>	2025-09-25 19:30:24 +00:00
Hugh Nimmo-Smith	fd8fa97b6a	Document and fix room_config param when user_may_create_room callback is invoked for a room upgrade (#18721 ) Co-authored-by: Eric Eastwood <erice@element.io>	2025-09-24 21:42:19 +00:00
Eric Eastwood	5266e423e2	Explain how Deferred callbacks interact with logcontexts (#18914 ) Spawning from https://github.com/matrix-org/synapse/pull/12588#discussion_r865843321 > It turns out `Deferred.cancel()` is a lot like `Deferred.callback()`/`errback()` in that it will trash the logging context: > it can resume a coroutine, which will restore its own logging context, then run: > > - until it blocks, setting the sentinel context > - or until it terminates, setting the context it was started with > > So we need to wrap it in `with PreserveLoggingContext():`, like we do with `.callback()`: > > ```python > with PreserveLoggingContext(): > self.render_deferred.cancel() > ``` > > -- @squahtx, https://github.com/matrix-org/synapse/pull/12588#discussion_r865843321	2025-09-24 16:20:42 -05:00

1 2 3 4 5 ...

3993 Commits