scylla

Author	SHA1	Message	Date
Avi Kivity	3092e3a5dc	Merge 'doc: improvements to the Create Cluster page' from Anna Stuchlik This PR: - Removes the redundant information about previous versions from the Create Cluster page. - Fixes language mistakes on that page, and replaces "Scylla" with "ScyllaDB". (nobackport) Closes scylladb/scylladb#16885 * github.com:scylladb/scylladb: doc: fix the language on the Create Cluster page doc: remove reduntant info about old versions	2024-01-21 18:18:32 +02:00
Avi Kivity	5810396ba1	Merge 'Invalidate prepared statements for views when their schema changes.' from Eliran Sinvani When a base table changes and altered, so does the views that might refer to the added column (which includes "SELECT *" views and also views that might need to use this column for rows lifetime (virtual columns). However the query processor implementation for views change notification was an empty function. Since views are tables, the query processor needs to at least treat them as such (and maybe in the future, do also some MV specific stuff). This commit adds a call to `on_update_column_family` from within `on_update_view`. The side effect true to this date is that prepared statements for views which changed due to a base table change will be invalidated. Fixes https://github.com/scylladb/scylladb/issues/16392 This series also adds a test which fails without this fix and passes when the fix is applied. Closes scylladb/scylladb#16897 * github.com:scylladb/scylladb: Add test for mv prepared statements invalidation on base alter query processor: treat view changes at least as table changes	2024-01-21 17:43:49 +02:00
Kefu Chai	d1dd71fbd7	mutation: do not include unused headers these unused includes were identified by clangd. see https://clangd.llvm.org/guides/include-cleaner#unused-include-warning for more details on the "Unused include" warning. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16889	2024-01-21 16:58:26 +02:00
Kefu Chai	1ce58595aa	dht: do not include unused headers these unused includes were identified by clangd. see https://clangd.llvm.org/guides/include-cleaner#unused-include-warning for more details on the "Unused include" warning. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16891	2024-01-21 16:56:16 +02:00
Kefu Chai	45c4f2039b	cql3: add formatter for cql3::ut_name before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define a formatter for cql3::ut_name, and remove their operator<<(). Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16890	2024-01-21 16:53:05 +02:00
Kefu Chai	f916286b25	index: do not include unused headers these unused includes were identified by clangd. see https://clangd.llvm.org/guides/include-cleaner#unused-include-warning for more details on the "Unused include" warning. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16892	2024-01-21 16:52:25 +02:00
Kefu Chai	ce076b5ae3	gossiping_property_file_snitch: drop unused using namespace we don't use any symbol in this namespace, in this function, so drop it. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16893	2024-01-21 16:48:37 +02:00
Eliran Sinvani	0e5a8cad62	Add test for mv prepared statements invalidation on base alter Issue #16392 describes a bug where, when a base table is altered, its materialized views' prepared statements are not invalidated, which in turn causes them to return missing data. This test reproduces this bug and serves as a regression test for this problem. Signed-off-by: Eliran Sinvani <eliransin@scylladb.com>	2024-01-21 15:44:06 +02:00
Eliran Sinvani	5e33d9346b	query processor: treat view changes at least as table changes When a base table changes and altered, so does the views that might refer to the added column (which includes "SELECT *" views and also views that might need to use this column for rows lifetime (virtual columns). However the query processor implementation for views change notification was an empty function. Since views are tables, the query processor needs to at least treat them as such (and maybe in the future, do also some MV specific stuff). This commit adds a call to `on_update_column_family` from within `on_update_view`. The side effect true to this date is that prepared statements for views which changed due to a base table change will be invalidated. Fixes #16392 Signed-off-by: Eliran Sinvani <eliransin@scylladb.com>	2024-01-21 15:40:54 +02:00
Petr Gusev	5de970e430	get_peer_info_for_update: update only required fields in raft topology mode Some fields of system.peers table are updated through raft, we don't need to peek them from gossiper. The goal of the patch is to declare explicitly which code is responsible for which fields. In particular, in raft topology mode we don't need to update raft-managed fields since it's done in topology_state_load and raft_ip_address_updater.	2024-01-19 20:37:12 +04:00
Petr Gusev	f51f843b67	get_peer_info_for_update: introduce set_field lambda This is a refactoring commit. In the next commit we'll add a parameter to this unified lambda and this is easy to do if we have only one lambda and not three.	2024-01-19 20:37:12 +04:00
Petr Gusev	37063e2432	storage_service::on_change: fix indent	2024-01-19 20:37:12 +04:00
Petr Gusev	8e6b569de5	storage_service::on_change: skip handle_state functions in raft topology mode We don't need them in raft topology mode since the token_metadata update happens in topology_state_load function. We lift the _raft_topology_change_enabled checks from those functions to on_change.	2024-01-19 20:37:12 +04:00
Petr Gusev	1e00889842	test_replace_different_ip: check old IP is removed from gossiper In this commit we modify the existing test_replace_different_ip. We add the check that the old IP is not contained in alive or down lists, which means it's completely wiped from gossiper. This test is failing without the force_remove_endpoint fix from a previous commit. We also check that the state of local system.peers table is correct.	2024-01-19 20:36:52 +04:00
Anna Stuchlik	d345a893d6	doc: fix the language on the Create Cluster page This commit fixes language mistakes on the Create Cluster page, and replaces "Scylla" with "ScyllaDB".	2024-01-19 17:21:12 +01:00
Anna Stuchlik	af669dd7ae	doc: remove reduntant info about old versions This commit removes the information about old versions, which is reduntant in the next upcoming version.	2024-01-19 17:06:34 +01:00
Anna Stuchlik	b1ba904c49	doc: remove upgrade for unsupported versions This commit removes the upgrade guides from ScyllaDB Open Source to Enterprise for versions we no longer support. In addition, it removes a link to one of the removed pages from the Troubleshooting section (the link is redundant). Closes scylladb/scylladb#16249	2024-01-19 15:59:35 +02:00
Mikołaj Grzebieluch	c589793a9e	test.py: test_maintenance_socket: remove pytest.xfail Issue https://github.com/scylladb/python-driver/issues/278 was fixed in https://github.com/scylladb/python-driver/pull/279. Closes scylladb/scylladb#16873	2024-01-19 14:54:15 +01:00
Botond Dénes	b50d9bb802	Merge 'Add code coverage support' from Eliran Sinvani This mini-set includes code coverage support for ScyllaDB, it provides: 1. Support for building ScyllaDB with coverage support. 2. Utilities for processing coverage profiling data 3. test.py support for generation and processing of coverage profiling into an lcov trace files which can later be used to produce HTML or textual coverage reports. Refs #16323 Closes scylladb/scylladb#16784 * github.com:scylladb/scylladb: Add code coverage documentation test.py: support code coverage code coverage: Add libraries for coverage handling test.py: support --coverage and --coverage-mode configure.py support coverage profiles on standrad build modes	2024-01-19 15:27:44 +02:00
Pavel Emelyanov	e62114214f	Merge 'More logging for Raft-based topology' from Kamil Braun Currently if topology coordinator gets stuck in a CI test run it's hard to debug this (e.g. scylladb/scylladb#16708). We can add a lot of logging inside topology coordinator code to aid debugging, without spamming the logs -- these are relatively rare control plane events. Closes scylladb/scylladb#16749 * github.com:scylladb/scylladb: test/pylib: scylla_cluster: enable raft_topology=debug level by default raft topology: increase level of some TRACE messages raft topology: log when entering transition states raft topology: don't include null ID in exclude_nodes raft topology: INFO log when executing global commands and updating topology state storage_service: separate logger for raft topology	2024-01-19 16:19:44 +03:00
Nadav Har'El	debf6753c7	Merge 'test/cql-pytest: run tests with tablets' from Botond Dénes Add `--experimental-features=tablets` to both `test/cql-pytest/suite.yaml` and `test/cql-pytest/run.py`, so tablets are enabled. Detect tablet support in `conftest.py` and add an xfail and skip marker to mark tests that fail/crash with tablets. These are expected to be fixed soon. Some tests checking things around alter-keyspace had to force-disable tablets on the created keyspace, because tablets interfere with the test (a keyspace with tablets cannot have simple strategy, for example). Tablets were also interfering with `test_keyspace.py:test_storage_options_local`, because it is expecting `system_schema.scylla_keyspaces` to not have any entries for local storage keyspace, but they have it if tablets are enabled. Adjust the test to account for this. Closes scylladb/scylladb#16840 * github.com:scylladb/scylladb: test/cql-pytest: run.py,suite.yaml: enable tablets by default test/cql-pytest: sprinkle xfail_tablets and skip_with_tablets as needed test/cql-pytest: disable tablets for some keyspace-altering tests test/cql-pytest: test_keyspace.py: test_storage_options_local(): fix for tablets test/cql-pytest: fix test_tablets.py to set initial_tablets correctly test/cql-pytest: add tablet detection logic and fixtures test/cql-pytest: extract is_scylla check into util.py	2024-01-19 13:38:56 +02:00
Kamil Braun	cc039498c6	Update tools/cqlsh submodule * tools/cqlsh 426fa0ea...b8d86b76 (8): > Make cqlsh work with unix domain sockets Fixes scylladb/scylladb#16489 > Bump python-driver version > dist/debian: add trailer line > dist/debian: wrap long line > Draft: explicit build-time packge dependencies > stop retruning status_code=2 on schema disagreement > Fix minor typos in the code > Dockerfile: apt-get update and apt-get upgrade to get latest OS packages	2024-01-19 11:23:22 +01:00
Botond Dénes	04881b3915	test/cql-pytest: run.py,suite.yaml: enable tablets by default All the preparations are done, the tests can now run with tablets.	2024-01-19 03:46:38 -05:00
Botond Dénes	075be5a04a	test/cql-pytest: sprinkle xfail_tablets and skip_with_tablets as needed For tests that cover functionality, which doesn't yet work with tablets. These tests and the respective functionality they test, are expected to be fixed soon, and then these fixtures will be removed.	2024-01-19 03:46:38 -05:00
Botond Dénes	6e6bee4368	test/cql-pytest: disable tablets for some keyspace-altering tests When tablets are enabled on a keyspace, they cannot be altered to simple replication strategy anymore. These keyspaces are testing exactly that, so disable tablets on the initial keyspace create statements.	2024-01-19 03:46:38 -05:00
Botond Dénes	5f11aa940d	test/cql-pytest: test_keyspace.py: test_storage_options_local(): fix for tablets This test expects a keyspace with local storage option, to not have a row in system_schema.scylla_keyspace. With tablets enabled by default, this won't be the case. Adjust the test to check for the specific storage-related columns instead.	2024-01-19 03:46:38 -05:00
Nadav Har'El	f92d2b4928	test/cql-pytest: fix test_tablets.py to set initial_tablets correctly Recently, in commit `49026dc319`, the way to choose the number of tablets in a new keyspace changed. This broke the test we had for a memory leak when many tablets were used, which saw the old syntax wasn't recognized and assumed Scylla is running without tablet support - so the test was skipped. Let's fix the syntax. After this patch the test passes if the tablets experimental feature is enabled, and only skipped if it isn't. Signed-off-by: Nadav Har'El <nyh@scylladb.com>	2024-01-19 03:46:38 -05:00
Botond Dénes	2119faf7fe	test/cql-pytest: add tablet detection logic and fixtures Add keyspace_has_tablets() utility function, which, given a keyspace, returns whether it is using tablets or not. In addition, 3 new fixtures are added: * has_tablets - does scylla has tablets by default? * xfail_tablets - the test is marked xfail, when tablets are enabled by default. * skip_with_tablets - the test is skipped when tablets are enabled by default, because it might crash with tablets. We expect the latter two to be removed soon(ish), as we make all test, and the functionality they test work with tablets.	2024-01-19 03:46:38 -05:00
Botond Dénes	6e53264bc3	test/cql-pytest: extract is_scylla check into util.py This logic is currently in the scylla_only fixture, but we want to re-use this in other utility functions in the next patches too.	2024-01-19 03:46:38 -05:00
Petr Gusev	070de5c551	test_replace: check two replace with same IP one after another This is a test case for the problem, described in the previous commit. Before that fix the second replace failed since it couldn't resolve an IP for the new host_id.	2024-01-19 12:24:04 +04:00
Petr Gusev	30b2e5838c	storage_service: sync_raft_topology_nodes: force_remove_endpoint for left nodes only if an IP is not used by other nodes Before the patch we called gossiper.remove_endpoint for IP-s of the left nodes. The problem is that in replace-with-same-ip scenario we called gossiper.remove_endpoint for IP which is used by the new, replacing node. The gossiper.remove_endpoint method puts the IP into quarantine, which means gossiper will ignore all events about this IP for quarantine_delay (one minute by default). If we immediately replace the just-replaced node with the same IP again, the bootstrap will fail since the gossiper events are blocked for this IP, and we won't be able to resolve an IP for the new host_id. Another problem was that we called gossiper.remove_endpoint method, which doesn't remove an endpoint from _endpoint_state_map, only from live and unreachable lists. This means the IP will keep circulating in the gossiper message exchange between cluster nodes until full cluster restart. This patch fixes both of these problems. First, we rely on the fact that when topology coordinator moves the being_replaced node to the left state, the IP of the replacing node is known to all nodes. This means before removing an IP from the gossiper we can check if this IP is currently used by another node in the current raft topology. This is done by constructing the used_ips map based on normal and transition nodes. This map is cached to avoid quadratic behaviour. Second, we call gossiper.force_remove_endpoint, not gossiper.remove_endpoint. This function removes an IP from _endpoint_state_map, as well as from live and unreachable lists. The tests for both of these improvements will be added in subsequent commits.	2024-01-19 12:24:04 +04:00
Kefu Chai	0dbb0ed09f	api: storage_service: correct a typo s/trough/through/ Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16870	2024-01-19 10:21:41 +02:00
Kefu Chai	5c0484cb02	db: add formatter for db::operation_type before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we define a formatter for db::operation_type, and remove their operator<<(). Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16832	2024-01-19 10:16:41 +02:00
Kefu Chai	2d2cd5fa3a	repair: do not compare unsigned with signed this change should silence the warning like ``` /home/kefu/dev/scylladb/repair/repair.cc:222:23: error: comparison of integers of different signs: 'int' and 'size_type' (aka 'unsigned long') [-Werror,-Wsign-compare] 222 \| for (int i = 0; i < all.size(); i++) { \| ~ ^ ~~~~~~~~~~ ``` Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16867	2024-01-19 08:52:02 +02:00
Kefu Chai	21d55abe8b	unimplemented: add format_as() for unimplemented::cause before this change, we rely on the default-generated fmt::formatter created from operator<<, but fmt v10 dropped the default-generated formatter. in this change, we replace operator<< with format_as() for unimplemented::cause, so that we don't rely on the deprecated behavior, and neither do we create a fully blown fmt::formatter. as in fmt v10, format_as() can be used in place of fmt::formatter, while in fmt v9, format_as() is only allowed to return a integer. so, to be future-proof, and to be simpler, format_as() is used. we can even replace `format_as(c)` with `c`, once fmt v10 is available in future. Refs #13245 Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16866	2024-01-19 08:38:30 +02:00
Botond Dénes	70252ee36f	Merge 'auth: do not include unused headers' from Kefu Chai these unused includes were identified by clangd. see https://clangd.llvm.org/guides/include-cleaner#unused-include-warning for more details on the "Unused include" warning. Closes scylladb/scylladb#16868 * github.com:scylladb/scylladb: auth: do not include unused headers locator: Handle replication factor of 0 for initial_tablets calculations table: add_sstable_and_update_cache: trigger compaction only in compaction group compaction_manager: perform_task_on_all_files: return early when there are no sstables to compact compaction_manager: perform_cleanup: use compaction_manager::eligible_for_compaction	2024-01-19 08:30:11 +02:00
Kefu Chai	263e2fabae	auth: do not include unused headers these unused includes were identified by clangd. see https://clangd.llvm.org/guides/include-cleaner#unused-include-warning for more details on the "Unused include" warning. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2024-01-19 10:49:17 +08:00
Avi Kivity	d65ce16cf6	Merge 'Prevent empty compaction tasks in cleanup, upgrade sstables, and add_sstable' from Benny Halevy This short series prevents the creation of compaction tasks when we know in advance that they have nothing to do. This is possible in the clean path by: - improve the detection of candidates for cleanup by skipping sstables that require cleanup but are already being compacted - checking that list of sstables selected for cleanup isn't empty before creating the cleanup task For upgrade sstables, and generally when rewriting all sstables: launch the task only if the list of candidate sstables isn't empty. For regular compaction, when triggered via `table::add_sstable_and_update_cache`, we currently trigger compaction (by calling `submit`) on all compaction groups while the sstable is added only to one of them. Also, it is typically called for maintenance sstables that are awaiting offstrategy compaction, in which case we can skip calling `submit` entirely since the caller triggers offstrategy compaction at a later stage. Refs scylladb/scylladb#15673 Refs scylladb/scylladb#16694 Fixes scylladb/scylladb#16803 Closes scylladb/scylladb#16808 * github.com:scylladb/scylladb: table: add_sstable_and_update_cache: trigger compaction only in compaction group compaction_manager: perform_task_on_all_files: return early when there are no sstables to compact compaction_manager: perform_cleanup: use compaction_manager::eligible_for_compaction	2024-01-18 19:47:33 +02:00
Pavel Emelyanov	8595d64d01	locator: Handle replication factor of 0 for initial_tablets calculations When calculating per-DC tablets the formula is shards_in_dc / rf_in_dc, but the denominator in it can be configured to be literally zero and the division doesn't work. Fix by assuming zero tablets for dcs with zero rf fixes: #16844 Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes scylladb/scylladb#16861	2024-01-18 19:42:08 +02:00
Kamil Braun	8d9b0a6538	raft: server: inline `poll_fsm_output`	2024-01-18 18:09:13 +01:00
Kamil Braun	754a7b54e4	raft: server: fix indentation	2024-01-18 18:09:11 +01:00
Kamil Braun	527780987b	raft: server: move `io_fiber`'s processing of `batch` to a separate function	2024-01-18 18:09:02 +01:00
Kamil Braun	3e6b4910a6	raft: move `poll_output()` from `fsm` to `server` `server` was the only user of this function and it can now be implemented using `fsm`'s public interface. In later commits we'll extend the logic of `io_fiber` to also subscribe to other events, triggered by `server` API calls, not only to outputs from `fsm`.	2024-01-18 18:07:52 +01:00
Kamil Braun	95b6a60428	raft: move `_sm_events` from `fsm` to `server` In later commits we will use it to wake up `io_fiber` directly from `raft::server` based on events generated by `raft::server` itself -- not only from events generated by `raft::fsm`. `raft::fsm` still obtains a reference to the condition variable so it can keep signaling it.	2024-01-18 18:07:44 +01:00
Kamil Braun	a83e04279e	raft: fsm: remove constructor used only in tests This constructor does not provide persisted commit index. It was only used in tests, so move it there, to the helper `fsm_debug` which inherits from `fsm`. Test cases which used `fsm` directly instead of `fsm_debug` were modified to use `fsm_debug` so they can access the constructor. `fsm_debug` doesn't change the behavior of `fsm`, only adds some helper members. This will be useful in following commits too.	2024-01-18 18:07:17 +01:00
Kamil Braun	689d59fccd	raft: fsm: move trace message from `poll_output` to `has_output` In a later commit we'll move `poll_output` out of `fsm` and it won't have access to internals logged by this message (`_log.stable_idx()`). Besides, having it in `has_output` gives a more detailed trace. In particular we can now see values such as `stable_idx` and `last_idx` from the moment of returning a new fsm output, not only when poll started waiting for it (a lot of time can pass between these two events).	2024-01-18 18:06:55 +01:00
Kamil Braun	f6d43779af	raft: fsm: extract `has_output()` Also use the more efficient coroutine-specific `condition_variable::when` instead of `wait`.	2024-01-18 18:06:27 +01:00
Kamil Braun	dccfd09d83	raft: pass `max_trailing_entries` through `fsm_output` to `store_snapshot_descriptor` This parameter says how many entries at most should be left trailing before the snapshot index. There are multiple places where this decision is made: - in `applier_fiber` when the server locally decides to take a snapshot due to log size pressure; this applies to the in-memory log - in `fsm::step` when the server received an `install_snapshot` message from the leader; this also applies to the in-memory log - and in `io_fiber` when calling `store_snapshot_descriptor`; this applies to the on-disk log. The logic of how many entries should be left trailing is calculated twice: - first, in `applier_fiber` or in `fsm::step` when truncating the in-memory log - and then again as the snapshot descriptor is being persisted. The logic is to take `_config.snapshot_trailing` for locally generated snapshots (coming from `applier_fiber`) and `0` for remote snapshots (from `fsm::step`). But there is already an error injection that changes the behavior of `applier_fiber` to leave `0` trailing entries. However, this doesn't affect the following `store_snapshot_descriptor` call which still uses `_config.snapshot_trailing`. So if the server got restarted, the entries which were truncated in-memory would get "revived" from disk. Fortunately, this is test-only code. However in future commits we'd like to change the logic of `applier_fiber` even further. So instead of having a separate calculation of trailing entries inside `io_fiber`, it's better for it to use the number that was already calculated once. This number is passed to `fsm::apply_snapshot` (by `applier_fiber` or `fsm::step`) and can then be received by `io_fiber` from `fsm_output` to use it inside `store_snapshot_descriptor`.	2024-01-18 18:05:45 +01:00
Kamil Braun	40cd91cff7	raft: server: pass `_aborted` to `set_exception` call This looks like a minor oversight, in `server_impl::abort` there are multiple calls to `set_exception` on the different promises, only one of them would not receive `_aborted`.	2024-01-18 18:05:18 +01:00
Kefu Chai	09a688d325	sstables: do not use lambda when not necessary before this change, we always reference the return value of `make_reader()`, and the return value's type `flat_mutation_reader_v2` is movable, so we can just pass it by moving away from it. in this change, instead of using a lambda, let's just have the return value of it. simpler this way. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com> Closes scylladb/scylladb#16835	2024-01-18 15:54:49 +02:00
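
Note on commit 8595d64d01 (locator: Handle replication factor of 0 for initial_tablets calculations): the message gives the per-DC formula as shards_in_dc / rf_in_dc and says the fix assumes zero tablets for DCs with zero rf. A minimal Python sketch of that guard follows. It is not the actual C++ code in locator; the function names, the dict layout, and the use of integer division are assumptions made for illustration only.

```python
def initial_tablets_per_dc(shards_in_dc: int, rf_in_dc: int) -> int:
    """Hypothetical helper: tablets for one DC as shards_in_dc / rf_in_dc.

    A replication factor of 0 would make the division fail, so such a DC
    is assumed to get zero tablets, as the commit message describes.
    """
    if rf_in_dc == 0:
        return 0
    # Integer division is an assumption; the real rounding rule is not
    # stated in the commit message.
    return shards_in_dc // rf_in_dc


def initial_tablets(dcs: dict[str, tuple[int, int]]) -> dict[str, int]:
    """Map each DC name to its tablet count, given (shards, rf) per DC."""
    return {
        dc: initial_tablets_per_dc(shards, rf)
        for dc, (shards, rf) in dcs.items()
    }
```

For example, initial_tablets({"dc1": (12, 3), "dc2": (8, 0)}) gives 4 tablets for dc1 and 0 for dc2 instead of raising ZeroDivisionError.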

40965 Commits