scylla

Author	SHA1	Message	Date
Raphael S. Carvalho	ec79ac46c9	db/view: Add visibility to view updating of Staging SSTables Today, we're completely blind about the progress of view updating on Staging files. We don't know how long it will take, nor how much progress we've made. This patch adds visibility with a new metric that will inform the number of bytes to be processed from Staging files. Before any work is done, the metric tell us the total size to be processed. As view updating progresses, the metric value is expected to decrease, unless work is being produced faster than we can consume them. We're piggybacking on sstables::read_monitor, which allows the progress metric to be updated whenever the SSTable reader makes progress. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #11751	2022-10-12 16:57:37 +03:00
Avi Kivity	2e79bb431c	tools: change source_location location std::experimental::source_location is provided by <experimental/source_location>, not <source_location>. libstdc++ 12 insists, so change the header. Closes #11766	2022-10-12 15:29:14 +03:00
Takuya ASADA	6b246dc119	locator::ec2_snitch: Retry HTTP request to EC2 instance metadata service EC2 instance metadata service can be busy, ret's retry to connect with interval, just like we do in scylla-machine-image. Fixes #10250 Signed-off-by: Takuya ASADA <syuu@scylladb.com> Closes #11688	2022-10-12 13:59:06 +03:00
Kamil Braun	3e84b1f69c	Merge 'test.py: topology fix ssl var and improve pylint score' from Alecco When code was moved to the new directory, a bug was reintroduced with `ssl` local hiding `ssl` module. Fix again. Closes #11755 * github.com:scylladb/scylladb: test.py: improve pylint score for conftest test.py: fix variable name collision with ssl	2022-10-12 11:41:11 +02:00
Avi Kivity	f673d0abbe	build: support fmt 9 ostream formatter deprecation fmt 9 deprecates automatic fallback to std::ostream formatting. We should migrate, but in order to do so incrementally, first enable the deprecated fallback so the code continues to compile. Closes #11768	2022-10-12 09:27:36 +03:00
Avi Kivity	0952cecfc9	build: mark abseil as a system header Abseil is not under our control, so if a header generates a warning, we can do nothing about it. So far this wasn't a problem, but under clang 15 it spews a harmless deprecation warning. Silence the warning by treating the header as a system header (which it is, for us). Closes #11767	2022-10-12 09:27:36 +03:00
Asias He	810b424a8c	storage_service: Reject to bootstrap new node when node has unknown gossip status - Start a cluster with n1, n2, n3 - Full cluster shutdown n1, n2, n3 - Start n1, n2 and keep n3 as shutdown - Add n4 Node n4 will learn the ip and uuid of n3 but it does not know the gossip status of n3 since gossip status is published only by the node itself. After full cluster shutdown, gossip status of n3 will not be present until n3 is restarted again. So n4 will not think n3 is part of the ring. In this case, it is better to reject the bootstrap. With this patch, one would see the following when adding n4: ``` ERROR 2022-09-01 13:53:14,480 [shard 0] init - Startup failed: std::runtime_error (Node 127.0.0.3 has gossip status=UNKNOWN. Try fixing it before adding new node to the cluster.) ``` The user needs to perform either of the following before adding a new node: 1) Run nodetool removenode to remove n3 2) Restart n3 to get it back to the cluster Fixes #6088 Closes #11425	2022-10-11 15:47:34 +03:00
Botond Dénes	378c6aeebd	Merge 'More Raft upgrade tests' from Kamil Braun Refactor the existing upgrade tests, extracting some common functionality to helper functions. Add more tests. They are checking the upgrade procedure and recovery from failure in scenarios like when a node fails causing the procedure to get stuck or when we lose a majority in a fully upgraded cluster. Add some new functionalities to `ScyllaRESTAPIClient` like injecting errors and obtaining gossip generation numbers. Extend the removenode function to allow ignoring dead nodes. Improve checking for CQL availability when starting nodes to speed up testing. Closes #11725 * github.com:scylladb/scylladb: test/topology_raft_disabled: more Raft upgrade tests test/topology_raft_disabled: refactor `test_raft_upgrade` test/pylib: scylla_cluster: pass a list of ignored nodes to removenode test/pylib: rest_client: propagate errors from put_json test/pylib: fix some type hints test/pylib: scylla_cluster: don't create and drop keyspaces to check if cql is up	2022-10-11 15:30:00 +03:00
Kamil Braun	08e654abf5	Merge 'raft: (service) cleanups on the path for dynamic IP address support' from Konstantin Osipov In preparation for supporting IP address changes of Raft Group 0: 1) Always use start_server_for_group0() to start a server for group 0. This will provide a single extension point when it's necessary to prompt raft_address_map with gossip data. 2) Don't use raft::server_address in discovery, since going forward discovery won't store raft::server_address. On the same token stop using discovery::peer_set anywhere outside discovery (for persistence), use a peer_list instead, which is easier to marshal. Closes #11676 * github.com:scylladb/scylladb: raft: (discovery) do not use raft::server_address to carry IP data raft: (group0) API refactoring to avoid raft::server_address raft: rename group0_upgrade.hh to group0_fwd.hh raft: (group0) move the code around raft: (discovery) persist a list of discovered peers, not a set raft: (group0) always start group0 using start_server_for_group0()	2022-10-11 13:43:41 +02:00
Asias He	58c65954b8	storage_service: Reject decommission if nodes are down - Start n1, n2, n3 - Apply network nemesis as below: + Block gossip traffic going from nodes 1 and 2 to node 3. + All the other rpc traffic flows normally, including gossip traffic from node 3 to nodes 1 and 2 and responses to node_ops commands from nodes 1 and 2 to node 3. - Decommission n3 Currently, the decommission will be successful because all the network traffic is ok. But n3 could not advertise status STATUS_LEFT to the rest of the cluster due to the network nemesis applied. As a result, n1 and n3 could not move the n3 from STATUS_LEAVING to STATUS_LEFT, so n3 will stay in DL forever. I know why the node stays DL forever. The problem is that with node_ops_cmd based node operation, we still rely on the gossip status of STATUS_LEFT from the node being decommissioned to notify other nodes this node has finished decommission and can be moved from STATUS_LEAVING to STATUS_LEFT. This patch fixes by checking gossip liveness before running decommission. Reject if required peer nodes are down. With the fix, the decommission of n3 will fail like this: $ nodetool decommission -p 7300 nodetool: Scylla API server HTTP POST to URL '/storage_service/decommission' failed: std::runtime_error (decommission[adb3950e-a937-4424-9bc9-6a75d880f23d]: Rejected decommission operation, removing node=127.0.0.3, sync_nodes=[127.0.0.2, 127.0.0.3, 127.0.0.1], ignore_nodes=[], nodes_down={127.0.0.1}) Fixes #11302 Closes #11362	2022-10-11 14:09:28 +03:00
Botond Dénes	917fdb9e53	Merge "Cut database-system_keyspace circular dependency" from Pavel Emelyanov " There's one via the database's compaction manager and large data handler sub-services. Both need system keyspace to put their info into, but the latter needs database naturally via query_processor->storage_proxy link. The solution is to make c.m. \| l.d.h. -> sys.ks. dependency be weak with the help of shared_from_this(), described in details in patch #2 commit message. As a (not-that-)side effect this set removes a bunch of global qctx calls. refs: #11684 (this set seem to increase the chance of stepping on it) " * 'br-sysks-async-users' of https://github.com/xemul/scylla: large_data_handler: Use local system_keyspace to update entries system_keyspace: De-static compaction history update compaction_manager: Relax history paths database: Plug/unplug system_keyspace system_keyspace: Add .shutdown() method	2022-10-11 08:52:04 +03:00
Nadav Har'El	ef0da14d6f	test/cql-pytest: add simple tests for USE statement This patch adds a couple of simple tests for the USE statement: that without USE one cannot create a table without explicitly specifying a keyspace name, and with USE, it is possible. Beyond testing these specific feature, this patch also serves as an example of how to write more tests that need to control the effective USE setting. Specifically, it adds a "new_cql" function that can be used to create a new connection with a fresh USE setting. This is necessary in such tests, because if multiple tests use the same cql fixture and its single connection, they will share their USE setting and there is no way to undo or reset it after being set. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #11741	2022-10-11 08:20:19 +03:00
Kamil Braun	df2fb21972	test/topology: reenable test_remove_node_add_column After #11691 was merged the test should no longer be flaky. Reenable it. Closes #11754	2022-10-11 08:18:20 +03:00
Konstantin Osipov	3e46c32d7b	raft: (discovery) do not use raft::server_address to carry IP data We plan to remove IP information from Raft addresses. raft::server_address is used in Raft configuration and also in discovery, which is a separate algorithm, as a handy data structure, to avoid having new entities in RPC. Since we plan to remove IP addresses from Raft configuration, using raft::server_address in discovery and still storing IPs in it would create ambiguity: in some uses raft::server_address would store an IP, and in others - would not. So switch to an own data structure for the purposes of discovery, discovery_peer, which contains a pair ip, raft server id. Note to reviewers: ideally we should switch to URIs in discovery_peer right away. Otherwise we may have to deal with incompatible changes in discovery when adding URI support to Scylla.	2022-10-10 16:24:33 +03:00
Pavel Emelyanov	b1f4273f0d	large_data_handler: Use local system_keyspace to update entries The l._d._h.'s way to update system keyspace is not like in other code. Instead of a dedicated helper on the system_keyspace's side it executes the insertion query directly with the help of qctx. Now when the l._d._h. has the weak system keyspace reference it can execute queries on _it_ rather than on the qctx. Just like in previous patch, it needs to keep the sys._k.s. weak reference alive until the query's future resolves. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-10-10 16:20:59 +03:00
Pavel Emelyanov	907fd2d355	system_keyspace: De-static compaction history update Compaction manager now has the weak reference on the system keyspace object and can use it to update its stats. It only needs to take care and keep the shared pointer until the respective future resolves. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-10-10 16:20:59 +03:00
Pavel Emelyanov	3e0b61d707	compaction_manager: Relax history paths There's a virtual method on table_state to update the entry in system keyspace. It's an overkill to facilitate tests that don't want this. With new system_keyspace weak referencing it can be made simpled by moving the updating call to the compaction_manager itself. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-10-10 16:20:59 +03:00
Pavel Emelyanov	f9b57df471	database: Plug/unplug system_keyspace There's a circular dependency between system_keyspace and database. The former needs the latter because it needs to execula local requests via query_processor. The latter needs the former via compaction manager and large data handler, database depends on both and these too need to insert their entries into system keyspace. To cut this loop the compaction manager and large data handler both get a weak reference on the system keysace. Once system keyspace starts is activcates this reference via the database call. When system keyspace is shutdown-ed on stop, it deactivates the reference. Technically the weak reference is implemented by marking the system_k.s. object as async_sharded_service, and the "reference" in question is the shared_from_this() pointer. When compaction manager or large data handler need to update a system keyspace's table, they both hold an extra reference on the system keyspace until the entry is committed, thus making sure that sys._k.s. doesn't stop from under their feet. At the same time, unplugging the reference on shutdown makes sure that no new entries update will appear and the system_k.s. will eventually be released. It's not a C++ classical reference, because system_keyspace starts after and stops before database. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-10-10 16:20:59 +03:00
Konstantin Osipov	8857e017c7	raft: (group0) API refactoring to avoid raft::server_address Replace raft::server_address in a few raft_group0 API calls with raft::server_id. These API calls do not need raft::server_address, i.e. the address part, anyway, and since going forward raft::server_address will not contain the IP address, stop using it in these calls. This is a beginning of a multi-patch series to reduce raft::server_address usage to core raft only.	2022-10-10 15:58:48 +03:00
Konstantin Osipov	224dd9ce1e	raft: rename group0_upgrade.hh to group0_fwd.hh The plan is to add other group-0-related forward declarations to this file, not just the ones for upgrade.	2022-10-10 15:58:48 +03:00
Konstantin Osipov	e226624daf	raft: (group0) move the code around Move load/store functions for discovered peers up, since going forward they'll be used to in start_server_for_group0(), to extend the address map prior to start (and thus speed up bootstrap).	2022-10-10 15:58:48 +03:00
Konstantin Osipov	199b6d6705	raft: (discovery) persist a list of discovered peers, not a set We plan to reuse the discovery table to store the peers after discovery is over, so load/store API must be generalized to use outside discovery. This includes sending the list of persisted peers over to a new member of the cluster.	2022-10-10 15:58:48 +03:00
Konstantin Osipov	746322b740	raft: (group0) always start group0 using start_server_for_group0() When IP addresses are removed from raft::configuration, it's key to initialize raft_address_map with IP addresses before we start group 0. Best place to put this initialization is start_server_for_group0(), so make sure all paths which create group 0 use start_server_for_group0().	2022-10-10 15:58:48 +03:00
Kamil Braun	4974a31510	test/topology_raft_disabled: more Raft upgrade tests The tests are checking the upgrade procedure and recovery from failure in scenarios like when a node fails causing the procedure to get stuck or when we lose a majority in a fully upgraded cluster. Added some new functionalities to `ScyllaRESTAPIClient` like injecting errors and obtaining gossip generation numbers.	2022-10-10 14:32:10 +02:00
Pavel Emelyanov	caed12c8f2	system_keyspace: Add .shutdown() method Many services out there have one (sometimes called .drain()) that's called early on stop and that's responsible for prearing the service for stop -- aborting pending/in-flight fibers and alike. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-10-10 15:29:33 +03:00
Kamil Braun	4460b4e63c	test/topology_raft_disabled: refactor `test_raft_upgrade` Take reusable parts out of the test to helper functions.	2022-10-10 12:59:12 +02:00
Kamil Braun	fa8dcb0d54	test/pylib: scylla_cluster: pass a list of ignored nodes to removenode The `removenode` operation normally requires the removing node to contact every node in the cluster except the one that is being removed. But if more than 1 node is down it's possible to specify a list of nodes to ignore for the operation; the `/storage_service/remove_node` endpoint accepts an `ignore_nodes` param which is a comma-separated list of IPs. Extend `ScyllaRESTAPIClient`, `ScyllaClusterManager` and `ManagerClient` so it's possible to pass the list of ignored nodes. We also modify the `/cluster/remove-node` Manager endpoint to use `put_json` instead of `get_text` and pass all parameters except the initiator IP (the IP of the node who coordinates the `removenode` operation) through JSON. This simplifies the URL greatly (it was already messy with 3 parameters) and more closely resembles Scylla's endpoint.	2022-10-10 12:59:12 +02:00
Kamil Braun	130ab1d312	test/pylib: rest_client: propagate errors from put_json	2022-10-10 12:59:12 +02:00
Kamil Braun	63892326d5	test/pylib: fix some type hints	2022-10-10 12:59:12 +02:00
Kamil Braun	6e3fe13fcf	test/pylib: scylla_cluster: don't create and drop keyspaces to check if cql is up Do a simple `SELECT` instead. This speeds up tests - creating and dropping keyspaces is relatively expensive, and we did this on every server restart.	2022-10-10 12:59:12 +02:00
Alejo Sanchez	7e2a3f2040	test.py: improve pylint score for conftest Remove unused imports, fix long lines, add ignore flags. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2022-10-10 12:07:41 +02:00
Alejo Sanchez	aa1f4a321c	test.py: fix variable name collision with ssl Change variable name to avoid collision with module ssl. This bug was reintroduced when moving code. Signed-off-by: Alejo Sanchez <alejo.sanchez@scylladb.com>	2022-10-10 11:59:13 +02:00
Tomasz Grabiec	fcf0628bc5	dbuild: Use .gdbinit from the host Useful when starting gdb inside the dbuild container. Message-Id: <20221007154230.1936584-1-tgrabiec@scylladb.com>	2022-10-09 11:14:33 +03:00
Petr Gusev	0923cb435f	raft: mark removed servers as expiring instead of dropping them There is a flaw in how the raft rpc endpoints are currently managed. The io_fiber in raft::server is supposed to first add new servers to rpc, then send all the messages and then remove the servers which have been excluded from the configuration. The problem is that the send_messages function isn't synchronous, it schedules send_append_entries to run after all the current requests to the target server, which can happen after we have already removed the server from address_map. In this patch the remove_server function is changed to mark the server_id as expiring rather than synchronously dropping it. This means all currently scheduled requests to that server will still be able to resolve the ip address for that server_id. Fixes: #11228 Closes #11748	2022-10-07 19:08:34 +02:00
Kamil Braun	06b87869ba	Merge 'Raft transport error' from Gusev Petr The `add_entry` and `modify_config` methods sometimes do an rpc to execute the request on the current leader. If the tcp connection was broken, a `seastar::rpc::closed_error` would be thrown to the client. This exception was not documented in the method comments and the client could have missed handling it. For example, this exception was not handled when calling `modify_config` in `raft_group0`, which sometimes broke the `removenode` command. An `intermittent_connection_error` exception was added earlier to solve a similar problem with the `read_barrier` method. In this patch it is renamed to `transport_error`, as it seems to better describe the situation, and an explicit specification for this exception was added - the rpc implementation can throw it if it is not known whether the call reached the destination and whether any mutations were made. In case of `read_barrier` it does not matter and we just retry, in case of `add_entry` and `modify_config` we cannot retry because of possible mutations, so we convert this exception to `commit_status_unknown`, which the client has to handle. Explicit comments have also been added to `raft::server` methods describing all possible exceptions. Closes #11691 * github.com:scylladb/scylladb: raft_group0: retry modify_config on commit_status_unknown raft: convert raft::transport_error to raft::commit_status_unknown	2022-10-07 15:53:22 +02:00
Petr Gusev	12bb8b7c8d	raft_group0: retry modify_config on commit_status_unknown modify_config can throw commit_status_unknown in case of a leader change or when the leader is unavailable, but the information about it has not yet reached the current node. In this patch modify_config is run again after some time in this case.	2022-10-07 13:34:23 +04:00
Petr Gusev	d79fbab682	raft: convert raft::transport_error to raft::commit_status_unknown The add_entry and modify_config methods sometimes do an rpc to execute the request on the current leader. If the tcp connection was broken, a seastar::rpc::closed_error would be thrown to the client. This exception was not documented in the method comments and the client could have missed handling it. For example, this exception was not handled when calling modify_config in raft_group0, which sometimes broke the removenode command. An intermittent_connection_error exception was added earlier to solve a similar problem with the read_barrier method. In this patch it is renamed to transport_error, as it seems to better describe the situation, and an explicit specification for this exception was added - the rpc implementation can throw it if it is not known whether the call reached the target node and whether any actions were performed on it. In case of read_barrier it does not matter and we just retry. In case of add_entry and modify_config we cannot retry because the rpc calls are not idempotent, so we convert this exception to commit_status_unknown, which the client has to handle. Explicit comments have also been added to raft::server methods describing all possible exceptions.	2022-10-07 13:34:16 +04:00
Botond Dénes	b247f29881	Merge 'De-static system_keyspace::get_{saved\|local}_tokens()' from Pavel Emelyanov Yet another user of global qctx object. Making the method(s) non-static requires pushing the system_keyspace all the way down to size_estimate_virtual_reader and a small update of the cql_test_env Closes #11738 * github.com:scylladb/scylladb: system_keyspace: Make get_{local\|saved}_tokens non static size_estimates_virtual_reader: Pass sys_ks argument to get_local_ranges() cql_test_env: Keep sharded<system_keyspace> reference size_estimate_virtual_reader: Keep system_keyspace reference system_keyspace: Pass sys_ks argument to install_virtual_readers() system_keyspace: Make make() non-static distributed_loader: Pass sys_ks argument to init_system_keyspace() system_keyspace: Remove dangling forward declaration	2022-10-07 11:28:32 +03:00
Botond Dénes	992afc5b8c	Merge 'storage_proxy: coroutinize some functions with do_with' from Avi Kivity do_with() is a sure indicator for coroutinization, since it adds an allocation (like the coroutine does with its frame). Therefore translating a function with do_with is at least a break-even, and usually a win since other continuations no longer allocate. This series converts most of storage_proxy's function that have do_with to coroutines. Two remain, since they are not simple to convert (the do_with() is kept running in the background and its future is discarded). Individual patches favor minimal changes over final readability, and there is a final patch that restores indentation. The patches leave some moves from coroutine reference parameters to the coroutine frame, this will be cleaned up in a follow-up. I wanted this series not to touch headers to reduce rebuild times. Closes #11683 * github.com:scylladb/scylladb: storage_proxy: reindent after coroutinization storage_proxy: convert handle_read_digest() to a coroutine storage_proxy: convert handle_read_mutation_data() to a coroutine storage_proxy: convert handle_read_data() to a coroutine storage_proxy: convert handle_write() to a coroutine storage_proxy: convert handle_counter_mutation() to a coroutine storage_proxy: convert query_nonsingular_mutations_locally() to a coroutine	2022-10-07 07:37:37 +03:00
Nadav Har'El	72dbce8d46	docs, alternator: mention S3 Import feature in compatibility.md In August 2022, DynamoDB added a "S3 Import" feature, which we don't yet support - so let's document this missing feature in the compatibility document. Refs #11739. Signed-off-by: Nadav Har'El <nyh@scylladb.com> Closes #11740	2022-10-06 19:50:16 +03:00
Avi Kivity	20bad62562	Merge 'Detect and record large collections' from Benny Halevy This series adds support for detecting collections that have too many items and recording them in `system.large_cells`. A configuration variable was added to db/config: `compaction_collection_items_count_warning_threshold` set by default to 10000. Collections that have more items than this threshold will be warned about and will be recorded as a large cell in the `system.large_cells` table. Documentation has been updated respectively. A new column was added to system.large_cells: `collection_items`. Similar to the `rows` column in system.large_partition, `collection_items` holds the number of items in a collection when the large cell is a collection, or 0 if it isn't. Note that the collection may be recorded in system.large_cells either due to its size, like any other cell, and/or due to the number of items in it, if it cross the said threshold. Note that #11449 called for a new system.large_collections table, but extending system.large_cells follows the logic of system.large_partitions is a smaller change overall, hence it was preferred. Since the system keyspace schema is hard coded, the schema version of system.large_cells was bumped, and since the change is not backward compatible, we added a cluster feature - `LARGE_COLLECTION_DETECTION` - to enable using it. The large_data_handler large cell detection record function will populate the new column only when the new cluster feature is enabled. In addition, unit tests were added in sstable_3_x_test for testing large cells detection by cell size, and large_collection detection by the number of items. Closes #11449 Closes #11674 * github.com:scylladb/scylladb: sstables: mx/writer: optimize large data stats members order sstables: mx/writer: keep large data stats entry as members db: large_data_handler: dynamically update config thresholds utils/updateable_value: add transforming_value_updater db/large_data_handler: cql_table_large_data_handler: record large_collections db/large_data_handler: pass ref to feature_service to cql_table_large_data_handler db/large_data_handler: cql_table_large_data_handler: move ctor out of line docs: large-rows-large-cells-tables: fix typos db/system_keyspace: add collection_elements column to system.large_cells gms/feature_service: add large_collection_detection cluster feature test: sstable_3_x_test: add test_sstable_too_many_collection_elements test: lib: simple_schema: add support for optional collection column test: lib: simple_schema: build schema in ctor body test: lib: simple_schema: cql: define s1 as static only if built this way db/large_data_handler: maybe_record_large_cells: consider collection_elements db/large_data_handler: debug cql_table_large_data_handler::delete_large_data_entries sstables: mx/writer: pass collection_elements to writer::maybe_record_large_cells sstables: mx/writer: add large_data_type::elements_in_collection db/large_data_handler: get the collection_elements_count_threshold db/config: add compaction_collection_elements_count_warning_threshold test: sstable_3_x_test: add test_sstable_write_large_cell test: sstable_3_x_test: pass cell_threshold_bytes to large_data_handler test: sstable_3_x_test: large_data_handler: prepare callback for testing large_cells test: sstable_3_x_test: large_data tests: use BOOST_REQUIRE_[GL]T test: sstable_3_x_test: test_sstable_log_too_many_rows: use tests::random	2022-10-06 18:28:21 +03:00
Avi Kivity	62a4d2d92b	Merge 'Preliminary changes for multiple Compaction Groups' from Raphael "Raph" Carvalho What's contained in this series: - Refactored compaction tests (and utilities) for integration with multiple groups - The idea is to write a new class of tests that will stress multiple groups, whereas the existing ones will still stress a single group. - Fixed a problem when cloning compound sstable set (cannot be triggered today so I didn't open a GH issue) - Many changes in replica::table for allowing integration with multiple groups Next: - Introduce for_each_compaction_group() for iterating over groups wherever needed. - Use for_each_compaction_group() in replica::table operations spanning all groups (API, readers, etc). - Decouple backlog tracker from compaction strategy, to allow for backlog isolation across groups - Introduce static option for defining number of compaction groups and implement function to map a token to its respective group. - Testing infrastructure for multiple compaction groups (helpful when testing the dynamic behavior: i.e. merging / splitting). Closes #11592 * github.com:scylladb/scylladb: sstable_resharding_test: Switch to table_for_tests replica: Move compacted_undeleted_sstables into compaction group replica: Use correct compaction_group in try_flush_memtable_to_sstable() replica: Make move_sstables_from_staging() robust and compaction group friendly test: Rename column_family_for_tests to table_for_tests sstable_compaction_test: Use column_family_for_tests::as_table_state() instead test: Don't expose compound set in column_family_for_tests test: Implement column_family_for_tests::table_state::is_auto_compaction_disabled_by_user() sstable_compaction_test: Merge table_state_for_test into column_family_for_tests sstable_compaction_test: use table_state_for_test itself in fully_expired_sstables() sstable_compaction_test: Switch to table_state in compact_sstables() sstable_compaction_test: Reduce boilerplate by switching to column_family_for_tests	2022-10-06 18:23:47 +03:00
Kamil Braun	f94d547719	test.py: include modes in log file name Instead of `test.py.log`, use: `test.py.dev.log` when running with `--mode dev`, `test.py.dev-release.log` when running with `--mode dev --mode release`, and so on. This is useful in Jenkins which is running test.py multiple times in different modes; a later run would overwrite a previous run's test.py file. With this change we can preserve the test.py files of all of these runs. Closes #11678	2022-10-06 18:20:39 +03:00
Kamil Braun	3af68052c4	test/topology: disable flaky `test_remove_node_add_column` test The test was added recently and since then causes CI failures. We suspect that it happens if the node being removed was the Raft group 0 leader. The removenode coordinator tries to send to it the `remove_from_group0` request and fails. A potential fix is in review: #11691.	2022-10-06 17:04:42 +02:00
Pavel Emelyanov	59da903054	system_keyspace: Make get_{local\|saved}_tokens non static Now all callers have system_keyspace reference at hand. This removes one more user of the global qctx object Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-10-06 18:02:09 +03:00
Pavel Emelyanov	b03f1e7b17	size_estimates_virtual_reader: Pass sys_ks argument to get_local_ranges() This method static calls system_keyspace::get_local_tokens(). Having the system_keyspace reference will make this method non-static Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-10-06 18:00:09 +03:00
Pavel Emelyanov	4c099bb3ed	cql_test_env: Keep sharded<system_keyspace> reference There's a test_get_local_ranges() call in size-estimate reader which will need system keyspace reference. There's no other place for tests to get it from but the cql_test_env thing Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-10-06 17:59:21 +03:00
Pavel Emelyanov	34e8e5959f	size_estimate_virtual_reader: Keep system_keyspace reference The s._e._v._reader::fill_buffer() method needs system keyspace to get node's local tokens. Now it's a static method, having system_keyspace reference will make it non-static Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-10-06 17:58:07 +03:00
Pavel Emelyanov	04552f2d58	system_keyspace: Pass sys_ks argument to install_virtual_readers() The size-estimate-virtual-reader will need it, now it's available as "this" from system_keyspace::make() method Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-10-06 17:57:13 +03:00
Pavel Emelyanov	1938412d7a	system_keyspace: Make make() non-static This helper needs system_keyspace reference and using "this" as this looks natural. Also this de-static-ification makes it possible to put some sense into the invoke_on_all() call from init_system_keyspace() Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2022-10-06 17:56:11 +03:00

1 2 3 4 5 ...

33393 Commits