scylla

Author	SHA1	Message	Date
Pavel Emelyanov	66e43912d6	code: Switch to seastar API level 7 In that level no io_priority_class-es exist. Instead, all the IO happens in the context of current sched-group. File API no longer accepts prio class argument (and makes io_intent arg mandatory to impls). So the change consists of - removing all usage of io_priority_class - patching file_impl's inheritors to updated API - priority manager goes away altogether - IO bandwidth update is performed on respective sched group - tune-up scylla-gdb.py io_queues command The first change is huge and was made semi-automatically by: - grep io_priority_class \| default_priority_class - remove all calls, found methods' args and class' fields Patching file_impl-s is smaller, but also mechanical: - replace io_priority_class& argument with io_intent* one - pass intent to lower file (if applicable) Dropping the priority manager is: - git-rm .cc and .hh - sed out all the #include-s - fix configure.py and cmakefile The scylla-gdb.py update is a bit hairy -- it needs to use task queues list for IO classes names and shares, but to detect whether it should, it checks that the "commitlog" group is present. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com> Closes #13963	2023-06-06 13:29:16 +03:00
Raphael S. Carvalho	156d771101	compaction: Fix sstable cleanup after resharding on refresh Problem can be reproduced easily: 1) wrote some sstables with smp 1 2) shut down scylla 3) moved sstables to upload 4) restarted scylla with smp 2 5) ran refresh (resharding happens, adds sstable to cleanup set and never removes it) 6) cleanup (tries to cleanup resharded sstables which were leaked in the cleanup set) Bumps into assert "Assertion `!sst->is_shared()' failed", as cleanup picks a shared sstable that was leaked and already processed by resharding. Fix is about not inserting shared sstables into cleanup set, as shared sstables are restricted to resharding and cannot be processed later by cleanup (nor should they be, because resharding itself cleaned up its input files). Dtest: https://github.com/scylladb/scylla-dtest/pull/3206 Fixes #14001. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #14147	2023-06-06 12:14:03 +03:00
Piotr Smaroń	5f6491987d	Deregister table's metrics when disposing a table to work around #8627 The metrics that are being deregistered (in this PR) cause Scylla to crash when a table is dropped, but the corresponding table object in memory is not yet deallocated, and a new table with the same name is created. This caused a double-metrics-registration exception to be thrown. In order to avoid it, we are deregistering table's metrics as soon as the table is marked to be disposed from the database. Table's representation in memory can still live, but shouldn't prevent another table with the same name from being created. Fixes #13548 Closes #13971	2023-05-23 18:41:51 +03:00
Botond Dénes	3b424e391b	Merge 'perform_cleanup: wait until all candidates are cleaned up' from Benny Halevy cleanup_compaction should resolve only after all sstables that require cleanup are cleaned up. Since it is possible that some of them are in staging and therefore cannot be cleaned up, retry once a second until they become eligible. Timeout if there is no progress within 5 minutes to prevent hanging due to view building bug. Fixes #9559 Closes #13812 * github.com:scylladb/scylladb: table: signal compaction_manager when staging sstables become eligible for cleanup compaction_manager: perform_cleanup: wait until all candidates are cleaned up compaction_manager: perform_cleanup: perform_offstrategy if needed compaction_manager: perform_cleanup: update_sstables_cleanup_state in advance sstable_set: add for_each_sstable_gently* helpers	2023-05-19 12:35:59 +03:00
Pavel Emelyanov	d7f99d031d	replica: Make global_table_ptr a class Right now all users of global_table know it's a vector and reference its elements with this_shard_id() index. Making the global_table_ptr a class makes it possible to stop using operator[] and "index" this_shard_id() in its -> and * operators. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-05-17 18:14:34 +03:00
Pavel Emelyanov	b4a8843907	replica: Add type alias for vector of foreign lw-pointers This is to convert the global_table_ptr into a class with less bulky patch further Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-05-17 18:14:34 +03:00
Benny Halevy	bb59687116	table: signal compaction_manager when staging sstables become eligible for cleanup perform_cleanup may be waiting for those sstables to become eligible for cleanup so signal it when table::move_sstables_from_staging detects an sstable that requires cleanup. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-05-17 11:33:22 +03:00
Botond Dénes	20ff122a84	Merge 'Delete S3 sstables without the help of deletion log' from Pavel Emelyanov There are two layers of sstables deletion -- delete-atomically and wipe. The former is in fact the "API" method, it's called by table code when the specific sstable(s) are no longer needed. It's called "atomically" because it's expected to fail in the middle in a safe manner so that subsequent boot would pick the dangling parts and proceed. The latter is a low-level removal function that can fail in the middle, but it's not of _its_ care. Currently the atomic deletion is implemented with the help of sstable_directory::delete_atomically() method that commits sstables files names into deletion log, then calls wipe (indirectly), then drops the deletion log. On boot all found deletion logs are replayed. The described functionality is used regardless of the sstable storage type, even for S3, though deletion log is an overkill for S3, where it would be better implemented with the help of ownership table. In fact, S3 storage already implements atomic deletion in its wipe method thus being overly careful. So this PR - makes atomic deletion be storage-specific - makes S3 wipe non-atomic fixes: #13016 note: Replaying sstables deletion from ownership table on boot is not here, see #13024 Closes #13562 * github.com:scylladb/scylladb: sstables: Implement atomic deleter for s3 storage sstables: Get atomic deleter from underlying storage sstables: Move delete_atomically to manager and rename	2023-05-15 08:57:47 +03:00
Pavel Emelyanov	5985f00da9	sstables: Move delete_atomically to manager and rename This is to let manager decide which storage driver to call for atomic sstables deletion in the next patch. While at it -- rename the sstable_directory's method into something more descriptive (to make compiler catch all callers of it). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-05-12 17:52:12 +03:00
Raphael S. Carvalho	3b28c26c77	table: Allow tombstone GC in compaction to be disabled on user request If tombstone GC was disabled, compaction will ensure that fully expired sstables won't be bypassed and that no expired tombstones will be purged. Changing the value takes immediate effect even on ongoing compactions. Not wired into an API yet. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-05-12 10:14:28 -03:00
Botond Dénes	287ccce1cc	Merge 'sstables: extract storage out ' from Kefu Chai this change extracts the storage class and its derived classes out into their own source files. for couple reasons: - for better readability. the sstables.hh is over 1005 lines. and sstables.cc 3602 lines. it's a little bit difficult to figure out how the different parts in these sources interact with each other. for instance, with this change, it's clear some of helper functions are only used by file_system_storage. - probably less inter-source dependency. by extracting the sources files out, they can be compiled individually, so changing one .cc file does not impact others. this could speed up the compilation time. Closes #13785 * github.com:scylladb/scylladb: sstables: storage: coroutinize idempotent_link_file() sstables: extract storage out	2023-05-09 14:03:40 +03:00
Kefu Chai	2eefcb37eb	sstables: extract storage out this change extracts the storage class and its derived classes out into storage.cc and storage.hh. for couple reasons: - for better readability. the sstables.hh is over 1005 lines. and sstables.cc 3602 lines. it's a little bit difficult to figure out how the different parts in these sources interact with each other. for instance, with this change, it's clear some of helper functions are only used by file_system_storage. - probably less inter-source dependency. by extracting the sources files out, they can be compiled individually, so changing one .cc file does not impact others. this could speed up the compilation time. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-05-09 16:47:00 +08:00
Kefu Chai	9b35faf485	treewide: replace generation_type::value() with generation_type::as_int() * replace generation_type::value() with generation_type::as_int() * drop generation_value() because we will switch over to UUID based generation identifier, the member function or the free function generation_value() cannot fulfill the needs anymore. so, in this change, they are consolidated and are replaced by "as_int()", whose name is more specific, and will also work and won't be misleading even after switching to UUID based generation identifier. as `value()` would be confusing by then: it could be an integer or a UUID. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-05-06 18:24:45 +08:00
Botond Dénes	022465d673	Merge 'Tone down offstrategy log message' from Benny Halevy In many cases we trigger offstrategy compaction opportunistically also when there's nothing to do. In this case we still print to the log lots of info-level messages and call `run_offstrategy_compaction` that wastes more cpu cycles on learning that it has nothing to do. This change bails out early if the maintenance set is empty and prints a "Skipping off-strategy compaction" message in debug level instead. Fixes #13466 Also, add a group_id class and return it from compaction_group and table_state. Use that to identify the compaction_group / table_state by "ks_name.cf_name compaction_group=idx/total" in log messages. Fixes #13467 Closes #13520 * github.com:scylladb/scylladb: compaction_manager: print compaction_group id compaction_group, table_state: add group_id member compaction_manager: offstrategy compaction: skip compaction if no candidates are found	2023-05-02 08:05:18 +03:00
Avi Kivity	7b7d9bcb14	Merge 'Do not access owned_ranges_ptr across shards in update_sstable_cleanup_state' from Benny Halevy This series fixes a few issues caused by `f1bbf705f9` (`f1bbf705f9`): - table, compaction_manager: prevent cross shard access to owned_ranges_ptr - Fixes #13631 - distributed_loader: distribute_reshard_jobs: pick one of the sstable shard owners - compaction: make_partition_filter: do not assert shard ownership - allow the filtering reader now used during resharding to process tokens owned by other shards Closes #13635 * github.com:scylladb/scylladb: compaction: make_partition_filter: do not assert shard ownership distributed_loader: distribute_reshard_jobs: pick one of the sstable shard owners table, compaction_manager: prevent cross shard access to owned_ranges_ptr	2023-05-01 22:51:00 +03:00
Kamil Braun	30cc07b40d	Merge 'Introduce tablets' from Tomasz Grabiec This PR introduces an experimental feature called "tablets". Tablets are a way to distribute data in the cluster, which is an alternative to the current vnode-based replication. Vnode-based replication strategy tries to evenly distribute the global token space shared by all tables among nodes and shards. With tablets, the aim is to start from a different side. Divide resources of replica-shard into tablets, with a goal of having a fixed target tablet size, and then assign those tablets to serve fragments of tables (also called tablets). This will allow us to balance the load in a more flexible manner, by moving individual tablets around. Also, unlike with vnode ranges, tablet replicas live on a particular shard on a given node, which will allow us to bind raft groups to tablets. Those goals are not yet achieved with this PR, but it lays the ground for this. Things achieved in this PR: - You can start a cluster and create a keyspace whose tables will use tablet-based replication. This is done by setting `initial_tablets` option: ``` CREATE KEYSPACE test WITH replication = {'class': 'NetworkTopologyStrategy', 'replication_factor': 3, 'initial_tablets': 8}; ``` All tables created in such a keyspace will be tablet-based. Tablet-based replication is a trait, not a separate replication strategy. Tablets don't change the spirit of replication strategy, it just alters the way in which data ownership is managed. In theory, we could use it for other strategies as well like EverywhereReplicationStrategy. Currently, only NetworkTopologyStrategy is augmented to support tablets. 
- You can create and drop tablet-based tables (no DDL language changes) - DML / DQL work with tablet-based tables Replicas for tablet-based tables are chosen from tablet metadata instead of token metadata Things which are not yet implemented: - handling of views, indexes, CDC created on tablet-based tables - sharding is done using the old method, it ignores the shard allocated in tablet metadata - node operations (topology changes, repair, rebuild) are not handling tablet-based tables - not integrated with compaction groups - tablet allocator piggy-backs on tokens to choose replicas. Eventually we want to allocate based on current load, not statically Closes #13387 * github.com:scylladb/scylladb: test: topology: Introduce test_tablets.py raft: Introduce 'raft_server_force_snapshot' error injection locator: network_topology_strategy: Support tablet replication service: Introduce tablet_allocator locator: Introduce tablet_aware_replication_strategy locator: Extract maybe_remove_node_being_replaced() dht: token_metadata: Introduce get_my_id() migration_manager: Send tablet metadata as part of schema pull storage_service: Load tablet metadata when reloading topology state storage_service: Load tablet metadata on boot and from group0 changes db, migration_manager: Notify about tablet metadata changes via migration_listener::on_update_tablet_metadata() migration_notifier: Introduce before_drop_keyspace() migration_manager: Make prepare_keyspace_drop_announcement() return a future<> test: perf: Introduce perf-tablets test: Introduce tablets_test test: lib: Do not override table id in create_table() utils, tablets: Introduce external_memory_usage() db: tablets: Add printers db: tablets: Add persistence layer dht: Use last_token_of_compaction_group() in split_token_range_msb() locator: Introduce tablet_metadata dht: Introduce first_token() dht: Introduce next_token() storage_proxy: Improve trace-level logging locator: token_metadata: Fix confusing comment on ring_range() 
dht, storage_proxy: Abstract token space splitting Revert "query_ranges_to_vnodes_generator: fix for exclusive boundaries" db: Exclude keyspace with per-table replication in get_non_local_strategy_keyspaces_erms() db: Introduce get_non_local_vnode_based_strategy_keyspaces() service: storage_proxy: Avoid copying keyspace name in write handler locator: Introduce per-table replication strategy treewide: Use replication_strategy_ptr as a shorter name for abstract_replication_strategy::ptr_type locator: Introduce effective_replication_map locator: Rename effective_replication_map to vnode_effective_replication_map locator: effective_replication_map: Abstract get_pending_endpoints() db: Propagate feature_service to abstract_replication_strategy::validate_options() db: config: Introduce experimental "TABLETS" feature db: Log replication strategy for debugging purposes db: Log full exception on error in do_parse_schema_tables() db: keyspace: Remove non-const replication strategy getter config: Reformat	2023-04-27 09:40:18 +02:00
Raphael S. Carvalho	59904be5c3	table: Avoid reallocations in make_compaction_groups() Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-04-25 11:14:33 -03:00
Raphael S. Carvalho	2d45dd35c7	table: Remove outdated comment regarding automatic compaction We already provide a way to disable automatic compaction. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-04-25 11:09:45 -03:00
Tomasz Grabiec	9b17ad3771	locator: Introduce per-table replication strategy Will be used by tablet-based replication strategies, for which effective replication map is different per table. Also, this patch adapts existing users of effective replication map to use the per-table effective replication map. For simplicity, every table has an effective replication map, even if the erm is per keyspace. This way the client code can be uniform and doesn't have to check whether replication strategy is per table. Not all users of per-keyspace get_effective_replication_map() are adapted yet to work per-table. Those algorithms will throw an exception when invoked on a keyspace which uses per-table replication strategy.	2023-04-24 10:49:36 +02:00
Benny Halevy	dabf46c37f	compaction_group, table_state: add group_id member To help identify the compaction group / table_state. Ref #13467 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-24 10:06:04 +03:00
Botond Dénes	1750bb34b7	Merge 'sstables, replica: add generation generator' from Kefu Chai this is the first step to the uuid-based generation identifier. the goal is to encapsulate the generation related logic in generator, so its consumers do not have to understand the difference between the int64_t based generation and UUID v1 based generation. this commit should not change the behavior of existing scylla. it just allows us to derive from `generation_generator` so we can have another generator which generates UUID based generation identifier. Closes #13073 * github.com:scylladb/scylladb: replica, test: create generation id using generator sstables: add generation_generator test: sstables: use generate_n for generating ids for testing	2023-04-24 09:31:08 +03:00
Benny Halevy	2f61de8f7b	table, compaction_manager: prevent cross shard access to owned_ranges_ptr Seen after `f1bbf705f9` in debug mode distributed_loader collect_all_shared_sstables copies compaction::owned_ranges_ptr (lw_shared_ptr<const dht::token_range_vector>) across shards. Since update_sstable_cleanup_state is synchronous, it can be passed a const reference to the token_range_vector instead. It is ok to access the memory read-only across shards and since this happens on start-up, there are no special performance requirements. Fixes #13631 Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-23 15:12:13 +03:00
Benny Halevy	c5d819ce60	gms: versioned_value: make members private and provide accessor functions to get them. 1. So they can't be modified by mistake, as the versioned value is immutable. A new value must have a higher version. 2. Before making the version a strong gms::version_type. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-23 08:37:32 +03:00
Kefu Chai	576adbdbc5	replica, test: create generation id using generator reuse generation_generator for generating generation identifiers to reduce repetition. also, allow the generator to update its latest known generation id. Signed-off-by: Kefu Chai <kefu.chai@scylladb.com>	2023-04-21 22:02:30 +08:00
Pavel Emelyanov	837fde84b1	view: Carry data_dictionary arg through standalone helpers There's a bunch of functions in view.{hh\|cc} that don't belong to any class and perform view-related calculations for view updates. Lots of them eventually call view_info::select_statement() which will later need the dictionary. By now all those methods' callers have data dictionary at hand and can share it via argument. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-20 11:17:46 +03:00
Pavel Emelyanov	9d3d533561	view_update_builder: Construct with data dictionary The caller is table with view-update-generator at hand (it calls mutate_MV on). Builder here is used as a temporary object that destroys once the caller coroutine co_return-s, so keeping the database obtained from the view-update-generator is safe. Later the v.u.b. object will propagate its data dictionary down the callstacks. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-20 11:17:38 +03:00
Pavel Emelyanov	4a16ab3bd4	table: Push view_update_generator arg to affected_views() Caller already has it to call mutate_MV() on. The method in question will need the generator in one of the next patches. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-20 10:42:31 +03:00
Raphael S. Carvalho	a47bac931c	Move TWCS option from table into TWCS itself enable_optimized_twcs_queries is specific to TWCS, therefore it belongs to TWCS, not replica::table. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #13489	2023-04-14 08:28:16 +03:00
Botond Dénes	f1bbf705f9	Merge 'Cleanup sstables in resharding and other compaction types' from Benny Halevy This series extends sstable cleanup to resharding and other (offstrategy, major, and regular) compaction types so to: * cleanup uploaded sstables (#11933) * cleanup staging sstables after they are moved back to the main directory and become eligible for compaction (#9559) When perform_cleanup is called, all sstables are scanned, and those that require cleanup are marked as such, and are added for tracking to table_state::cleanup_sstable_set. They are removed from that set once released by compaction. Along with that sstables set, we keep the owned_ranges_ptr used by cleanup in the table_state to allow other compaction types (offstrategy, major, or regular) to cleanup those sstables that are marked as require_cleanup and that were skipped by cleanup compaction for either being in the maintenance set (requiring offstrategy compaction) or in staging. Resharding is using a more straightforward mechanism of passing the owned token ranges when resharding uploaded sstables and using it to detect sstable that require cleanup, now done as piggybacked on resharding compaction. 
Closes #12422 * github.com:scylladb/scylladb: table: discard_sstables: update_sstable_cleanup_state when deleting sstables compaction_manager: compact_sstables: retrieve owned ranges if required sstables: add a printer for shared_sstable compaction_manager: keep owned_ranges_ptr in compaction_state compaction_manager: perform_cleanup: keep sstables in compaction_state::sstables_requiring_cleanup compaction: refactor compaction_state out of compaction_manager compaction: refactor compaction_fwd.hh out of compaction_descriptor.hh compaction_manager: compacting_sstable_registration: keep a ref to the compaction_state compaction_manager: refactor get_candidates compaction_manager: get_candidates: mark as const table, compaction_manager: add requires_cleanup sstable_set: add for_each_sstable_until distributed_loader: reshard: update sstable cleanup state table, compaction_manager: add update_sstable_cleanup_state compaction_manager: needs_cleanup: delete unused schema param compaction_manager: perform_cleanup: disallow empty sorted_owened_ranges distributed_loader: reshard: consider sstables for cleanup distributed_loader: process_upload_dir: pass owned_ranges_ptr to reshard distributed_loader: reshard: add optional owned_ranges_ptr param distributed_loader: reshard: get a ref to table_state distributed_loader: reshard: capture creator by ref distributed_loader: reshard: reserve num_jobs buckets compaction: move owned ranges filtering to base class compaction: move owned_ranges into descriptor	2023-04-11 14:52:29 +03:00
Benny Halevy	96660b2ef7	table: discard_sstables: update_sstable_cleanup_state when deleting sstables We need to remove the deleted sstables from update_sstable_cleanup_state otherwise their data and index files will remain opened and their storage space won't be reclaimed. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:37:56 +03:00
Benny Halevy	6ebafe74b9	table, compaction_manager: add requires_cleanup Returns true iff any of the sstables in the set requires cleanup. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:14:36 +03:00
Benny Halevy	d0690b64c1	table, compaction_manager: add update_sstable_cleanup_state update_sstable_cleanup_state calls needs_cleanup and inserts (or erases) the sstable into the respective compaction_state.sstables_requiring_cleanup set. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-04-10 23:10:55 +03:00
Pavel Emelyanov	4bb885b759	sstable: Make storage instance based on storage options This patch adds storage options lw-ptr to sstables_manager::make_sstable and makes the storage instance creation depend on the options. For local it just creates the filesystem storage instance, for S3 -- throws, but next patch will fix that. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-04-10 16:43:01 +03:00
Botond Dénes	9a02315c6b	Merge 'Compaction reevaluation bug fixes' from Raphael "Raph" Carvalho A problem in compaction reevaluation can cause the SSTable set to be left uncompacted for an indefinite amount of time, potentially causing space and read amplification to be suboptimal. Two reevaluation problems are being fixed, one after off-strategy compaction ended, and another in compaction manager which intends to periodically reevaluate a need for compaction. Fixes https://github.com/scylladb/scylladb/issues/13429. Fixes https://github.com/scylladb/scylladb/issues/13430. Closes #13431 * github.com:scylladb/scylladb: compaction: Make compaction reevaluation actually periodic replica: Reevaluate regular compaction on off-strategy completion	2023-04-05 13:51:21 +03:00
Raphael S. Carvalho	457c772c9c	replica: Make compaction_group responsible for deleting off-strategy compaction input Compaction group is responsible for deleting SSTables of "in-strategy" compactions, i.e. regular, major, cleanup, etc. Both in-strategy and off-strategy compaction have their completion handled using the same compaction group interface, which is compaction_group::table_state::on_compaction_completion(..., sstables::offstrategy offstrategy) So it's important to bring symmetry there, by moving the responsibility of deleting off-strategy input, from manager to group. Another important advantage is that off-strategy deletion is now throttled and gated, allowing for better control, e.g. table waiting for deletion on shutdown. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #13432	2023-04-05 08:37:48 +03:00
Raphael S. Carvalho	2652b41606	replica: Reevaluate regular compaction on off-strategy completion When off-strategy compaction completes, regular compaction is not triggered. If off-strategy output causes the table's SSTable set to not conform to the strategy goal, it means that read and space amplification will be suboptimal until the next compaction kicks in, which can take an indefinite amount of time (e.g. when active memtable is flushed). Let's reevaluate compaction on main SSTable set when off-strategy ends. Fixes #13429. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-04-04 09:16:16 -03:00
Botond Dénes	207dcbb8fa	Merge 'sstables: prepare for uuid-based generation_type' from Benny Halevy Preparing for #10459, this series defines sstables::generation_type::int_t as `int64_t` at the moment and uses that instead of naked `int64_t` variables so it can be changed in the future to hold e.g. a `std::variant<int64_t, sstables::generation_id>`. sstables::new_generation was defined to generate new, unique generations. Currently it is based on incrementing a counter, but it can be extended in the future to manufacture UUIDs. The unit tests are cleaned up in this series to minimize their dependency on numeric generations. Basically, they should be used for loading sstables with hard coded generation numbers stored under `test/resource/sstables`. For all the rest, the tests should use existing mechanisms and those introduced in this series such as generation_factory, sst_factory and smart make_sstable methods in sstable_test_env and table_for_tests to generate new sstables with a unique generation, and use the abstract sst->generation() method to get their generation if needed, without resorting to the actual value it may hold. Closes #12994 * github.com:scylladb/scylladb: everywhere: use sstables::generation_type test: sstable_test_env: use make_new_generation sstable_directory::components_lister::process: fixup indentation sstables: make highest_generation_seen return optional generation replica: table: add make_new_generation function replica: table: move sstable generation related functions out of line test: sstables: use generation_type::int_t sstables: generation_type: define int_t	2023-03-30 17:05:07 +03:00
Botond Dénes	972b24a969	Merge 'Break the proxy -> database -> [views] -> proxy loop' from Pavel Emelyanov ... and drop usage of global storage proxy from several places of mutate_MV(). This is the last dependency loop around storage proxy left as well as the last user of the global storage proxy. The trouble is that while proxy naturally depends on database, the database SUDDENLY requires proxy to push view updates from the guts of database::do_apply(). Similar loop existed in a form of database -> { large_data_handler, compaction manager } -> system keyspace -> database and it was cut in `917fdb9e53` (Cut database-system_keyspace circular dependency) by introducing a soft dependency link from l. d. handler / compaction manager to system keyspace. The similar solution is proposed here. The database instance gets a soft dependency (shared_ptr) to view_update_generator instance. On start the link is nullptr and pushing view updates is not possible until view_updates_generator starts and plugs itself to the database. The plugging happens naturally, because v.u.generator needs proxy as explicit dependency and, thus, can reach database via proxy. This (seems to) work because tables that need view updates don't start being mutated until late enough, as late as v.u.generator starts. As a nice side effect this allows removing a bunch of global storage proxy usages from mutate_MV() which opens a pretty short way towards de-globalizing proxy (after it only qctx, tracing and schema registry will be left). 
Closes #13367 * github.com:scylladb/scylladb: view: Drop global storage_proxy usage from mutate_MV() view: Make mutate_MV() method of view_update_generator table: Carry v.u.generator down to populate_views() table: Carry v.u.generator down to do_push_view_replica_updates() view: Keep v.u.generator shared pointer on view_builder::consumer view: Capture v.u.generator on view_updating_consumer lambda view: Plug view update generator to database view: Add view_builder -> view_update_generator dependency view: Add view_update_generator -> sharded<storage_proxy> dependency	2023-03-30 08:29:29 +03:00
Pavel Emelyanov	7cabdc54a6	view: Make mutate_MV() method of view_update_generator Nowadays it's a static helper, but internally it depends on storage proxy, so it grabs its global instance. Making it a method of view update generator makes it possible to use the proxy dependency from the generator. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-03-29 18:48:14 +03:00
Pavel Emelyanov	e78e64a920	table: Carry v.u.generator down to populate_views() The method is called by view_builder::consumer when building a view and the consumer already has stable dependency reference on the view updates generator. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-03-29 18:48:13 +03:00
Pavel Emelyanov	a95d3446fd	table: Carry v.u.generator down to do_push_view_replica_updates() The latter is the place where mutate_MV is called and it needs the view updates generator nearby. The call-stack starts at database::do_apply(). As was described in one of the previous patches, applying mutations that need updating views happen late enough, so if the view updates generator is not plugged to the database yet, it's OK to bail out with exception. If it's plugged, it's carried over thus keeping the generator instance alive and waited for on its stop. Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-03-29 14:12:01 +03:00
Raphael S. Carvalho	2186a75e9b	compaction: Expose compaction_strategy_state through table_state That will allow compaction_strategy to access the compaction group state through compaction::table_state, which is the interface at which replica talks to the compaction layer. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-03-28 08:48:10 -03:00
Raphael S. Carvalho	ee89ff24f2	replica: Add compaction_strategy_state to compaction group The state is not wired anywhere yet. It will replace the ones stored in compaction strategies themselves, thereby allowing each compaction group to have its own state. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com>	2023-03-27 15:46:14 -03:00
Benny Halevy	a8dc2fda29	sstables: make highest_generation_seen return optional generation It is possible to find no generation in an empty table directory, and in the future, with uuid generations it'd be possible to find no numeric generations in the directory. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-03-22 13:55:23 +02:00
Benny Halevy	ba680a7b96	replica: table: add make_new_generation function make_new_generation generates a new generation from an optional one. If disengaged, it just generates a new generation based on the shard_id. Otherwise, it generates the next generation in sequence by adding smp::count to the previous value, like we do today. In the future, with uuid-based generations, the function could be used to generate a new random uuid based on the optional parameter. It will be up to the caller, e.g. replica::table or sstables manager to decide which kind of generation to create. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-03-22 13:52:22 +02:00
Benny Halevy	b28eacce6f	replica: table: move sstable generation related functions out of line Updating the highest generation happens only during startup, and creating sstables is done rarely enough that there is no reason to inline either function. Signed-off-by: Benny Halevy <bhalevy@scylladb.com>	2023-03-22 13:49:18 +02:00
Botond Dénes	156e5d346d	reader_permit: keep trace_state pointer on permit And propagate it down to where it is created. This will be used to add trace points for semaphore related events, but this will come in the next patches.	2023-03-22 04:58:01 -04:00
Pavel Emelyanov	e882269d93	table: Keep storage options lw-shared-ptr Tables need to know which storage their sstables need to be located at, so class table needs to have its reference of the storage options. The thing can be inherited from the keyspace metadata. Tests sometimes create table without keyspace at hand. For those use default-initialized storage options (which is local storage). Signed-off-by: Pavel Emelyanov <xemul@scylladb.com>	2023-03-16 17:30:45 +03:00
Raphael S. Carvalho	3fae46203d	replica: Fix undefined behavior in table::generate_and_propagate_view_updates() Undefined behavior because the evaluation order is undefined. With GCC, where evaluation is right-to-left, schema will be moved once it's forwarded to make_flat_mutation_reader_from_mutations_v2(). The consequence is that memory tracking of mutation_fragment_v2 (for tracking only permit used by view update), which uses the schema, can be incorrect. However, it's more likely that Scylla will crash when estimating memory usage for row, which accesses schema column information using schema::column_at(), which in turn asserts that the requested column does really exist. Fixes #13093. Signed-off-by: Raphael S. Carvalho <raphaelsc@scylladb.com> Closes #13092	2023-03-08 07:38:55 +02:00
Botond Dénes	1b5f8916d6	Merge 'Generalize sstable::move_to_new_dir() method' from Pavel Emelyanov This method requires callers to remember that the sstable is the collection of files on a filesystem and to know what exact directory they are all in. That's not going to work for object storage, instead, sstable should be moved between more abstract states. This PR replaces move_to_new_dir() call with the change_state() one that accepts target sub-directory string and moves files around. Currently supported state changes: * staging -> normal * upload -> normal \| staging * any -> quarantine All are pretty straightforward and move files between table basedir subdirectories with the exception that upload -> quarantine should move into upload/quarantine subdirectory. Another thing to keep in mind, that normal state doesn't have its subdir but maps directory to table's base directory. Closes #12648 * github.com:scylladb/scylladb: sstable: Remove explicit quarantization call test: Move move_to_new_dir() method from sstable class sstable, dist.-loader: Introduce and use pick_up_from_upload() method sstables, code: Introduce and use change_state() call distributed_loader: Let make_sstables_available choose target directory	2023-03-02 09:22:14 +02:00

298 Commits