Patryk Jędrzejczak b0eef50b2e raft topology: make left_token_ring a transition state

A node can be in the `left_token_ring` state after:
- a finished decommission,
- a failed bootstrap,
- a failed replace.

When a node is in the `left_token_ring` state, we don't know how
it has ended up in this state. We cannot distinguish a node that
has finished decommissioning from a node that has failed bootstrap.

The main problem it causes is that we incorrectly send the
`barrier_and_drain` command to a node that has failed
bootstrapping or replacing. We must do it for a node that has
finished decommissioning because it could still coordinate
requests. However, since we cannot distinguish nodes in the
`left_token_ring` state, we must send the command to all of them.
This issue appeared in scylladb/scylladb#16797 and this patch is
a follow-up that fixes it.

The solution is changing `left_token_ring` from a node state
to a transition state.
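
The distinction this enables can be sketched roughly as follows. This is a simplified, hypothetical model and not ScyllaDB's actual code: the `topology_view` struct, the enum members, and the `needs_barrier_and_drain` helper are assumptions made up for illustration. Only the names `left_token_ring`, `decommissioning`, and `barrier_and_drain` come from this message.

```cpp
#include <cassert>
#include <optional>

// Hypothetical, simplified model; the real topology state machine has more
// states and lives in different types.
enum class node_state { bootstrapping, replacing, decommissioning, normal, left };
enum class transition_state { left_token_ring /* other states omitted */ };

struct topology_view {
    node_state node;                         // state of the node being worked on
    std::optional<transition_state> tstate;  // ongoing topology transition, if any
};

// Because left_token_ring is a transition state, the node keeps its original
// node state while leaving the ring, so the two cases can be told apart:
// only a node that finished decommissioning may still coordinate requests.
inline bool needs_barrier_and_drain(const topology_view& t) {
    return t.tstate == transition_state::left_token_ring
        && t.node == node_state::decommissioning;
}
```

Under this model, a node that failed bootstrap or replace enters the same transition state with its node state still `bootstrapping` or `replacing`, so the helper returns false for it.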

Regarding implementation, most of the changes are simple
refactoring. The less obvious ones are:
- Before this patch, in `system_keyspace::left_topology_state`, we
had to keep the ignored nodes' IDs for replace to ensure that the
replacing node will have access to them after moving to the
`left_token_ring` state, which happens when replace fails. We
don't need this workaround anymore. When we enter the new
`left_token_ring` transition state, the new node will still be in
the `replacing` state, so it won't lose its request param.
- Before this patch, a decommissioning node lost its tokens
while moving to the `left_token_ring` state. After the patch, it
loses tokens while still being in the `decommissioning` state. We
ensure that all `decommissioning` handlers correctly handle a node
that lost its tokens.

Moving the `left_token_ring` handler from `handle_node_transition`
to `handle_topology_transition` created a large diff. There are
only three changes:
- adding `auto node = get_node_to_work_on(std::move(guard));`,
- adding `builder.del_transition_state()`,
- changing the error logged when `global_token_metadata_barrier` fails.

2024-01-29 10:39:07 +01:00

.github

.git: do not apply codespell to licenses

2024-01-26 09:39:27 +02:00

alternator

alternator: allow empty tag value

2024-01-23 11:26:08 +02:00

api

Merge 'Add maintenance mode' from Mikołaj Grzebieluch

2024-01-26 11:02:34 +01:00

auth

service/maintenance_mode: move maintenance_socket_enabled definition to seperate file

2024-01-25 15:27:53 +01:00

bin

tools: add cqlsh shortcut

2023-07-12 09:36:59 +03:00

cdc

cdc: not include unused headers

2024-01-11 09:13:37 +02:00

cmake

build: cmake: use # for line comment

2024-01-03 15:05:00 +02:00

compaction

compaction/compaction_manager: perform_cleanup(): hold the compaction gate

2024-01-25 14:52:50 +01:00

conf

Merge 'Add maintenance socket' from Mikołaj Grzebieluch

2023-12-20 19:04:40 +02:00

cql3

cql3/type_json.cc: move stringstream content instead of copying it

2024-01-26 09:41:09 +02:00

data_dictionary

keyspace_metadata: Drop vector-of-schemas argument from new_keyspace()

2023-12-26 13:00:44 +03:00

raft topology: make left_token_ring a transition state

2024-01-29 10:39:07 +01:00

debug

…

dht

dht: do not include unused headers

2024-01-21 16:56:16 +02:00

direct_failure_detector

direct_failure_detector: Avoid throwing exceptions in the success path

2023-03-31 12:40:43 +02:00

dist

scylla_util.py: wait for apt operation on other processes

2023-12-28 19:00:36 +02:00

docs

doc: document nodetool resetlocalschema

2024-01-28 21:09:02 +01:00

exceptions

Typos: fix typos in code

2023-12-05 15:18:11 +02:00

gms

Merge 'Add more logging for gossiper::lock_endpoint and storage_service::handle_state_normal' from Kamil Braun

2024-01-12 10:51:21 +02:00

idl

storage_service: topology request: drop explicit shutdown rpc

2024-01-16 17:02:54 +02:00

index

Merge 'scylla-sstable: add support for loading schema of views and indexes' from Botond Dénes

2024-01-24 23:36:54 +02:00

interface

Typos: fix typos in comments

2023-12-02 22:37:22 +02:00

lang

Typos: fix typos in code

2023-12-05 15:18:11 +02:00

licenses

scripts: remove git-archive-all

2023-03-29 18:59:23 +03:00

locator

Merge 'tablets: Add support for removenode and replace handling' from Tomasz Grabiec

2024-01-25 14:49:43 +02:00

message

message_service: add sanity check that rpc connections are not created in the maintenance mode

2024-01-25 15:27:53 +01:00

mutation

mutation/mutation.hh: remove anonymous namespace from header

2024-01-26 08:38:39 +01:00

mutation_writer

mutation_writer: do not include unused headers

2024-01-24 15:20:02 +02:00

node_ops

token_metadata: drop the template

2023-12-12 23:19:54 +04:00

raft

server, raft_group0_client: remove the default nullptr values

2024-01-05 18:45:50 +01:00

readers

readers/multishard: evictable_reader::fast_forward_to(): close reader on exception

2024-01-15 20:55:55 +01:00

redis

cql3: Add feature service to ks_prop_defs::as_ks_metadata()

2024-01-15 13:12:12 +03:00

reloc

…

repair

repair: do not include unused headers

2024-01-26 13:12:38 +02:00

replica

Merge ' db: commitlog_replayer: ignore mutations affected by (tablet) cleanups ' from Michał Chojnowski

2024-01-25 20:51:03 +02:00

rust

rust: update dependencies

2023-12-17 13:20:25 +02:00

schema

schema: provide method to get sharder, iff it is static

2024-01-23 22:20:59 +02:00

scripts

Typos: fix typos in code

2023-12-13 10:45:21 +02:00

seastar @ 85359b2866

Update seastar submodule

2024-01-22 11:29:50 +01:00

service

raft topology: make left_token_ring a transition state

2024-01-29 10:39:07 +01:00

sstables

sstable/storage: use fs::path to represent _dir and _temp_dir

2024-01-26 09:54:41 +02:00

streaming

Merge 'tablets: Add support for removenode and replace handling' from Tomasz Grabiec

2024-01-25 14:49:43 +02:00

swagger-ui @ 12f1da1082

…

tasks

tasks: don't keep internal root tasks after they complete

2024-01-09 13:13:54 +01:00

test

raft topology: make left_token_ring a transition state

2024-01-29 10:39:07 +01:00

thrift

keyspace_metadata: Carry optional<initial_tablets> on board

2023-12-25 15:58:05 +03:00

tools

Merge 'scylla-sstable: add support for loading schema of views and indexes' from Botond Dénes

2024-01-24 23:36:54 +02:00

tracing

tracing: do not include unused headers

2024-01-23 08:57:11 +02:00

transport

service/maintenance_mode: move maintenance_socket_enabled definition to seperate file

2024-01-25 15:27:53 +01:00

types

utils: do not include unused headers

2024-01-18 12:50:06 +02:00

unified

Update unified/build_unified.sh

2023-12-05 15:23:38 +02:00

utils

Merge 'Remove anonymous namespaces from headers' from Patryk Wróbel

2024-01-26 13:20:17 +02:00

.dockerignore

…

.gitattributes

…

.gitignore

docs: download iam csv files

2023-10-02 12:28:56 +03:00

.gitmodules

…

.gitorderfile

…

.mailmap

…

absl-flat_hash_map.cc

…

absl-flat_hash_map.hh

…

amplify.yml

…

backlog_controller.hh

treewide: apply codespell to the comments in source code

2023-12-20 10:25:03 +02:00

build_mode.hh

release: correct a typo in comment

2023-03-29 13:42:38 +03:00

bytes_ostream.hh

utils/managed_bytes, serializer: add conversion between buffer_view<bytes_ostream> and managed_bytes_view

2023-05-07 17:17:34 +03:00

bytes.cc

bytes: implement formatting helpers using formatter

2023-03-27 20:06:45 +08:00

bytes.hh

bytes.hh: correct spelling of delimiter and delimited

2023-12-18 20:46:21 +02:00

cache_flat_mutation_reader.hh

cache_flat_mutation_reader: fix a broken iterator validity guarantee in ensure_population_lower_bound()

2023-11-16 19:01:18 +01:00

cache_temperature.hh

…

cartesian_product.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

cell_locking.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

checked-file-impl.hh

code: Switch to seastar API level 7

2023-06-06 13:29:16 +03:00

client_data.cc

…

client_data.hh

…

clocks-impl.cc

clocks-impl: format time_point using fmt

2023-11-22 17:44:07 +02:00

clocks-impl.hh

…

clustering_bounds_comparator.hh

…

clustering_interval_set.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

clustering_key_filter.hh

…

clustering_ranges_walker.hh

…

CMakeLists.txt

build: cmake: add "mode_list" target

2023-12-24 12:35:02 +08:00

collection_mutation.cc

…

collection_mutation.hh

…

column_computation.hh

Typos: fix typos in code

2023-12-05 15:18:11 +02:00

combine.hh

…

compound_compat.hh

compound_compat: do not format an sstring with {:d}

2023-07-08 15:13:11 +03:00

compound.hh

Typos: fix typos in code

2023-12-05 15:18:11 +02:00

compress.cc

./: not include unused headers

2024-01-17 16:30:14 +02:00

compress.hh

./: not include unused headers

2024-01-17 16:30:14 +02:00

concrete_types.hh

make timestamp string format cassandra compatible

2023-07-27 12:01:09 +03:00

configure.py

test: boost: add commitlog_cleanup_test

2024-01-24 10:37:39 +01:00

CONTRIBUTING.md

…

converting_mutation_partition_applier.cc

…

converting_mutation_partition_applier.hh

…

counters.cc

counters: move fmt::formatter<counter_{shard,cell}_view>::format() to .cc

2023-05-24 09:36:49 +03:00

counters.hh

counters: move fmt::formatter<counter_{shard,cell}_view>::format() to .cc

2023-05-24 09:36:49 +03:00

coverage_excludes.txt

test.py: support code coverage

2024-01-18 11:11:34 +02:00

coverage_sources.list

configure.py support coverage profiles on standrad build modes

2024-01-18 11:11:34 +02:00

cql_serialization_format.hh

…

db_clock.hh

db_clock: specialize fmt::formatter<db_clock::time_point>

2023-04-28 15:48:06 +08:00

debug.cc

…

debug.hh

…

default.nix

…

Doxyfile

…

duration.cc

Typos: fix typos in code

2023-12-05 15:18:11 +02:00

duration.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

encoding_stats.hh

encoding_state: mark helper methods protected

2023-08-29 15:41:13 +03:00

enum_set.hh

…

fix_system_distributed_tables.py

…

flake.lock

…

flake.nix

…

frozen_schema.cc

…

frozen_schema.hh

…

full_position.hh

…

gc_clock.hh

…

gdbinit

…

gen_segmented_compress_params.py

Typos: fix typos in code

2023-12-13 10:45:21 +02:00

generic_server.cc

generic_server: use mutable reference in for_each_gently

2023-11-14 14:25:22 +02:00

generic_server.hh

generic_server: use mutable reference in for_each_gently

2023-11-14 14:25:22 +02:00

HACKING.md

commitlog: use separate directory for schema commitlog

2023-03-30 21:55:50 +04:00

hashing_partition_visitor.hh

./: not include unused headers

2024-01-17 16:30:14 +02:00

idl-compiler.py

Typos: fix typos in code

2023-12-13 10:45:21 +02:00

inet_address_vectors.hh

abstract_replication_strategy: calculate_natural_endpoints: make it work with both versions of token_metadata

2023-12-12 23:19:53 +04:00

init.cc

./: not include unused headers

2024-01-17 16:30:14 +02:00

init.hh

Merge 'Typos: fix typos in code' from Yaniv Kaul

2023-12-06 07:36:41 +02:00

install-dependencies.sh

build: add crypto++ to dependencies

2024-01-11 16:26:20 +02:00

install.sh

install.sh: use a temporary file when packaging scylla.yaml

2024-01-01 21:50:29 +02:00

interval.hh

interval: make default ctor and make_open_ended_both_sides constexpr

2023-11-06 18:39:53 +01:00

keys.cc

keys: Move exploded_clustering_prefix's operator<< to keys.cc

2023-07-19 11:57:27 +03:00

keys.hh

keys: do not use zip_iterator for printing key components

2023-07-01 23:49:02 +03:00

LICENSE.AGPL

…

log.hh

…

main.cc

main: Postpone start-up of hint manager

2024-01-26 12:49:40 +01:00

map_difference.hh

./: not include unused headers

2024-01-17 16:30:14 +02:00

marshal_exception.hh

…

multishard_mutation_query.cc

reader_permit: store schema_ptr instead of raw schema pointer

2024-01-11 08:37:56 +02:00

multishard_mutation_query.hh

treewide: apply codespell to the comments in source code

2023-12-20 10:25:03 +02:00

mutation_query.cc

./: not include unused headers

2024-01-17 16:30:14 +02:00

mutation_query.hh

mutation_query: add formatter for reconcilable_result::printer

2023-11-26 20:20:50 +02:00

noexcept_traits.hh

…

NOTICE.txt

…

ORIGIN

…

partition_builder.hh

…

partition_range_compat.hh

./: not include unused headers

2024-01-17 16:30:14 +02:00

partition_slice_builder.cc

partition_slice_builder: add set_specific_ranges()

2023-05-08 07:35:39 -04:00

partition_slice_builder.hh

partition_slice_builder: add set_specific_ranges()

2023-05-08 07:35:39 -04:00

partition_snapshot_reader.hh

…

partition_snapshot_row_cursor.hh

./: not include unused headers

2024-01-17 16:30:14 +02:00

protocol_server.hh

…

querier.cc

Typos: fix typos in code

2023-12-05 15:18:11 +02:00

querier.hh

Typos: fix typos in code

2023-12-05 15:18:11 +02:00

query_id.hh

…

query_ranges_to_vnodes.cc

everywhere: reduce dependencies on i_partitioner.hh

2023-11-05 20:47:44 +02:00

query_ranges_to_vnodes.hh

everywhere: reduce dependencies on i_partitioner.hh

2023-11-05 20:47:44 +02:00

query_result_merger.hh

…

query-request.hh

./: not include unused headers

2024-01-17 16:30:14 +02:00

query-result-reader.hh

./: not include unused headers

2024-01-17 16:30:14 +02:00

query-result-set.cc

./: not include unused headers

2024-01-17 16:30:14 +02:00

query-result-set.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

query-result-writer.hh

…

query-result.hh

treewide: do not mark return value const if this has no effect

2023-11-17 17:46:19 +08:00

query.cc

treewide: use #include <seastar/...> for seastar headers

2023-06-06 08:36:09 +03:00

range.hh

…

read_context.hh

compact and remove expired rows from cache on read

2023-06-26 15:29:01 +02:00

reader_concurrency_semaphore.cc

reader_concurrency_semaphore: add name of semaphore in tracing messages

2024-01-23 10:25:34 +01:00

reader_concurrency_semaphore.hh

reader_permit: store schema_ptr instead of raw schema pointer

2024-01-11 08:37:56 +02:00

reader_permit.hh

reader_permit: store schema_ptr instead of raw schema pointer

2024-01-11 08:37:56 +02:00

README.md

…

real_dirty_memory_accounter.hh

real_dirty_memory_accounter: document what the class is doing

2023-05-23 09:11:31 +03:00

release.cc

…

release.hh

…

reversibly_mergeable.hh

…

row_cache.cc

Merge 'row_cache: abort on exteral_updater::execute errors' from Benny Halevy

2023-10-31 10:07:01 +02:00

row_cache.hh

Merge 'row_cache: abort on exteral_updater::execute errors' from Benny Halevy

2023-10-31 10:07:01 +02:00

schema_mutations.cc

schema_mutations, migration_manager: Ignore empty partitions in per-table digest

2023-07-03 23:06:55 +02:00

schema_mutations.hh

schema_mutations, migration_manager: Ignore empty partitions in per-table digest

2023-07-03 23:06:55 +02:00

schema_upgrader.hh

…

scylla_post_install.sh

dist: drop legacy control group parameters

2023-12-11 19:38:28 +09:00

scylla-gdb.py

reader_permit: store schema_ptr instead of raw schema pointer

2024-01-11 08:37:56 +02:00

SCYLLA-VERSION-GEN

Typos: fix typos in code

2023-12-13 10:45:21 +02:00

seastarx.hh

…

serialization_visitors.hh

…

serializer_impl.hh

./: not include unused headers

2024-01-17 16:30:14 +02:00

serializer.cc

utils/managed_bytes, serializer: add conversion between buffer_view<bytes_ostream> and managed_bytes_view

2023-05-07 17:17:34 +03:00

serializer.hh

./: not include unused headers

2024-01-17 16:30:14 +02:00

service_permit.hh

…

setup.py

…

shell.nix

…

sstables_loader.cc

sstables_loader: load_new_sstables: auto-enable load-and-stream for tablets

2024-01-16 18:43:52 +02:00

sstables_loader.hh

…

supervisor.hh

…

table_helper.cc

keyspace_metadata: Add default value for new_keyspace's durable_writes

2023-12-26 11:47:37 +03:00

table_helper.hh

Typos: fix typos in code

2023-12-05 15:18:11 +02:00

test.py

test.py: s/defalt/default/

2024-01-25 16:54:07 +02:00

timeout_config.cc

./: not include unused headers

2024-01-17 16:30:14 +02:00

timeout_config.hh

./: not include unused headers

2024-01-17 16:30:14 +02:00

timestamp.hh

…

tombstone_gc_extension.hh

…

tombstone_gc_options.cc

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

tombstone_gc_options.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

tombstone_gc.cc

./: not include unused headers

2024-01-17 16:30:14 +02:00

tombstone_gc.hh

./: not include unused headers

2024-01-17 16:30:14 +02:00

tox.ini

…

ubsan-suppressions.supp

…

unimplemented.cc

unimplemented: add format_as() for unimplemented::cause

2024-01-19 08:38:30 +02:00

unimplemented.hh

./: not include unused headers

2024-01-17 16:30:14 +02:00

validation.cc

…

validation.hh

…

version.hh

treewide: use defaulted operator!=() and operator==()

2023-04-27 10:24:46 +03:00

view_info.hh

everywhere: reduce dependencies on i_partitioner.hh

2023-11-05 20:47:44 +02:00

vint-serialization.cc

./: not include unused headers

2024-01-17 16:30:14 +02:00

vint-serialization.hh

Typos: fix typos in code

2023-12-05 15:18:11 +02:00

zstd.cc

./: not include unused headers

2024-01-17 16:30:14 +02:00

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring very recent versions of the C++20 compiler and of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain. This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

- Developer documentation for more information on building Scylla.
- Build documentation on how to build Scylla binaries, tests, and packages.
- Docker image build documentation for information on how to build Docker images.

Running Scylla

To start the Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode option is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its APIs - CQL and Thrift. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The community forum and Slack channel are for users to discuss configuration, management, and operations of open-source ScyllaDB.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.7%

Python 26.1%

CMake 0.3%

GAP 0.3%

Shell 0.3%