Go to file

Botond Dénes 56cc7bbeec Merge 'Allow "global" snapshot using topology coordinator + add tablet metadata to manifest' from Calle Wilund

Refs: SCYLLADB-193

Adds a "snapshot_table" topology operation and associated data structure/table columns to support dispatching a snapshot operation as a topo coordinator op.

Logic is similar, and thus broken out and semi-shared with, truncation.

Also adds optional tablet metadata to manifest, listing all tablets present in a given snapshot, as well as
tablet sstable ownership, repair status, and token ranges.

As per description in SCYLLADB-193, the alternative snapshot mechanism is in
a separate namespace under 'tablets', which while dubious is the desired destination.

The API is accessed via `nodetool cluster snapshot`, which more or less mirrors `nodetool snapshot`, but using topo op.

TTL is added to message propagation as a separate patch here, since it is not (yet) used from API (or nodetool).
Requires a syntax for both API and command line.

Closes scylladb/scylladb#28525

* github.com:scylladb/scylladb:
  topology::snapshot: Add expiry (ttl) to RPC/topo op
  test_snapshot_with_tablets: Extend test to check manifest content
  table::manifest: Add tablet info to manifest.json
  test::test_snapshot_with_tablets: Add small test for topo coordinated snapshot
  scylla-nodetool: Add "cluster snapshot" command
  api::storage_service: Add tablets/snapshots command for cluster level snapshot
  db::snapshot-ctl: Add method to do snapshot using topo coordinator
  storage_proxy: Add snapshot_keyspace method
  topology_coordinator: Add handler for snapshot_tables
  storage_proxy: Add handler for SNAPSHOT_WITH_TABLETS
  messaging_service: Add SNAPSHOT_WITH_TABLETS verb
  feature_service: Add SNAPSHOT_AS_TOPOLOGY_OPERATION feature
  topology_mutation: Add setter for snapshot part of row
  system_keyspace::topology_requests_entry: Add snapshot info to table
  topology_state_machine: Add snapshot_tables operation
  topology_coordinator: Break out logic from handle_truncate_table
  storage_proxy: Break out logic from request_truncate_with_tablets
  test/object_store: Remove create_ks_and_cf() helper
  test/object_store: Replace create_ks_and_cf() usage with standard methods
  test/object_store: Shift indentation right for test cases

2026-02-25 10:17:53 +02:00

.github

.github/workflows: add SMI to milestone sync Jira project keys

2026-02-18 09:35:37 +02:00

abseil @ d7aaad83b4

…

alternator

locator: fix get_secondary_replica() to match get_primary_replica()

2026-02-23 16:19:30 +02:00

api

api::storage_service: Add tablets/snapshots command for cluster level snapshot

2026-02-23 11:37:16 +01:00

audit

audit: replace batch dynamic_cast with static_cast

2026-01-26 18:14:38 +01:00

auth

auth: cache: fix permissions iterator invalidation in reload_all_permissions

2026-02-23 12:14:22 +01:00

bin

…

cdc

topology: disable force-gossip-topology-changes option

2026-02-02 09:56:32 +01:00

cmake

build: drop -fexperimental-assignment-tracking clang option

2025-12-22 14:33:48 +02:00

compaction

treewide: fix some spelling errors

2025-12-29 13:53:56 +01:00

conf

Merge 'db/config: enable table audit by default' from Piotr Smaron

2026-02-19 16:30:11 +01:00

cql3

Merge 'vector_search: return NaN for similarity_cosine with all-zero vectors' from Dawid Pawlik

2026-02-23 13:10:44 +01:00

data_dictionary

data_dictionary: table: add get_truncation_time()

2025-12-02 14:21:25 +02:00

Merge 'Allow "global" snapshot using topology coordinator + add tablet metadata to manifest' from Calle Wilund

2026-02-25 10:17:53 +02:00

debug

…

dht

Add precompiled headers to CMakeLists.txt

2025-11-21 12:27:41 +02:00

dist

Merge 'docs: fix link to docker build readme in the README.MD' from Marcin Szopa

2026-02-20 08:21:46 +02:00

docs

doc: remove reduntant Java-related information

2026-02-24 14:37:39 +01:00

ent

gcp: Add handling of 429 (too many requests) to exponential backoff

2026-02-19 09:42:39 +01:00

exceptions

exceptions.hh: fix message argument passing

2025-08-13 13:39:52 +02:00

gms

feature_service: Add SNAPSHOT_AS_TOPOLOGY_OPERATION feature

2026-02-23 10:44:41 +01:00

idl

topology::snapshot: Add expiry (ttl) to RPC/topo op

2026-02-23 11:37:17 +01:00

index

vector_search: allow full secondary indexes syntax while creating the vector index

2026-01-30 01:14:31 +02:00

keys

api/storage_service: add GET 'natural_endpoints' v2 to support composite keys with ':'

2025-10-01 15:53:25 +02:00

lang

lua: avoid undefined behavior when converting doubles to integers

2026-02-24 10:41:21 +02:00

licenses

utils: license: import crypt_sha512.c from musl to the project

2025-12-10 15:36:18 +01:00

locator

locator: fix get_secondary_replica() to match get_primary_replica()

2026-02-23 16:19:30 +02:00

message

messaging_service: Add SNAPSHOT_WITH_TABLETS verb

2026-02-23 10:44:42 +01:00

mutation

mutation_compactor: Fix tombstone GC metrics to account for only expired

2026-02-20 10:43:58 +02:00

mutation_writer

Add precompiled headers to CMakeLists.txt

2025-11-21 12:27:41 +02:00

node_ops

node_ops: task_manager_module: Populate entity field also for active requests

2026-01-18 15:36:06 +01:00

pgo

Update pgo profiles - aarch64

2026-02-15 05:22:17 +02:00

query

code: Replace distributed<> with sharded<>

2025-09-19 12:22:51 +02:00

raft

raft: Describe exception types for wait_for_state_change and wait_for_leader

2026-02-19 12:47:14 +01:00

readers

reader_permit: remove check_abort()

2026-01-13 10:47:57 +02:00

reloc

…

repair

service: pass topology guard to RBNO

2026-01-20 10:06:34 +01:00

replica

table::manifest: Add tablet info to manifest.json

2026-02-23 11:37:17 +01:00

rust

build: apply sccache to rust builds too

2025-12-22 15:36:15 +02:00

schema

Merge 'schema: Apply sstable_compression_user_table_options to CQL aux and Alternator tables' from Nikos Dragazis

2026-01-22 06:50:48 +02:00

scripts

open-coredump: Change to use new backtrace

2026-02-05 11:50:47 +02:00

seastar @ d2953d2ad1

Update seastar submodule

2026-02-17 13:13:22 +02:00

service

Merge 'Allow "global" snapshot using topology coordinator + add tablet metadata to manifest' from Calle Wilund

2026-02-25 10:17:53 +02:00

sstables

table::manifest: Add tablet info to manifest.json

2026-02-23 11:37:17 +01:00

streaming

streaming: Release space incrementally during file streaming

2026-02-18 10:10:40 +03:00

swagger-ui @ 12f1da1082

…

tasks

tasks: increase tasks_vt_get_children timeout

2026-02-18 11:39:19 +03:00

test

Merge 'Allow "global" snapshot using topology coordinator + add tablet metadata to manifest' from Calle Wilund

2026-02-25 10:17:53 +02:00

tools

scylla-nodetool: Add "cluster snapshot" command

2026-02-23 11:37:16 +01:00

tracing

Add precompiled headers to CMakeLists.txt

2025-11-21 12:27:41 +02:00

transport

transport: fix connection code to consume only initially taken semaphore units

2026-02-17 17:55:48 +01:00

types

fix rjson::value to bytes conversion with missing GetStringLength call

2025-12-09 19:27:22 +01:00

unified

…

utils

object_storage: add retryable machinery to object storage

2026-02-22 14:00:44 +02:00

vector_search

vector_search/dns: Use newer seastar get_host_by_name API

2026-02-23 21:28:43 +02:00

.clang-format

…

.dockerignore

…

.gitattributes

…

.gitignore

.gitignore: add rust target

2025-08-19 13:09:18 +03:00

.gitmodules

build: replace tools/java submodule with packaged cassandra-stress

2025-04-15 10:11:28 +03:00

.gitorderfile

…

.mailmap

…

absl-flat_hash_map.cc

…

absl-flat_hash_map.hh

…

amplify.yml

…

backlog_controller_fwd.hh

db/config: introduce new config parameter compaction_max_shares

2025-11-24 12:52:29 -03:00

backlog_controller.hh

db/config: introduce new config parameter compaction_max_shares

2025-11-24 12:52:29 -03:00

build_mode.hh

…

bytes_fwd.hh

…

bytes_ostream.hh

treewide: Replace __builtin_expect with (un)likely

2025-07-03 13:34:04 +03:00

bytes.cc

…

bytes.hh

bytes: adapt fmt_hex to std::span<const std::byte>

2025-04-01 00:07:27 +02:00

cartesian_product.hh

…

client_data.cc

…

client_data.hh

service/client_state and alternator/server: use cached values for driver_name and driver_version fields

2025-12-20 12:26:22 -05:00

clocks-impl.cc

treewide: Move mutation related files to a mutation directory

2025-09-24 13:23:38 +03:00

clocks-impl.hh

…

CMakeLists.txt

Add precompiled headers to CMakeLists.txt

2025-11-21 12:27:41 +02:00

configure.py

object_storage: add retryable machinery to object storage

2026-02-22 14:00:44 +02:00

CONTRIBUTING.md

docs: fix typos and spelling errors

2025-09-30 13:16:49 +02:00

coverage_excludes.txt

…

coverage_sources.list

…

db_clock.hh

…

debug.cc

storage_service: Check raft rpc scheduling group from debug namespace

2026-02-03 06:34:03 +02:00

debug.hh

storage_service: Check raft rpc scheduling group from debug namespace

2026-02-03 06:34:03 +02:00

default.nix

…

Doxyfile

…

encoding_stats.hh

treewide: Move mutation related files to a mutation directory

2025-09-24 13:23:38 +03:00

enum_set.hh

auth: add possibilty to check for any permission in set

2025-10-03 16:55:57 +02:00

exported_templates.cc

Add precompiled headers to CMakeLists.txt

2025-11-21 12:27:41 +02:00

exported_templates.hh

Add precompiled headers to CMakeLists.txt

2025-11-21 12:27:41 +02:00

fix_system_distributed_tables.py

…

flake.lock

…

flake.nix

…

gc_clock.hh

…

gdbinit

…

gen_segmented_compress_params.py

compress: move compress.cc/hh to sstables/compressor

2025-07-31 13:10:41 +03:00

HACKING.md

docs: fix typos and spelling errors

2025-09-30 13:16:49 +02:00

hashing_partition_visitor.hh

…

idl-compiler.py

idl-compiler.py: raise TypeError instead of raw str

2026-01-13 08:33:17 +02:00

inet_address_vectors.hh

storage_proxy: handle node_local_only in mutate

2025-07-24 19:48:08 +02:00

init.cc

init: fix infinite loop on npos wrap with updated Seastar

2026-02-17 17:57:13 +00:00

init.hh

Revert "Merge 'Unify configuration of object storage endpoints' from Pavel Emelyanov"

2026-01-05 08:53:41 +02:00

install-dependencies.sh

build: install cassandra-stress RPM with no signature check

2026-02-18 10:08:13 +03:00

install.sh

scripts: fixes flagged by CodeQL/PyLens

2026-01-09 15:13:12 +02:00

LICENSE-ScyllaDB-Source-Available.md

…

main.cc

db::snapshot-ctl: Add method to do snapshot using topo coordinator

2026-02-23 11:27:15 +01:00

marshal_exception.hh

…

mutation_query.cc

…

mutation_query.hh

treewide: Move query related files to a new query directory

2025-09-16 23:40:47 +03:00

NOTICE.txt

PowerPC: remove ppc stuff

2025-07-08 10:38:23 +03:00

ORIGIN

…

partition_builder.hh

mutation: async_utils: add unfreeze_and_split_gently

2025-09-30 17:15:41 +03:00

partition_range_compat.hh

treewide: Move misc files to utils directory

2025-07-21 11:56:40 +03:00

partition_slice_builder.cc

…

partition_slice_builder.hh

treewide: Move query related files to a new query directory

2025-09-16 23:40:47 +03:00

query_ranges_to_vnodes.cc

interval: rename start_ref() back to start() (and end_ref() etc).

2025-06-14 21:26:16 +03:00

query_ranges_to_vnodes.hh

…

reader_concurrency_semaphore_group.cc

reader_concurrency_semaphore: Add preemptive_abort_factor to constructors

2026-01-28 14:20:01 +01:00

reader_concurrency_semaphore_group.hh

reader_concurrency_semaphore: Add preemptive_abort_factor to constructors

2026-01-28 14:20:01 +01:00

reader_concurrency_semaphore.cc

reader_concurrency_semaphore: Check during admission if read may timeout

2026-01-28 14:24:45 +01:00

reader_concurrency_semaphore.hh

reader_concurrency_semaphore: Add preemptive_abort_factor to constructors

2026-01-28 14:20:01 +01:00

reader_permit.hh

permit_reader: Add a new state: preemptive_aborted

2026-01-28 14:20:01 +01:00

README.md

docs: fix link to docker build README.MD

2026-02-18 12:12:46 +01:00

real_dirty_memory_accounter.hh

…

release.cc

release: adjust doc_link() for the post source-available world

2025-09-29 17:02:55 +03:00

release.hh

…

reversibly_mergeable.hh

…

schema_upgrader.hh

treewide: Move mutation related files to a mutation directory

2025-09-24 13:23:38 +03:00

scylla_post_install.sh

…

scylla-gdb.py

scylla-gdb.py: scylla small-objects: make freelist traversal more robust

2025-12-25 13:26:09 +03:00

SCYLLA-VERSION-GEN

Update ScyllaDB version to: 2026.2.0-dev

2026-01-25 11:09:17 +02:00

seastarx.hh

…

serialization_visitors.hh

…

serializer_impl.hh

serializer_impl.hh: add as_input_stream(managed_bytes_view) overload

2025-05-13 10:32:32 +02:00

serializer.cc

…

serializer.hh

treewide: include boost headers as "system" headers

2025-08-22 17:21:24 +03:00

service_permit.hh

…

shell.nix

…

sstable_dict_autotrainer.cc

dictionary compression: add missing co_awaits on get_units

2026-02-18 16:40:40 +01:00

sstable_dict_autotrainer.hh

dict_autotrainer: introduce sstable_dict_autotrainer

2025-04-01 00:07:30 +02:00

sstables_loader.cc

streaming: enable direct download of contained sstables

2026-01-25 13:27:44 +02:00

sstables_loader.hh

streaming: refactor get_sstables_for_tablets to make it accessible

2025-12-08 12:30:23 +02:00

stdafx.cc

Add precompiled headers to CMakeLists.txt

2025-11-21 12:27:41 +02:00

stdafx.hh

code: Stop using seastar::compat::source_location

2025-11-27 19:10:11 +02:00

supervisor.hh

…

table_helper.cc

schema: Allow configuring consistency setting for a keyspace

2025-10-16 13:34:49 +03:00

table_helper.hh

…

test.py

test.py: add cluster tests to be executed by pytest

2026-02-24 09:48:38 +01:00

timeout_config.cc

…

timeout_config.hh

…

tombstone_gc_extension.hh

…

tombstone_gc_options.cc

…

tombstone_gc_options.hh

…

tombstone_gc-internals.hh

treewide: Add missing #pragma once

2025-09-01 14:58:21 +03:00

tombstone_gc.cc

tombstone_gc: don't use 'repair' mode for colocated tables

2025-11-25 09:15:46 +01:00

tombstone_gc.hh

tombstone_gc: don't use 'repair' mode for colocated tables

2025-11-25 09:15:46 +01:00

ubsan-suppressions.supp

…

unimplemented.cc

…

unimplemented.hh

…

validation.cc

treewide: Move keys related files to a new keys directory

2025-07-25 10:45:32 +03:00

validation.hh

…

version.hh

…

view_info.hh

treewide: Move query related files to a new query directory

2025-09-16 23:40:47 +03:00

vint-serialization.cc

treewide: Replace __builtin_expect with (un)likely

2025-07-03 13:34:04 +03:00

vint-serialization.hh

…

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring very recent versions of the C++23 compiler and of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain. This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla

For further information, please see:

Developer documentation for more information on building Scylla.
Build documentation on how to build Scylla binaries, tests, and packages.
Docker image build documentation for information on how to build Docker images.

Running Scylla

To start Scylla server, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

$ ./tools/toolchain/dbuild ./build/release/scylla --help

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its API - CQL. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The community forum and Slack channel are for users to discuss configuration, management, and operations of ScyllaDB.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.7%

Python 26.1%

CMake 0.3%

GAP 0.3%

Shell 0.3%