Avi Kivity dab56b82fa Merge 'Per-partition rate limiting' from Piotr Dulikowski

Due to its sharded and token-based architecture, Scylla works best when the user workload is more or less uniformly balanced across all nodes and shards. A common case in which this assumption breaks is the "hot partition": a single partition suddenly starts receiving far more reads and writes than other partitions. Because the shards owning the partition have only a fraction of the total cluster capacity, this quickly causes latency problems for other partitions within the same shard and vnode.

This PR introduces a per-partition rate limiting feature. Users can now apply per-partition limits to tables of their choice using a schema extension:

```
ALTER TABLE ks.tbl
WITH per_partition_rate_limit = {
	'max_writes_per_second': 100,
	'max_reads_per_second': 200
};
```

Reads and writes that are detected to exceed that quota are rejected with a new RATE_LIMIT_ERROR CQL error code. Existing error codes did not fit the rate limit error well, so a new one was added. The code is implemented as part of a CQL protocol extension and is returned only to clients that requested the extension; other clients receive the existing CONFIG_ERROR instead.
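The fallback described above can be sketched as a small selection function. This is an illustration only: the function name and the boolean parameter are hypothetical, and the error codes are represented by their names rather than by wire values, which are not given here.

```python
# Error codes are modeled by name only; their numeric wire values are not
# stated in this description, so none are assumed here.
RATE_LIMIT_ERROR = "RATE_LIMIT_ERROR"
CONFIG_ERROR = "CONFIG_ERROR"


def error_code_for_client(negotiated_rate_limit_extension: bool) -> str:
    """Pick the error code reported to a client for a rate-limited request.

    Hypothetical helper: clients that requested the protocol extension get
    the new code, all others get the pre-existing CONFIG_ERROR.
    """
    if negotiated_rate_limit_extension:
        return RATE_LIMIT_ERROR
    return CONFIG_ERROR
```

A driver that never asks for the extension therefore keeps seeing an error code it already understands.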

Limits are tracked and enforced on the replica side. If a write fails with some replicas reporting that the rate limit was reached, the rate limit error is propagated to the client. Additionally, the following optimization is implemented: if the coordinator shard/node is also a replica, the operation is counted against the rate limit early, and if the limit is exceeded an error is returned before any messages are sent to other replicas.
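The coordinator-side shortcut can be pictured with a toy model. The sketch below uses a simple fixed one-second window counter per partition, which is an assumption chosen for brevity and not the data structure used by the PR. All class and function names are hypothetical.

```python
from typing import Callable, Dict, Hashable, Optional, Tuple


class RateLimitExceeded(Exception):
    """Raised when an operation would exceed the per-partition limit."""


class FixedWindowLimiter:
    """Counts operations per partition in one-second windows (illustrative)."""

    def __init__(self, max_ops_per_second: int) -> None:
        self.max_ops_per_second = max_ops_per_second
        # partition key -> (window start in whole seconds, ops in that window)
        self._windows: Dict[Hashable, Tuple[int, int]] = {}

    def account(self, partition_key: Hashable, now: float) -> None:
        """Count one operation, or raise if the current window is full."""
        window = int(now)
        start, count = self._windows.get(partition_key, (window, 0))
        if start != window:
            # A new second has begun: start counting from zero again.
            start, count = window, 0
        if count >= self.max_ops_per_second:
            raise RateLimitExceeded(partition_key)
        self._windows[partition_key] = (start, count + 1)


def coordinate_write(
    partition_key: Hashable,
    now: float,
    local_limiter: Optional[FixedWindowLimiter],
    send_to_replicas: Callable[[Hashable], None],
) -> None:
    """Model of the early check; local_limiter is None when the
    coordinator is not itself a replica for the partition."""
    if local_limiter is not None:
        # Account locally first; on rejection nothing is sent to other replicas.
        local_limiter.account(partition_key, now)
    send_to_replicas(partition_key)
```

In this model a rejected write never reaches `send_to_replicas`, which mirrors the point of the optimization: an over-limit request costs the cluster no inter-node messages when the coordinator can decide on its own.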

The PR covers regular, non-batch writes and single-partition reads. LWT and counters are not covered here.

Results of `perf_simple_query --smp=1 --operations-per-shard=1000000`:

- Write mode:
  ```
  8f690fdd47 (PR base):
  129644.11 tps ( 56.2 allocs/op,  13.2 tasks/op,   49785 insns/op)
  This PR:
  125564.01 tps ( 56.2 allocs/op,  13.2 tasks/op,   49825 insns/op)
  ```
- Read mode:
  ```
  8f690fdd47 (PR base):
  150026.63 tps ( 63.1 allocs/op,  12.1 tasks/op,   42806 insns/op)
  This PR:
  151043.00 tps ( 63.1 allocs/op,  12.1 tasks/op,   43075 insns/op)
  ```
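To put the figures above in relative terms, the following sketch computes the percentage change between the PR base and this PR. The helper name is hypothetical, and the input numbers are copied from the results above.

```python
def percent_change(base: float, new: float) -> float:
    """Relative change from base to new, in percent."""
    return (new - base) / base * 100.0


# Throughput (tps) and instructions per operation, PR base vs. this PR.
write_tps = percent_change(129644.11, 125564.01)
read_tps = percent_change(150026.63, 151043.00)
write_insns = percent_change(49785, 49825)
read_insns = percent_change(42806, 43075)

print(f"write tps:      {write_tps:+.2f}%")
print(f"read tps:       {read_tps:+.2f}%")
print(f"write insns/op: {write_insns:+.2f}%")
print(f"read insns/op:  {read_insns:+.2f}%")
```

By this calculation write throughput is about 3% lower and read throughput under 1% higher, while instructions per operation change by less than 1% in both modes. The results above do not say how much run-to-run noise to expect, so small differences should be read with caution.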

Manual upgrade test:
- Start 3 nodes, 4 shards each, Scylla version 8f690fdd47
- Create a keyspace with scylla-bench, RF=3
- Start reading and writing with scylla-bench with CL=QUORUM
- Manually upgrade nodes one by one to the version from this PR
- Upgrade succeeded: apart from a small number of operations that failed while each node was being taken down, all reads and writes succeeded
- Successfully altered the scylla-bench table to have a read and write limit and those limits were enforced as expected

Fixes: #4703

Closes #9810

* github.com:scylladb/scylla:
  storage_proxy: metrics for per-partition rate limiting of reads
  storage_proxy: metrics for per-partition rate limiting of writes
  database: add stats for per partition rate limiting
  tests: add per_partition_rate_limit_test
  config: add add_per_partition_rate_limit_extension function for testing
  cf_prop_defs: guard per-partition rate limit with a feature
  query-request: add allow_limit flag
  storage_proxy: add allow rate limit flag to get_read_executor
  storage_proxy: resultize return type of get_read_executor
  storage_proxy: add per partition rate limit info to read RPC
  storage_proxy: add per partition rate limit info to query_result_local(_digest)
  storage_proxy: add allow rate limit flag to mutate/mutate_result
  storage_proxy: add allow rate limit flag to mutate_internal
  storage_proxy: add allow rate limit flag to mutate_begin
  storage_proxy: choose the right per partition rate limit info in write handler
  storage_proxy: resultize return types of write handler creation path
  storage_proxy: add per partition rate limit to mutation_holders
  storage_proxy: add per partition rate limit info to write RPC
  storage_proxy: add per partition rate limit info to mutate_locally
  database: apply per-partition rate limiting for reads/writes
  database: move and rename: classify_query -> classify_request
  schema: add per_partition_rate_limit schema extension
  db: add rate_limiter
  storage_proxy: propagate rate_limit_exception through read RPC
  gms: add TYPED_ERRORS_IN_READ_RPC cluster feature
  storage_proxy: pass rate_limit_exception through write RPC
  replica: add rate_limit_exception and a simple serialization framework
  docs: design doc for per-partition rate limiting
  transport: add rate_limit_error

2022-06-24 01:32:13 +03:00

.github

docs: disable link checker

2022-05-09 12:45:28 +02:00

abseil @ 9e408e050f

Update abseil submodule

2022-05-22 23:46:33 +03:00

alternator

Merge 'Per-partition rate limiting' from Piotr Dulikowski

2022-06-24 01:32:13 +03:00

api

api: Get rack/datacenter from topology

2022-06-22 11:47:27 +03:00

auth

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

cdc

cdc/log.hh: expose is_log_name()

2022-06-10 10:57:12 +03:00

compaction

Merge "Sanitize compaction manager construction and stopping" from Pavel Emelyanov

2022-06-21 11:58:13 +03:00

conf

conf: update the description of the seeds parameter in scylla.yaml

2022-06-02 18:45:11 +03:00

cql3

cf_prop_defs: guard per-partition rate limit with a feature

2022-06-22 20:16:49 +02:00

data_dictionary

data_dictionary: Introduce user types storage

2022-05-05 09:44:26 +03:00

Merge 'Per-partition rate limiting' from Piotr Dulikowski

2022-06-24 01:32:13 +03:00

debug

…

dht

range_streamer: Get rack/datacenter from topology

2022-06-22 11:47:26 +03:00

direct_failure_detector

treewide: replace parallel_for_each with coroutine::parallel_for_each in coroutines

2022-05-31 09:06:24 +03:00

dist

scylla_cpuset_setup: stop deleting perftune.yaml and skip update cpuset.conf when same parameter specified

2022-06-23 10:28:36 +03:00

docs

docs: design doc for per-partition rate limiting

2022-06-22 20:07:58 +02:00

exceptions

replica: add rate_limit_exception and a simple serialization framework

2022-06-22 20:07:58 +02:00

gms

Merge 'Per-partition rate limiting' from Piotr Dulikowski

2022-06-24 01:32:13 +03:00

idl

storage_proxy: add per partition rate limit info to read RPC

2022-06-22 20:16:49 +02:00

index

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

interface

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

lang

hints: Remove snitch dependency

2022-06-22 11:47:26 +03:00

libdeflate @ e7e54eab42

…

licenses

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

locator

topology: Add get_rack/_datacenter methods

2022-06-22 11:47:26 +03:00

message

storage_proxy: add per partition rate limit info to write RPC

2022-06-22 20:16:48 +02:00

mutation_writer

flat_mutation_reader ist tot

2022-05-31 23:42:34 +03:00

raft

raft: server: if add_entry with wait_type::applied successfully returns, ensure state_machine::apply is called for this entry

2022-05-27 12:06:18 +02:00

readers

flat_mutation_reader ist tot

2022-05-31 23:42:34 +03:00

redis

query-request: add allow_limit flag

2022-06-22 20:16:49 +02:00

reloc

…

repair

repair: Get rack/datacenter from topology

2022-06-22 11:47:26 +03:00

replica

Merge 'Per-partition rate limiting' from Piotr Dulikowski

2022-06-24 01:32:13 +03:00

rust

tests: add rust example

2022-05-11 16:49:31 +02:00

scripts

configure.py: speed up and simplify compdb generation

2022-06-15 16:40:52 +03:00

seastar @ ff46af9ae0

Update seastar submodule

2022-06-22 00:39:24 +03:00

service

Merge 'Per-partition rate limiting' from Piotr Dulikowski

2022-06-24 01:32:13 +03:00

sstables

sstable_set: Fix partitioned_sstable_set constructor

2022-06-21 11:58:13 +03:00

streaming

streaming: Enable auto off strategy compaction trigger for all rbno ops

2022-06-09 17:10:14 +03:00

swagger-ui @ 12f1da1082

…

test

Merge 'Per-partition rate limiting' from Piotr Dulikowski

2022-06-24 01:32:13 +03:00

thrift

query-request: add allow_limit flag

2022-06-22 20:16:49 +02:00

tools

install-dependencies.sh: uprgade node_exporter to 1.3.1

2022-06-23 11:47:13 +03:00

tracing

trace-state: Remove unused fields

2022-06-17 15:02:51 +03:00

transport

transport: add rate_limit_error

2022-06-22 20:07:58 +02:00

types

fix "ninja dev-headers"

2022-05-31 23:42:34 +03:00

unified

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

utils

util/chunked_vector: more complete comment

2022-06-23 10:33:35 +03:00

.dockerignore

…

.gitattributes

…

.gitignore

.gitignore: ignore mypy_cache, the python lint cache

2022-04-19 16:48:47 +03:00

.gitmodules

…

.gitorderfile

…

absl-flat_hash_map.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

absl-flat_hash_map.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

atomic_cell_hash.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

atomic_cell_or_collection.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

atomic_cell.cc

atomic_cell: compare_atomic_cell_for_merge: compare ttl if expiry is equal

2022-03-07 11:05:30 +02:00

atomic_cell.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

backlog_controller.hh

backlog_controller: Generalize scheduling groups

2022-06-16 17:40:19 +03:00

bytes_ostream.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

bytes.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

bytes.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

cache_flat_mutation_reader.hh

flat_mutation_reader ist tot

2022-05-31 23:42:34 +03:00

cache_temperature.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

caching_options.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

caching_options.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

canonical_mutation.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

canonical_mutation.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

cartesian_product.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

cell_locking.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

checked-file-impl.hh

treewide: use system-#include (angle brackets) for seastar

2022-04-26 14:46:42 +03:00

client_data.cc

client_data: Sanitize connection_notifier

2022-02-18 15:02:26 +03:00

client_data.hh

client_data: Sanitize connection_notifier

2022-02-18 15:02:26 +03:00

clocks-impl.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

clocks-impl.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

clustering_bounds_comparator.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

clustering_interval_set.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

clustering_key_filter.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

clustering_ranges_walker.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

CMakeLists.txt

db: add rate_limiter

2022-06-22 20:16:48 +02:00

collection_mutation.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

collection_mutation.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

column_computation.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

combine.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

compatible_ring_position.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

compound_compat.hh

compound_compat.hh: add missing methods of iterator

2022-03-08 15:37:03 +02:00

compound.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

compress.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

compress.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

concrete_types.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

configure.py

tests: add per_partition_rate_limit_test

2022-06-22 20:16:49 +02:00

CONTRIBUTING.md

docs/contribute/CONTRIBUTING.md: add reference to review checklist:

2022-06-16 10:29:26 +03:00

converting_mutation_partition_applier.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

converting_mutation_partition_applier.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

counters.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

counters.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

cql_serialization_format.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

db_clock.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

debug.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

default.nix

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

digest_algorithm.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

digester.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

dirty_memory_manager.hh

table: clear: serialize with ongoing flush

2022-04-25 18:57:07 +03:00

Doxyfile

…

duration.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

duration.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

encoding_stats.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

enum_set.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

fix_system_distributed_tables.py

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

frozen_mutation.cc

frozen_mutation: add unfreeze_gently

2022-05-05 13:32:25 +03:00

frozen_mutation.hh

messaging: forward-declare types in messaging_service.hh

2022-06-09 15:52:12 +03:00

frozen_schema.cc

frozen_schema: avoid allocating contiguous memory

2022-02-21 01:39:02 +01:00

frozen_schema.hh

frozen_schema: avoid allocating contiguous memory

2022-02-21 01:39:02 +01:00

gc_clock.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

gdbinit

docs: debugging.md: add a sample gdbinit file

2022-05-11 10:23:08 +03:00

gen_segmented_compress_params.py

treewide: clean up stray license blurbs

2022-02-13 14:16:16 +02:00

generic_server.cc

generic_server: Gentle iterator

2022-02-18 14:25:08 +03:00

generic_server.hh

generic_server.hh: add missing include

2022-04-04 17:31:55 +03:00

HACKING.md

docs: update theme 1.2.1

2022-04-03 13:45:07 +03:00

hashers.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

hashers.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

hashing_partition_visitor.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

hashing.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

idl-compiler.py

ser: use vector_deserializer by default for all idl vectors

2022-05-18 19:24:18 +03:00

inet_address_vectors.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

init.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

init.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

install-dependencies.sh

install-dependencies.sh: uprgade node_exporter to 1.3.1

2022-06-23 11:47:13 +03:00

install.sh

install.sh: install files with correct permission in strict umask setting

2022-06-20 17:52:03 +03:00

interval.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

intrusive_set_external_comparator.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

keys.cc

replica, partition_snapshot_reader, keys: replace boost::any with std::any

2022-04-28 07:18:53 +03:00

keys.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

LICENSE.AGPL

…

log.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

main.cc

Merge 'Per-partition rate limiting' from Piotr Dulikowski

2022-06-24 01:32:13 +03:00

map_difference.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

marshal_exception.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

multishard_mutation_query.cc

multishard_mutation_query: do_query: couroutinize save_readers lambda

2022-06-08 09:31:17 +03:00

multishard_mutation_query.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_cleaner.hh

memtable: Subtract from flushed memory when cleaning

2022-06-15 11:30:25 +02:00

mutation_compactor.hh

compacting_reader: Drop irrelevant tombstones

2022-06-15 11:30:01 +02:00

mutation_consumer_concepts.hh

introduce the MutationConsumer concept

2022-02-28 17:11:54 +02:00

mutation_fragment_fwd.hh

flat_mutation_reader: Split readers by file and remove unnecessary includes.

2022-03-14 13:20:25 +02:00

mutation_fragment_stream_validator.hh

mutation_fragment_stream_validator: validate range tombstone changes

2022-03-29 13:19:05 +03:00

mutation_fragment_v2.hh

mutation_fragment_v2: range_tombstone_change: add minimal_memory_usage()

2022-04-28 14:11:51 +03:00

mutation_fragment.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_fragment.hh

mutation_fragment: add a "from deletable_row" constructor to clustering_row

2022-06-20 15:45:19 +02:00

mutation_partition_serializer.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_partition_serializer.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_partition_view.cc

ser: use vector_deserializer by default for all idl vectors

2022-05-18 19:24:18 +03:00

mutation_partition_view.hh

mutation_partition_view: add accept_gently methods

2022-05-05 13:32:25 +03:00

mutation_partition_visitor.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_partition.cc

mutation_fragment: pass the applied row by reference in clustering_row::apply()

2022-06-20 15:22:17 +02:00

mutation_partition.hh

mutation_fragment: pass the applied row by reference in clustering_row::apply()

2022-06-20 15:22:17 +02:00

mutation_query.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_query.hh

query: coroutinize to_data_query_result

2022-05-05 13:32:25 +03:00

mutation_rebuilder.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation_source_metadata.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

mutation.cc

test: mutation: Compare against compacted mutations

2022-06-15 11:30:01 +02:00

mutation.hh

test: mutation: Compare against compacted mutations

2022-06-15 11:30:01 +02:00

noexcept_traits.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

NOTICE.txt

…

ORIGIN

…

partition_builder.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

partition_range_compat.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

partition_slice_builder.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

partition_slice_builder.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

partition_snapshot_reader.hh

fix "ninja dev-headers"

2022-05-31 23:42:34 +03:00

partition_snapshot_row_cursor.hh

partition_snapshot_row_cursor: construct the clustering_row directly in row()

2022-06-20 15:45:19 +02:00

partition_version_list.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

partition_version.cc

mvcc: Introduce apply_resume to hold state for partition version merging

2022-06-15 11:30:01 +02:00

partition_version.hh

mvcc: Introduce apply_resume to hold state for partition version merging

2022-06-15 11:30:01 +02:00

position_in_partition.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

protocol_server.hh

compile: Fix headers so that *-headers targets compile cleanly.

2022-03-25 16:19:26 +02:00

querier.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

querier.hh

mutation_reader: move mutation source into readers/

2022-03-30 15:42:51 +03:00

query_class_config.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

query_ranges_to_vnodes.cc

storage_proxy: extract query_ranges_to_vnodes_generator to a separate file

2022-02-01 21:14:41 +01:00

query_ranges_to_vnodes.hh

storage_proxy: extract query_ranges_to_vnodes_generator to a separate file

2022-02-01 21:14:41 +01:00

query_result_merger.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

query-request.hh

query-request: add allow_limit flag

2022-06-22 20:16:49 +02:00

query-result-reader.hh

ser: use vector_deserializer by default for all idl vectors

2022-05-18 19:24:18 +03:00

query-result-set.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

query-result-set.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

query-result-writer.hh

query_result_builder: remove v1 support

2022-03-11 09:24:17 +02:00

query-result.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

query.cc

query: do not assert in operator<<(ostream&, const forward_result::printer&)

2022-03-09 14:58:11 +01:00

range_tombstone_assembler.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

range_tombstone_change_generator.hh

range_tombstone_change_generator: flush(): add end_of_range

2022-04-21 14:37:10 +03:00

range_tombstone_list.cc

range_tombstone_list: insert_from: correct rev.update range_tombstone in not overlapping case

2022-04-04 22:26:29 +02:00

range_tombstone_list.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

range_tombstone_splitter.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

range_tombstone.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

range_tombstone.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

range.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

read_context.hh

row_cache: update reader implementations to v2

2022-04-21 14:57:04 +03:00

reader_concurrency_semaphore.cc

treewide: fix compilation issues with fmtlib 8.1.0+

2022-03-16 12:31:50 +03:00

reader_concurrency_semaphore.hh

flat_mutation_reader: Split readers by file and remove unnecessary includes.

2022-03-14 13:20:25 +02:00

reader_permit.hh

evicatble_reader: avoid preemption pitfall around waiting for readmission

2022-03-15 14:37:22 +02:00

README.md

README.md: update link to docker build instructions

2021-09-01 11:50:11 +03:00

real_dirty_memory_accounter.hh

memtable: move to replica module and namespace

2022-02-23 09:05:16 +02:00

release.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

release.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

reversibly_mergeable.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

row_cache.cc

memtable: Add counters for tombstone compaction

2022-06-15 11:30:25 +02:00

row_cache.hh

row_cache: update reader implementations to v2

2022-04-21 14:57:04 +03:00

schema_builder.hh

schema: add per_partition_rate_limit schema extension

2022-06-22 20:16:48 +02:00

schema_fwd.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

schema_mutations.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

schema_mutations.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

schema_registry.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

schema_registry.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

schema_upgrader.hh

compile: Fix headers so that *-headers targets compile cleanly.

2022-03-25 16:19:26 +02:00

schema.cc

schema: add per_partition_rate_limit schema extension

2022-06-22 20:16:48 +02:00

schema.hh

schema: add per_partition_rate_limit schema extension

2022-06-22 20:16:48 +02:00

scylla_post_install.sh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

scylla-gdb.py

scylla-gdb: Fix scylla_compaction_tasks

2022-06-23 16:17:31 +03:00

SCYLLA-VERSION-GEN

SCYLLA-VERSION-GEN:set release-version value length

2022-02-21 13:28:04 +02:00

seastarx.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

serialization_visitors.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

serializer_impl.hh

serializer_impl: add vector_deserializer

2022-05-18 19:10:13 +03:00

serializer.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

serializer.hh

code: Convert is_integral assertions to concepts

2022-02-24 19:44:29 +03:00

service_permit.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

setup.py

…

shell.nix

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

sstables_loader.cc

sstable_set: Fix partitioned_sstable_set constructor

2022-06-21 11:58:13 +03:00

sstables_loader.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

supervisor.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

table_helper.cc

treewide: replace parallel_for_each with coroutine::parallel_for_each in coroutines

2022-05-31 09:06:24 +03:00

table_helper.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

test.py

tests: Introduce optional RNG seed for boost suite

2022-06-20 07:19:08 +03:00

timeout_config.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

timeout_config.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

timestamp.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

to_string.hh

to_string.hh: include <map>

2022-02-17 08:53:48 +02:00

tombstone_gc_extension.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

tombstone_gc_options.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

tombstone_gc_options.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

tombstone_gc.cc

gms: feature_service: remove variable/helper function duplication

2022-05-04 18:59:56 +03:00

tombstone_gc.hh

Merge "tools: cut schema loader free of replica::database" from Botond

2022-03-27 17:01:05 +03:00

tombstone.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

tox.ini

…

types.cc

types: time_point_to_string: harden against out of range timestamps

2022-06-21 08:08:57 +03:00

types.hh

Merge 'cql: Add proper validation for null and unset inside collections send as bound values' from Jan Ciołek

2022-05-19 11:25:24 +03:00

ubsan-suppressions.supp

…

unimplemented.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

unimplemented.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

validation.cc

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

validation.hh

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

version.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

view_info.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

vint-serialization.cc

treewide: remove empty comments in top-of-files

2022-05-13 07:11:58 +02:00

vint-serialization.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

xx_hasher.hh

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

zstd.cc

treewide: use Software Package Data Exchange (SPDX) license identifiers

2022-01-18 12:15:18 +01:00

README.md

Scylla

What is Scylla?

Scylla is the real-time big data database that is API-compatible with Apache Cassandra and Amazon DynamoDB. Scylla embraces a shared-nothing approach that increases throughput and storage capacity to realize order-of-magnitude performance improvements and reduce hardware costs.

For more information, please see the ScyllaDB web site.

Build Prerequisites

Scylla is fairly fussy about its build environment, requiring very recent versions of the C++20 compiler and of many libraries to build. The document HACKING.md includes detailed information on building and developing Scylla, but to get Scylla building quickly on (almost) any build machine, Scylla offers a frozen toolchain. This is a pre-configured Docker image which includes recent versions of all the required compilers, libraries and build tools. Using the frozen toolchain allows you to avoid changing anything in your build machine to meet Scylla's requirements - you just need to meet the frozen toolchain's prerequisites (mostly, Docker or Podman being available).

Building Scylla

Building Scylla with the frozen toolchain dbuild is as easy as:

```
$ git submodule update --init --force --recursive
$ ./tools/toolchain/dbuild ./configure.py
$ ./tools/toolchain/dbuild ninja build/release/scylla
```

For further information, please see:

- Developer documentation for more information on building Scylla.
- Build documentation on how to build Scylla binaries, tests, and packages.
- Docker image build documentation for information on how to build Docker images.

Running Scylla

To start the Scylla server, run:

```
$ ./tools/toolchain/dbuild ./build/release/scylla --workdir tmp --smp 1 --developer-mode 1
```

This will start a Scylla node with one CPU core allocated to it and data files stored in the tmp directory. The --developer-mode option is needed to disable the various checks Scylla performs at startup to ensure the machine is configured for maximum performance (not relevant on development workstations). Please note that you need to run Scylla with dbuild if you built it with the frozen toolchain.

For more run options, run:

```
$ ./tools/toolchain/dbuild ./build/release/scylla --help
```

Testing

See test.py manual.

Scylla APIs and compatibility

By default, Scylla is compatible with Apache Cassandra and its APIs - CQL and Thrift. There is also support for the API of Amazon DynamoDB™, which needs to be enabled and configured in order to be used. For more information on how to enable the DynamoDB™ API in Scylla, and the current compatibility of this feature as well as Scylla-specific extensions, see Alternator and Getting started with Alternator.

Documentation

Documentation can be found here. Seastar documentation can be found here. User documentation can be found here.

Training

Training material and online courses can be found at Scylla University. The courses are free, self-paced and include hands-on examples. They cover a variety of topics including Scylla data modeling, administration, architecture, basic NoSQL concepts, using drivers for application development, Scylla setup, failover, compactions, multi-datacenters and how Scylla integrates with third-party applications.

Contributing to Scylla

If you want to report a bug or submit a pull request or a patch, please read the contribution guidelines.

If you are a developer working on Scylla, please read the developer guidelines.

Contact

The users mailing list and Slack channel are for users to discuss the configuration, management, and operation of open-source ScyllaDB.
The developers mailing list is for developers and people interested in following the development of ScyllaDB to discuss technical topics.

Languages

C++ 72.7%

Python 26.1%

CMake 0.3%

GAP 0.3%

Shell 0.3%