valkey

mirror of http://github.com/valkey-io/valkey synced 2024-11-22 18:54:58 +00:00

Author	SHA1	Message	Date
Oran Agra	7d6744c739	fix new cluster tests issues (#9657) Following #9483 the daily CI exposed a few problems. * The cluster creation code (uses redis-cli) is complicated to test with TLS enabled. For now I'm just skipping them, since the tests we run there don't really need that kind of coverage. * Cluster port binding failures: note that `find_available_port` already looks for a free cluster port, but the code in `wait_server_started` couldn't detect the failure of binding (the text it greps for wasn't found in the log).	2021-10-20 15:40:28 +03:00
guybe7	43e736f79b	Treat subcommands as commands (#9504) ## Intro The purpose is to allow having different flags/ACL categories for subcommands (Example: CONFIG GET is ok-loading but CONFIG SET isn't) We create a small command table for every command that has subcommands and each subcommand has its own flags, etc. (same as a "regular" command) This commit also unites the Redis and the Sentinel command tables ## Affected commands CONFIG Used to have "admin ok-loading ok-stale no-script" Changes: 1. Dropped "ok-loading" in all except GET (this doesn't change behavior since there were checks in the code doing that) XINFO Used to have "read-only random" Changes: 1. Dropped "random" in all except CONSUMERS XGROUP Used to have "write use-memory" Changes: 1. Dropped "use-memory" in all except CREATE and CREATECONSUMER COMMAND No changes. MEMORY Used to have "random read-only" Changes: 1. Dropped "random" in PURGE and USAGE ACL Used to have "admin no-script ok-loading ok-stale" Changes: 1. Dropped "admin" in WHOAMI, GENPASS, and CAT LATENCY No changes. MODULE No changes. SLOWLOG Used to have "admin random ok-loading ok-stale" Changes: 1. Dropped "random" in RESET OBJECT Used to have "read-only random" Changes: 1. Dropped "random" in ENCODING and REFCOUNT SCRIPT Used to have "may-replicate no-script" Changes: 1. Dropped "may-replicate" in all except FLUSH and LOAD CLIENT Used to have "admin no-script random ok-loading ok-stale" Changes: 1. Dropped "random" in all except INFO and LIST 2. Dropped "admin" in ID, TRACKING, CACHING, GETREDIR, INFO, SETNAME, GETNAME, and REPLY STRALGO No changes. PUBSUB No changes. CLUSTER Changes: 1. Dropped "admin" in countkeysinslots, getkeysinslot, info, nodes, keyslot, myid, and slots SENTINEL No changes. 
(note that DEBUG also fits, but we decided not to convert it since it's for debugging and anyway undocumented) ## New sub-command This commit adds another element to the per-command output of COMMAND, describing the list of subcommands, if any (in the same structure as "regular" commands) Also, it adds a new subcommand: ``` COMMAND LIST [FILTERBY (MODULE <module-name>|ACLCAT <cat>|PATTERN <pattern>)] ``` which returns a set of all commands (unless filtered), but excluding subcommands. ## Module API A new module API, RM_CreateSubcommand, was added, in order to allow module writers to define subcommands ## ACL changes: 1. Now that each subcommand is actually a command, each has its own ACL id. 2. The old mechanism of allowed_subcommands is redundant (blocking/allowing a subcommand is the same as blocking/allowing a regular command), but we had to keep it, to support the widespread usage of allowed_subcommands to block commands with certain args, that aren't subcommands (e.g. "-select +select|0"). 3. I have renamed allowed_subcommands to allowed_firstargs to emphasize the difference. 4. Because subcommands are commands in ACL too, you can now use "-" to block subcommands (e.g. "+client -client|kill"), which wasn't possible in the past. 5. It is also possible to use the allowed_firstargs mechanism with subcommands. For example: `+config -config|set +config|set|loglevel` will block all CONFIG SET except for setting the log level. 6. All of the ACL changes above required some amount of refactoring. ## Misc 1. There are two approaches: Either each subcommand has its own function or all subcommands use the same function, determining what to do according to argv[0]. For now, I took the former approach only with CONFIG and COMMAND, while other commands use the latter approach (for smaller blamelog diff). 2. Deleted memoryGetKeys: It is no longer needed because MEMORY USAGE now uses the "range" key spec. 4. Bugfix: GETNAME was missing from CLIENT's help message. 5. 
Sentinel and Redis now use the same table, with the same function pointer. Some commands have a different implementation in Sentinel, so we redirect them (these are ROLE, PUBLISH, and INFO). 6. Command stats now show the stats per subcommand (e.g. instead of stats just for "config" you will have stats for "config|set", "config|get", etc.) 7. It is now possible to use COMMAND directly on subcommands: COMMAND INFO CONFIG|GET (The pipe syntax was inspired from ACL, and can be used in functions lookupCommandBySds and lookupCommandByCString) 8. STRALGO is now a container command (has "help") ## Breaking changes: 1. Command stats now show the stats per subcommand (see (6) above)	2021-10-20 11:52:57 +03:00
qetu3790	4962c5526d	Release clients blocked on module commands in cluster resharding and down state (#9483) Prevent clients from being blocked forever in cluster when they block with their own module command and the hash slot is migrated to another master at the same time. These will get a redirection message when unblocked. Also, release clients blocked on module commands when cluster is down (same as other blocked clients) This commit adds basic tests for the main (non-cluster) redis test infra that test the cluster. This was done because the cluster test infra can't handle some common test features, but most importantly we only build the test modules with the non-cluster test suite. Note that rather than really supporting cluster operations by the test infra, it was added (as dup code) in two files, one for module tests and one for non-modules tests, maybe in the future we'll refactor that. Co-authored-by: Oran Agra <oran@redislabs.com>	2021-10-19 11:50:37 +03:00
Bjorn Svensson	c9fabc2ef0	Move config `unixsocketperm` to generic configs (#9607) Since the size of mode_t is platform dependent we handle the `unixsocketperm` configuration as a generic int type. mode_t is either an unsigned int or unsigned short (macOS) and the range limits allow for a simple cast to a mode_t.	2021-10-18 23:58:52 -07:00
Wen Hui	1c2b5f5318	Make Cluster-bus port configurable with new cluster-port config (#9389) Make Cluster-bus port configurable with new cluster-port config	2021-10-18 22:28:27 -07:00
Viktor Söderqvist	b7f2a1a217	Add RedisModule_KeyExists (#9600) The LRU of the key is not touched. Logically expired keys are logically not existing, so they're treated as such.	2021-10-18 22:21:19 +03:00
DarrenJiang13	aa6deff01e	add missed error counting (#9646) * add: add missed error counting in sentinel.c and cluster.c	2021-10-18 15:53:10 +03:00
yoav-steinberg	81095b1bd9	Skip Active-defrag edge case test until we fix it. (#9645) Test started failing consistently in 32bit builds after upgrading to jemalloc 5.2.1 (#9623).	2021-10-18 13:28:52 +03:00
Oran Agra	c4b4b6c06b	Merge pull request #9623 from yoav-steinberg/upgrade_jemalloc_5.2.1 Upgraded to jemalloc 5.2.1 from 5.1.0. Cherry picked all relevant fixes (by diffing our 5.1.0 to upstream 5.1.0 and finding relevant commits). Details of what was done: [cherry-picked] `fd7d51c` 2021-05-03 Resolve nonsense static analysis warnings (Oran Agra) [cherry-picked] `448c435` 2020-09-29 Fix compilation warnings in Lua and jemalloc dependencies (#7785) (YoongHM) [skipped - already in upstream] `9216b96` 2020-09-21 Fix compilation warning in jemalloc's malloc_vsnprintf (#7789) (YoongHM) [cherry-picked] `88d71f4` 2020-05-20 fix a rare active defrag edge case bug leading to stagnation (Oran Agra) [skipped - already in upstream] `2fec7d9` 2019-05-30 Jemalloc: Avoid blocking on background thread lock for stats. [cherry-picked] `920158e` 2018-07-11 Active defrag fixes for 32bit builds (again) (Oran Agra) [cherry-picked] `e8099ca` 2018-06-26 add defrag hint support into jemalloc 5 (Oran Agra) [re-done] `4e729fc` 2018-05-24 Generate configure for Jemalloc. (antirez) Additionally had to do this: 7727cc2 2021-10-10 Fix defrag to support sharded bins in arena (added in v5.2.1) (Yoav Steinberg) When reviewing please look at all except the first commit which is just replacing 5.1.0 with 5.2.1 sources. Also I think we should merge this without squashing to preserve the changes we did to jemalloc.	2021-10-18 12:45:11 +03:00
Oran Agra	276b460ea9	Attempt to fix a valgrind test failure due to timing (#9643) in the past few days i've seen two failures in the valgrind daily test. *** [err]: slave fails full sync and diskless load swapdb recovers it in tests/integration/replication.tcl Replica didn't get into loading mode can't reproduce it, but i'm hoping it's just too slow (to start loading within 5 seconds)	2021-10-18 10:45:45 +03:00
七飒	afd8c4e007	sdstrim remove excessive check (#4045) there is no need to compare the value of ep and sp ``` sp = start = s; // the only way to make sp > ep is sdslen(s) == 0 // so when sp > ep, ep-sp must be -1 ep = end = s+sdslen(s)-1; while(sp <= end && strchr(cset, *sp)) sp++; while(ep > sp && strchr(cset, *ep)) ep--; // -1 + 1 already equals 0 len = (sp > ep) ? 0 : ((ep-sp)+1); ``` Signed-off-by: Bo Cai <charpty@gmail.com>	2021-10-17 20:37:52 +03:00
Ilya Shipitsin	94fded4f4f	Code cleanup, resolve an issue identified by cppcheck (#4373) [src/bitops.c:512] -> [src/bitops.c:507]: (warning) Either the condition 'if(o&&o->encoding==1)' is redundant or there is possible null pointer dereference: o. This function has checks for `o` to be null or non-null, so it is odd that it accesses it first.	2021-10-17 18:48:15 +03:00
Hanna Fadida	61bb044156	Modify mem_usage2 module callback to take a sample_size argument (#9612) This is useful for approximating size computation of complex module types. Note that the mem_usage2 callback is new and has not been released yet, which is why we can modify it.	2021-10-17 17:31:06 +03:00
Oran Agra	e7864a2b70	fix typos in module doc / header (#9641) the RedisModule_ReplyWithPush prototype was merged by mistake (no such API yet)	2021-10-17 17:15:27 +03:00
Yossi Gottlieb	6d5a911707	Fix daily failures due to macos-latest change. (#9637) * Fix test modules linking on macOS 11.x. * Use macOS 10.x for FreeBSD VM as VirtualBox is not yet supported on 11.	2021-10-17 00:07:27 +03:00
guoxiang1996	3c9e5271c6	Use fcntl(fd,F_FULLFSYNC) instead of fsync on OSX, improve power failure safety (#9545) On macOS, calling fsync does not guarantee the cache on the disk itself is flushed.	2021-10-15 08:44:25 +03:00
Shaya Potter	24b67d5520	Add RM_ReplyWithVerbatimStringType that takes an ext/type arg (#9632) Verbatim Strings in RESP3 have a type/extension. The existing redismodule reply function hard-coded it to "txt".	2021-10-14 09:53:46 +03:00
Ofir Luzon	49d26a9658	Add sleep interval to redis-cli --scan option (#3751) Adding -i option (sleep interval) of repeat and bigkeys to redis-cli --scan. When the keyspace contains many already expired keys scanning the dataset with redis-cli --scan can impact the performance. Co-authored-by: Oran Agra <oran@redislabs.com>	2021-10-13 16:54:35 +03:00
Madelyn Olson	a6b5d518a9	Improved the reliability of cluster replica sync tests (#9628) Improved the reliability of cluster replica sync tests	2021-10-13 00:06:53 -07:00
Ning Xie	075ac34545	Fix redis-cli SCAN sleep interval for big/hot keys (could have been skipped) (#9624) bigkeys sleep is defined each 100 scanned keys, and it is checked only between scan cycles. In cases that scan does not return exactly 10 keys it will never sleep. In addition the comment was sleep each 100 SCANs but it was 100 scanned keys.	2021-10-12 23:00:49 +03:00
Yossi Gottlieb	922ef86a3b	hiredis: improve calloc() overflow fix. (#9630) Cherry pick a more complete fix to `0215324a6` that also doesn't leak memory from latest hiredis.	2021-10-12 20:16:29 +03:00
Yoav Steinberg	85737e6745	Added jemalloc subtree upgrade instructions.	2021-10-12 12:55:35 +03:00
Oran Agra	ed92a3e8ed	Resolve nonsense static analysis warnings	2021-10-12 12:55:35 +03:00
YoongHM	5a82940452	Fix compilation warnings in Lua and jemalloc dependencies (#7785) - The argument `u` for `ar` is ignored (and generates warnings since `D` became the default). All it does is avoid updating unchanged objects (shouldn't have any impact on our build) - Enable `LUA_USE_MKSTEMP` to force the use of `mkstemp()` instead of `tmpnam()` (which is dead code in redis anyway). - Remove unused variable `c` in `f_parser()` - Removed misleadingly indented space in `luaL_loadfile()` and `addfield()` Co-authored-by: Oran Agra <oran@redislabs.com>	2021-10-12 12:55:35 +03:00
Oran Agra	c6a26519a1	fix a rare active defrag edge case bug leading to stagnation There's a rare case which leads to stagnation in the defragger, causing it to keep scanning the keyspace and do nothing (not moving any allocation). This happens when all the allocator slabs of a certain bin have the same % utilization, but the slab from which new allocations are made has a lower utilization. This commit fixes it by removing the current slab from the overall average utilization of the bin; it also eliminates any precision loss in the utilization calculation and moves the decision about the defrag to reside inside jemalloc. It also adds a test that consistently reproduces this issue.	2021-10-12 12:55:35 +03:00
yoav-steinberg	252981914f	XADD - skip rewrite the id arg if it was given and is valid. (#9599) When calling `XADD` with a predefined id (instead of `*`) there's no need to run the code which replaces the supplied id with itself. Only when we pass a wildcard id we need to do this. For apps which always supply their own id this is a slight optimization.	2021-10-11 13:09:18 +03:00
zhaozhao.zz	484a1ad67e	master client should ignore proto_max_bulk_len in bitops (#9626)	2021-10-11 13:58:42 +08:00
Yoav Steinberg	908d3bdad9	Fix defrag to support sharded bins in arena (added in v5.2.1) See `37b8913925`	2021-10-10 18:29:13 +03:00
Oran Agra	91bc78a8b8	Active defrag fixes for 32bit builds (again) * overflow in jemalloc fragmentation hint to the defragger	2021-10-10 18:29:13 +03:00
Oran Agra	29d7f97c96	add defrag hint support into jemalloc 5	2021-10-10 18:29:13 +03:00
Yoav Steinberg	9e5cd2cb26	Generate configure for Jemalloc 5.2.1. ./autogen.sh --with-version=5.2.1-0-g0	2021-10-10 18:29:13 +03:00
Yoav Steinberg	4d5911b4e4	Merge commit '220a0f0880419450c9409202aac1fab4b8be0719' as 'deps/jemalloc'	2021-10-10 18:26:48 +03:00
Yoav Steinberg	220a0f0880	Squashed 'deps/jemalloc/' content from commit 886e40bb3 git-subtree-dir: deps/jemalloc git-subtree-split: 886e40bb339ec1358a5ff2a52fdb782ca66461cb	2021-10-10 18:26:48 +03:00
Yoav Steinberg	4a884343f5	Delete old jemalloc before pulling in subtree.	2021-10-10 18:03:38 +03:00
menwen	7ff7536e2c	Delete unused 'time' fields from struct bio_job (#9622) looks like this field was never actually used and the call to time() is excessive.	2021-10-10 08:17:54 +03:00
Bjorn Svensson	b874c6f1fc	Move config logfile to generic config (#9592) Move config `logfile` to generic configs	2021-10-07 22:33:08 -07:00
Bjorn Svensson	54d01e363a	Move config `cluster-config-file` to generic configs (#9597)	2021-10-07 22:32:40 -07:00
yoav-steinberg	834e8843de	obuf based eviction tests run until eviction occurs (#9611) obuf based eviction tests run until eviction occurs instead of assuming a certain amount of writes will fill the obuf enough for eviction to occur. This handles the kernel buffering written data and emptying the obuf even though no one actually reads from it. The tests have a new timeout of 20sec: if the test doesn't pass after 20 sec it'll fail. Hopefully this is enough for our slow CI targets. This also eliminates the need to skip some tests in TLS.	2021-10-07 15:43:48 +03:00
Huang Zhw	fd135f3e2d	Make tracking invalidation messages always after command's reply (#9422) Tracking invalidation messages were sometimes sent in inconsistent order, before the command's reply rather than after. In addition to that, they were sometimes embedded inside other commands' responses, like MULTI-EXEC and MGET.	2021-10-07 15:13:42 +03:00
GutovskyMaria	d98d1ad574	Hide empty and loading replicas from CLUSTER SLOTS responses (#9287) Hide empty and loading replicas from CLUSTER SLOTS responses	2021-10-06 22:22:27 -07:00
Andy Pan	2391aefd82	Implement anetPipe() to combine creating pipe and setting flags (#9511) Implement createPipe() to combine creating pipe and setting flags, also reduce system calls by prioritizing pipe2() over pipe(). Without createPipe(), we have to call pipe() to create a pipe and then call some functions (like anetCloexec() and anetNonBlock()) of anet.c to set flags respectively, which leads to some extra system calls, now we can leverage pipe2() to combine them and make the process of creating pipe more convergent in createPipe(). Co-authored-by: Viktor Söderqvist <viktor.soderqvist@est.tech> Co-authored-by: Oran Agra <oran@redislabs.com>	2021-10-06 16:08:13 +03:00
yoav-steinberg	123cc1a1bc	Test fails when flushdb triggers a bgsave (#9535) Flush db and then wait for the bgsave to complete.	2021-10-06 11:50:47 +03:00
yoav-steinberg	5725088ff2	Avoid argv memcpy when queuing a multi command. (#9602) When queuing a multi command we duplicated the argv (meaning an alloc and a memcpy). This isn't needed since we can use the previously allocated argv and just reset the client object's argv to NULL. This should save some memory and is a minor optimization in heavy MULTI/EXEC traffic, especially if there are lots of arguments.	2021-10-06 11:39:09 +03:00
Meir Shpilraien (Spielrein)	4fb39b6700	Added module-acquire-GIL latency stats (#9608) The new value indicates how long Redis waits to acquire the GIL after sleep. This can help identify problems where a module performs some background operation for a long time (with the GIL held) and blocks the Redis main thread.	2021-10-06 11:33:01 +03:00
tzongw	f5160ed0aa	improve latency when a client is unblocked by module timer (#9593) Scenario: 1. client blocks on command `XREAD BLOCK 0 STREAMS mystream $` 2. in a module, calling `XADD mystream * field value` via lua from a timer callback 3. client will receive response after some latency up to 100ms Reason: When `XADD` signals the key `mystream` as ready, `beforeSleep` in the next eventloop will call `handleClientsBlockedOnKeys` to unblock the client and add pending data to write but not actually install a write handler, so next redis will block in `aeApiPoll` up to 100ms given `hz` config as default 10, and pending data will be sent in another eventloop by `handleClientsWithPendingWritesUsingThreads`. Calling `handleClientsBlockedOnKeys` before `handleClientsWithPendingWritesUsingThreads` in `beforeSleep` solves the problem.	2021-10-06 10:15:03 +03:00
yoav-steinberg	897c7bddf5	Attempt to fix rare pubsub obuf maxmemory eviction test failure (#9603) * Reduce delay between publishes to allow less time to write the obufs. * More subscribed clients to buffer more data per publish. * Make sure main connection isn't evicted (it has a large qbuf).	2021-10-05 18:00:19 +03:00
yoav-steinberg	83478e6102	argv mem leak during multi command execution. (#9598) Changes in #9528 led to a memory leak if the command implementation used rewriteClientCommandArgument inside MULTI-EXEC. Adding an explicit test for that case since the test that uncovered it didn't specifically target this scenario.	2021-10-05 12:17:36 +03:00
Meir Shpilraien (Spielrein)	0f8b634cd5	Fix invalid memory write on lua stack overflow (CVE-2021-32626) (#9591) When LUA calls our C code, by default, the LUA stack has room for 10 elements. In most cases, this is more than enough but sometimes it's not and the caller must verify the LUA stack size before pushing elements. In 3 places in the code, there was no verification of the LUA stack size. On specific inputs this missing verification could have led to an invalid memory write: 1. On 'luaReplyToRedisReply', one might return a nested reply that will explode the LUA stack. 2. On 'redisProtocolToLuaType', the Redis reply might be deep enough to explode the LUA stack (notice that currently there is no such command in Redis that returns such a nested reply, but modules might do it) 3. On 'ldbRedis', one might give a command with enough arguments to explode the LUA stack (all the arguments will be pushed to the LUA stack) This commit solves all 3 issues by calling 'lua_checkstack' and verifying that there is enough room in the LUA stack to push elements. In case 'lua_checkstack' returns an error (there is not enough room in the LUA stack and it's not possible to increase the stack), we will do the following: 1. On 'luaReplyToRedisReply', we will return an error to the user. 2. On 'redisProtocolToLuaType' we will exit with panic (we assume this scenario is rare because it can only happen with a module). 3. On 'ldbRedis', we return an error.	2021-10-04 15:17:50 +03:00
Oran Agra	9e3dca8bef	Fix mem leak in loading AOF, introduced by #9528 (#9582) Recently merged PR introduced a leak when loading AOF files. This was because argv_len wasn't set, so rewriteClientCommandArgument would shrink the argv array and update argc to a small value.	2021-10-04 12:17:22 +03:00
Oran Agra	b0ca3be2bb	Fix protocol parsing on 'ldbReplParseCommand' (CVE-2021-32672) (#9590) The protocol parsing on 'ldbReplParseCommand' (LUA debugging) assumed protocol correctness. This means that if the following is given: *1 $100 test The parser will try to read additional 94 unallocated bytes after the client buffer. This commit fixes this issue by validating that there are actually enough bytes to read. It also limits the amount of data that can be sent by the debugger client to 1M so the client will not be able to explode the memory. Co-authored-by: meir@redislabs.com <meir@redislabs.com>	2021-10-04 12:14:12 +03:00
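
The reasoning in the `sdstrim` commit above (afd8c4e007) can be checked with a small standalone sketch. This is not the sds implementation: `trim_in_place` is a hypothetical helper that works on a plain NUL-terminated buffer, and the early return for an empty string is an assumption added here to avoid forming a pointer before the start of the buffer. It shows that when every character is trimmed, `ep - sp` ends at -1, so `(ep - sp) + 1` is already 0 without an extra `sp > ep` comparison.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper (not part of sds): removes, in place, every leading
 * and trailing character of s that appears in cset. Returns the new length. */
size_t trim_in_place(char *s, const char *cset) {
    assert(s != NULL && cset != NULL);

    size_t len = strlen(s);
    if (len == 0) return 0;          /* assumption: handle "" up front */

    char *sp = s;                    /* first character to keep */
    char *end = s + len - 1;         /* last character of the input */
    char *ep = end;                  /* last character to keep */

    /* Skip trimmed characters from the left, then from the right. */
    while (sp <= end && strchr(cset, *sp)) sp++;
    while (ep > sp && strchr(cset, *ep)) ep--;

    /* If everything was trimmed, sp == end + 1 and ep == end, so
     * ep - sp == -1 and the expression below is 0 with no special case. */
    size_t newlen = (size_t)((ep - sp) + 1);

    if (sp != s) memmove(s, sp, newlen);
    s[newlen] = '\0';
    return newlen;
}
```

Calling `trim_in_place` on a writable copy of `"xxhelloxx"` with `"x"` as the set leaves `"hello"` in the buffer; a buffer made only of trimmed characters comes back empty.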

10619 Commits
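
As an illustration of the `F_FULLFSYNC` commit in the log (3c9e5271c6), a minimal sketch of the idea follows. The function name `durable_sync` is hypothetical and not taken from the project sources, and the fallback to `fsync()` when `fcntl` fails is an assumption made for this sketch, not something the commit message states.

```c
#include <assert.h>
#include <fcntl.h>
#include <unistd.h>

/* Hypothetical helper: flush fd to stable storage as strongly as the
 * platform allows. Returns 0 on success, -1 on error (errno is set). */
int durable_sync(int fd) {
    assert(fd >= 0);
#ifdef F_FULLFSYNC
    /* macOS: fsync() alone may leave data in the drive's own cache;
     * F_FULLFSYNC asks the drive to flush that cache as well. */
    if (fcntl(fd, F_FULLFSYNC) == 0) return 0;
    /* Assumption of this sketch: if the filesystem rejects F_FULLFSYNC,
     * fall back to a plain fsync() below. */
#endif
    return fsync(fd);
}
```

On platforms that do not define `F_FULLFSYNC`, the preprocessor guard drops the `fcntl` call and the helper reduces to `fsync(fd)`.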