- promote the code in DEBUG PROTOCOL to addReplyBigNum
- DEBUG PROTOCOL ATTRIB skips the attribute when client is RESP2
- networking.c addReply for push and attributes generates an assertion when
called on a RESP2 client; anything else would produce a broken
protocol that clients can't handle.
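A minimal standalone sketch of such a guard (names simplified from the real networking.c helpers):

    #include <assert.h>

    typedef struct client { int resp; } client;

    /* Push ('>') and attribute ('|') replies exist only in RESP3; emitting
     * them to a RESP2 client would corrupt the protocol, so assert instead. */
    static void add_reply_push_len(client *c, long length) {
        assert(c->resp >= 3);
        /* ... write the '>' aggregate header for `length` elements ... */
        (void)length;
    }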
(cherry picked from commit 6a5bac309e)
Due to a copy-paste bug, it used to reply with a null response rather than an empty array.
This commit includes new tests that inspect the RESP response directly in
order to be able to tell the difference between them.
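For reference, these are the two RESP2 wire forms the tests now distinguish:

    /* Most client layers collapse both of these into an "empty" result,
     * which is why the tests inspect the raw protocol bytes. */
    static const char null_array[]  = "*-1\r\n"; /* null multi-bulk reply */
    static const char empty_array[] = "*0\r\n";  /* empty multi-bulk reply */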
Co-authored-by: Oran Agra <oran@redislabs.com>
(cherry picked from commit a418a2d3fc)
This makes it possible to distinguish between a null response and an empty
array (currently the test infra translates both to an empty string/list).
(cherry picked from commit 7103367ad4)
Fix returning a bad score when used with a negative count (or a count of 1) on a non-ziplist encoded zset.
Also add a test to validate the return value and cover the issue.
(cherry picked from commit 4bc5a8324d)
Previously, passing 0 for newlen would not truncate the string at all.
This adds handling of that case, freeing the old string and creating a new empty string (see the sketch after the list below).
Other changes:
- Move `src/modules/testmodule.c` to `tests/modules/basics.c`
- Introduce that basic test into the test suite
- Add tests to cover StringTruncate
- Add `test-modules` build target for the main makefile
- Extend `distclean` build target to clean modules too
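A hedged sketch of the fixed behavior from a module's perspective (the command name is hypothetical):

    #include "redismodule.h"

    /* Hypothetical command "trunc0 <key>": after the fix, truncating to 0
     * produces an empty string value instead of leaving the old value as is. */
    int Trunc0_RedisCommand(RedisModuleCtx *ctx, RedisModuleString **argv, int argc) {
        if (argc != 2) return RedisModule_WrongArity(ctx);
        RedisModuleKey *key = RedisModule_OpenKey(ctx, argv[1], REDISMODULE_WRITE);
        int rc = RedisModule_StringTruncate(key, 0);
        RedisModule_CloseKey(key);
        if (rc != REDISMODULE_OK)
            return RedisModule_ReplyWithError(ctx, "ERR truncate failed");
        return RedisModule_ReplyWithSimpleString(ctx, "OK");
    }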
(cherry picked from commit 1ccf2ca2f4)
The decision to stop trimming due to LIMIT in XADD and XTRIM was made only after the limit was reached,
i.e. the code was deleting **at least** that count of records (from the LIMIT argument's perspective, not the MAXLEN),
instead of **up to** that count of records.
see #9046
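A toy illustration of the corrected stop condition (not the actual stream code):

    /* Check the limit *before* deleting, so no more than `limit` records
     * are ever removed; the old order checked only after a deletion. */
    static int trim(int excess, int limit) {
        int deleted = 0;
        while (excess > 0 && deleted < limit) {
            excess--;
            deleted++;
        }
        return deleted;
    }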
(cherry picked from commit eaa7a7bb93)
When a test stops a 'load handler' by killing the process that generates the load,
some commands that are already in the input buffer might still be processed by the server.
This can cause instability in tests that rely on no more commands being
processed after the 'load handler' is stopped.
This commit adds a new proc, 'wait_load_handlers_disconnected', to verify that no more
commands from any 'load handler' are processed, by checking that the clients that
generate the load have disconnected.
Also, replace the dbsize check with wait_for_ofs_sync before comparing debug digests, as
it would fail in case the last key the workload wrote was an overridden key (not a new one).
Affected tests:
Race fix:
- failover command to specific replica works
- Connect multiple replicas at the same time (issue #141), master diskless=$mdl, replica diskless=$sdl
- AOF rewrite during write load: RDB preamble=$rdbpre
Cleanup and speedup:
- Test replication with blocking lists and sorted sets operations
- Test replication with parallel clients writing in different DBs
- Test replication partial resync: $descr (diskless: $mdl, $sdl, reconnect: $reconnect)
(cherry picked from commit 32a2584e07)
There are two issues fixed in this commit:
1. we want to fail the EXEC command in case there is a watched key that's logically
expired but not yet deleted by active expire or lazy expire.
2. we saw that currently the cached time is updated in every `call()` (including nested calls),
and this time is also used for the isKeyExpired comparison; we want to update
the cached time only in the first call (execCommand).
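A toy sketch of the second fix (the helper names are made up):

    #include <stdint.h>
    #include <sys/time.h>

    static int64_t cached_mstime;
    static int call_depth;

    static int64_t now_ms(void) {
        struct timeval tv;
        gettimeofday(&tv, NULL);
        return (int64_t)tv.tv_sec * 1000 + tv.tv_usec / 1000;
    }

    /* Refresh the cached time only for the outermost call (execCommand),
     * so nested call()s see one consistent "now" for expiry checks. */
    static void call_enter(void) {
        if (call_depth++ == 0) cached_mstime = now_ms();
    }

    static void call_leave(void) { call_depth--; }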
Co-authored-by: Oran Agra <oran@redislabs.com>
(cherry picked from commit ac8b1df885)
The `Tracking gets notification of expired keys` test in tracking.tcl
used to hang in valgrind CI quite a lot.
It turns out the reason is that with valgrind and a busy machine, the
server cron active expire cycle could easily run in the same event loop
as the command that created `mykey`, so that when the key got expired,
there were two change events to broadcast, one that set the key and one
that expired it; but since we used raxTryInsert, the client that was
associated with the "last" change was the one that created the key, so
NOLOOP filtered that event.
This commit adds a test that reproduces the problem by using lazy expire
in a multi-exec which makes sure the key expires in the same event loop
as the one that added it.
(cherry picked from commit 9b564b525d)
In diskless replication, we create a read pipe for the RDB between the child and the parent.
When we close this pipe (fd), the read handler also needs to be removed from the event loop (if it is still registered).
Otherwise, the next time we use the same fd, the registration will fail (panic), because
we will use EPOLL_CTL_MOD (the fd is still registered in the event loop) on an fd that was already removed from epoll.
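A sketch of the fix, assuming Redis' internal ae.h event-loop API:

    #include <unistd.h>
    #include "ae.h"

    /* Remove the read handler before closing the fd; otherwise a later
     * reuse of the same fd finds a stale registration, and the epoll
     * backend issues EPOLL_CTL_MOD for an fd epoll no longer knows. */
    static void close_rdb_pipe_read_end(aeEventLoop *el, int fd) {
        aeDeleteFileEvent(el, fd, AE_READABLE);
        close(fd);
    }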
(cherry picked from commit 501d775583)
When a client breached the output buffer soft limit but then went idle,
we didn't disconnect on the soft limit timeout; now we do.
Note this also resolves some sporadic test failures due to Linux
buffering data, which caused tests to fail if during the test we went
back under the soft COB limit.
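A toy version of the check that now also runs for idle clients:

    #include <stddef.h>
    #include <time.h>

    /* A client is marked when it first breaches the soft limit; once it has
     * stayed over the limit for soft_seconds it gets disconnected, even if
     * it went idle in the meantime. */
    static int soft_limit_violated(size_t used, size_t soft_limit,
                                   time_t breached_at, time_t soft_seconds,
                                   time_t now) {
        if (soft_limit == 0 || used < soft_limit) return 0;
        return breached_at != 0 && now - breached_at >= soft_seconds;
    }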
Co-authored-by: Oran Agra <oran@redislabs.com>
Co-authored-by: sundb <sundbcn@gmail.com>
(cherry picked from commit 152fce5e2c)
Use an invalid IP address to trigger a CONFIG SET bind failure, instead of DNS, which is not guaranteed to always fail.
(cherry picked from commit 2b22fffc78)
Adding a new type mask for keyspace notifications, REDISMODULE_NOTIFY_MODULE, to enable unique notifications from commands on REDISMODULE_KEYTYPE_MODULE type keys (which are currently unsupported).
Modules can subscribe to module-key keyspace notifications with RM_SubscribeToKeyspaceEvents,
and clients via notify-keyspace-events in redis.conf or CONFIG SET, using the characters 'd' or 'A'
(the REDISMODULE_NOTIFY_MODULE type mask is part of the '**A**ll' notation for keyspace notifications).
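From the module side, subscribing might look like this (module and callback names are hypothetical):

    #include "redismodule.h"

    /* Invoked for notifications fired on module-type keys. */
    static int OnModuleKeyEvent(RedisModuleCtx *ctx, int type,
                                const char *event, RedisModuleString *key) {
        REDISMODULE_NOT_USED(type);
        REDISMODULE_NOT_USED(key);
        RedisModule_Log(ctx, "notice", "module key event: %s", event);
        return REDISMODULE_OK;
    }

    int RedisModule_OnLoad(RedisModuleCtx *ctx, RedisModuleString **argv, int argc) {
        REDISMODULE_NOT_USED(argv);
        REDISMODULE_NOT_USED(argc);
        if (RedisModule_Init(ctx, "notifdemo", 1, REDISMODULE_APIVER_1) == REDISMODULE_ERR)
            return REDISMODULE_ERR;
        return RedisModule_SubscribeToKeyspaceEvents(ctx, REDISMODULE_NOTIFY_MODULE,
                                                     OnModuleKeyEvent);
    }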
Refactor: move some pubsub test infra from pubsub.tcl to util.tcl to be re-used by other tests.
Before this commit, using RM_Call without "!" could cause the master
to lazy-expire a key (delete it) without replicating the deletion to replicas.
This could cause the replica's memory usage to gradually grow, and
could also cause consistency issues if the master and replica have
a clock diff.
This bug was introduced in #8617
Added a test which demonstrates that scenario.
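For context, the "!" format modifier is what tells RM_Call to replicate the command's effects; a hedged example:

    #include "redismodule.h"

    /* With "!", effects are propagated to replicas/AOF; before this fix,
     * a lazy expiration triggered by a call without "!" could delete the
     * key on the master without replicating the deletion. */
    static void set_with_replication(RedisModuleCtx *ctx,
                                     RedisModuleString *key,
                                     RedisModuleString *val) {
        RedisModuleCallReply *reply = RedisModule_Call(ctx, "SET", "!ss", key, val);
        if (reply) RedisModule_FreeCallReply(reply);
    }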
In the initial release of Redis 6.2, setting a user to only allow pubsub access to
a specific channel, and doing ACL SAVE, resulted in an assertion when
ACL LOAD was used. This was later changed by #8723 (not yet released),
but still not properly resolved (it errored instead of crashing).
The problem is that the server that generates an ACL file doesn't know what
the acl-pubsub-default config will be set to on the server that loads the file,
so ACL SAVE needs to always start with the resetchannels directive.
This should still be compatible with old ACL files (from redis 6.0), and with ones from earlier
versions of 6.2 that didn't mess with channels.
Co-authored-by: Harkrishn Patro <harkrisp@amazon.com>
Co-authored-by: Oran Agra <oran@redislabs.com>
The tail size of c->reply is 16kb, but the test only publishes a
few chars each time; due to a change in #8699, the obuf limit
is now checked when a new memory allocation is made, so this test
would sometimes have failed to trigger a soft limit disconnection
in time.
The solution is to write bigger payloads to the output buffer, but
still limit their rate (not more than 100k/s).
In GitHub Actions CI with valgrind, I saw that even the fast replica
(the one that wasn't paused) didn't get to complete the replication fast
enough, and ended up getting disconnected by timeout.
Additionally, due to a typo in uname, we didn't get to actually run the
CPU efficiency part of the test.
1. the `dump_logs` option would have printed only logs of servers that were
spawned before the test proc started, and not ones that the test proc
started inside it.
2. when a server proc catches an exception it should normally forward the
exception upwards, specifically when it's an assertion that should be
caught by a test proc above. however, in `durable` mode, we caught all
exceptions, printed them to stdout and let the code continue;
this was wrong to do for assertions, which should have still been
propagated to the test function.
3. don't bother to search for a crash log to print if we printed the
entire log anyway.
4. if no crash log was found, no need to print anything (i.e. the fact it
wasn't found).
5. rename warnings_from_file to crashlog_from_file.
Starting with Redis 6.0 (as part of the TLS feature), a diskless master uses a pipe from the fork
child so that the parent is the one sending data to the replicas.
This mechanism has an issue in which a hung replica will cause the master to wait
forever for it to read the data sent to it, thus preventing the fork child from terminating
and preventing the creation of any other forks.
This PR adds a timeout mechanism, much like the ACK-based timeout:
we disconnect replicas that aren't reading the RDB file fast enough.
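A toy sketch of the added check (field names invented):

    #include <time.h>

    typedef struct replica {
        time_t last_pipe_read; /* last time it consumed data from the RDB pipe */
    } replica;

    /* A replica that hasn't read from the RDB pipe within the timeout is
     * considered hung and is disconnected, letting the fork child exit. */
    static int rdb_pipe_read_timed_out(const replica *r, time_t timeout, time_t now) {
        return timeout > 0 && now - r->last_pipe_read > timeout;
    }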
Disable replica migration to avoid a race condition where the
migrated-from node turns into a replica.
Long term, this test should probably be improved to handle multiple
slots and accept such auto migrations, but this is a quick fix to
stabilize the CI without completely dropping this test.
Fix out of range error messages to be clearer (avoid mentioning 9223372036854775807)
* Fix XAUTOCLAIM COUNT option confusing error msg
* Fix the error messages of RPOP and similar commands to mention that count must be positive
With this fix, module data type registration will fail if the load or save callbacks are not defined, or the optional aux load and save callbacks are not either both defined or both missing.
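A sketch of a registration that now passes validation (callback names are hypothetical, bodies omitted):

    #include "redismodule.h"

    void *MyTypeRdbLoad(RedisModuleIO *io, int encver);
    void MyTypeRdbSave(RedisModuleIO *io, void *value);

    /* rdb_load/rdb_save are mandatory; aux_load/aux_save must be either
     * both set or both NULL, otherwise CreateDataType now returns NULL. */
    static RedisModuleTypeMethods tm = {
        .version = REDISMODULE_TYPE_METHOD_VERSION,
        .rdb_load = MyTypeRdbLoad,
        .rdb_save = MyTypeRdbSave,
        .aux_load = NULL,
        .aux_save = NULL,
    };
    /* e.g. RedisModule_CreateDataType(ctx, "mytype-9c", 0, &tm) */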
This is work in progress, focusing on two main areas:
* Avoiding race conditions with cluster configuration propagation.
* Ignoring limitations of redis-cli --cluster fix, which make it hard
to distinguish real errors (e.g. failure to fix) from expected
conditions in this test (e.g. nodes not agreeing on configuration).
Background:
Redis 6.2 added ACL control for pubsub channels (#7993), which was supposed
to be permissive by default to retain compatibility with redis 6.0 ACL.
But due to a bug, only newly created users got this `acl-pubsub-default` applied,
while overwritten (updated) users got reset to `resetchannels` (denied).
Since the "default" user exists before loading the config file,
any ACL change to it, results in an update / overwrite.
So when a "default" user is loaded from config file or include ACL
file with no channels related rules, the user will not have any
permissions to any channels. But other users will have default
permissions to any channels.
When upgraded from 6.0 with config rewrite, this will lead to
"default" user channels permissions lost.
When users are loaded from include file, then call "acl load", users
will also lost channels permissions.
Similarly, the `reset` ACL rule would reset the user to be denied
access to any channels, ignoring `acl-pubsub-default` and breaking
compatibility with redis 6.0.
The implication of this fix is that it regains compatibility with redis 6.0,
but breaks compatibility with redis 6.2.0 and 6.2.1. e.g. after the upgrade,
the default user will regain access to pubsub channels.
Other changes:
Additionally, this commit renames server.acl_pubusub_default to
server.acl_pubsub_default and fixes a typo in the acl tests.
Previously (and by default after this commit), when a master loses its last slot
(due to migration, for example), its replicas will migrate to the new holder
of that last slot.
There are cases where this is not desired:
* Consolidation that results in removed nodes (including the replica, eventually).
* Manually configured cluster topologies, which the admin wishes to preserve.
Needlessly migrating a replica triggers a full synchronization and can have a negative impact, so
we prefer to be able to avoid it where possible.
This commit adds a 'cluster-allow-replica-migration' configuration option that is
enabled by default to preserve the existing behavior. When disabled, replicas will
not be auto-migrated.
Fixes #4896
Co-authored-by: Oran Agra <oran@redislabs.com>
This command used to return the last scanned entry id as the cursor,
instead of the next one to be scanned.
So in the next call, the user could / should have sent `(cursor` and not
just `cursor` if they wanted to avoid scanning the same record twice.
Scanning the record twice would look odd if someone is checking what
exactly was scanned, but it also has a side effect of incrementing the
delivery count twice.
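The essence of the fix, with stream ids shown as ms-seq pairs (a sketch, not the actual code):

    #include <stdint.h>

    typedef struct { uint64_t ms, seq; } stream_id;

    /* The cursor reported back should be the id *after* the last entry
     * scanned, so a follow-up call never revisits that entry. */
    static stream_id stream_id_next(stream_id id) {
        if (id.seq == UINT64_MAX) { id.ms++; id.seq = 0; }
        else id.seq++;
        return id;
    }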
5629dbe71 added a change that configures the tcp (plaintext) port
alongside the tls port; this causes the INFO command's tcp_port field
to return that instead of the tls port when running in tls, and that broke
the sentinel tests that query it.
The fix is to add a method that gets the right port from CONFIG instead
of relying on the tcp_port info field.
If GT/LT fails the operation, we need to reply with
nil (like a failure due to NX).
Other changes:
Add the missing $encoding suffix to many zset tests
Note: there's a behavior change just in the case of INCR + GT/LT that fails.
The old code was replying with the wrong (rejected) score, and now it'll reply with nil.
Note that this is a corner case anyway, so the "behavior change" shouldn't have much effect.
Using GT/LT with INCR has a predictable result even before we run the command
(INCR with GT will only / always fail if the increment is negative).
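A toy statement of that predictability (GT shown; LT is symmetric):

    /* With ZADD GT INCR the update is kept only when the new score is
     * strictly greater than the current one, so a non-positive increment
     * always fails (nil reply) and a positive one always succeeds. */
    static int gt_incr_succeeds(double incr) { return incr > 0; }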
Problem:
Currently, when performing random distribution verification, we determine
the probability of each element occurring in the sum, but the probability is
only an estimate; these tests had rare sporadic failures, and we cannot verify
what the probability of failure will be.
Solution:
Using the chi-square distribution instead of the original random distribution
validation makes the test more reasonable and makes problems easier to find.
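A minimal sketch of such a check against a uniform expectation:

    #include <stddef.h>

    /* Chi-square statistic for k buckets with a uniform expectation; the
     * test passes while the statistic stays below the critical value for
     * the chosen significance level at k-1 degrees of freedom. */
    static double chi_square_uniform(const long *observed, size_t k, long total) {
        double expected = (double)total / (double)k, stat = 0.0;
        for (size_t i = 0; i < k; i++) {
            double d = (double)observed[i] - expected;
            stat += d * d / expected;
        }
        return stat;
    }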
The 'sentinel replicas <master>' command will ignore replicas with
`replica-announced` set to no.
The goal of disabling the replica-announced config setting is to allow ghost
replicas: the replica is in the cluster, synchronizes with its master, can be
promoted to master, and is not exposed to sentinel clients. This way, it
acts as a live backup or living ghost.
In addition, to prevent the replica from being promoted to master, set
replica-priority to 0.
The cluster bus is established over TLS or non-TLS depending on the configuration tls-cluster. The client ports distributed in the cluster and sent to clients are assumed to be TLS or non-TLS also depending on tls-cluster.
The cluster bus is now extended to also contain the non-TLS port of clients in a TLS cluster, when available. The non-TLS port of a cluster node, when available, is sent to clients connected without TLS in responses to CLUSTER SLOTS, CLUSTER NODES, CLUSTER SLAVES and MOVED and ASK redirects, instead of the TLS port.
The user was able to override the client port by defining cluster-announce-port. Now cluster-announce-tls-port is added, so the user can define an alternative announce port for both TLS and non-TLS clients.
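A sketch of the port-selection logic (field names assumed):

    /* Which client port to advertise in a TLS cluster: clients connected
     * without TLS get the plaintext port when the node exposes one. */
    typedef struct node_ports {
        int port;  /* primary client port (TLS when tls-cluster is enabled) */
        int pport; /* plaintext client port, 0 when unavailable */
    } node_ports;

    static int announced_port(const node_ports *n, int client_on_tls) {
        return (!client_on_tls && n->pport) ? n->pport : n->port;
    }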
Fixes #8134
The bug was also discussed in #8716 and was solved in #8719, but incompletely:
when the server is started and the save option is at its default, if you issue `config set save ""`
to change the save option and then issue the `config rewrite` command, the `save ""` won't be saved.
Another test race condition in the macos tests:
the test was waiting for PINGs to be generated and put on the replication stream,
but waiting for 1 or 2 seconds doesn't really guarantee that,
and then the test that expected 6 full syncs found only 4.
Add tests for fixing a migrating slot at all stages:
1. when migration is half inited on "migrating" node
2. when migration is half inited on "importing" node
3. migration inited, but not finished
4. migration is half finished on "migrating" node
5. migration is half finished on "importing" node
Also add tests for many simultaneous slot migrations.
Co-authored-by: Yossi Gottlieb <yossigo@gmail.com>
'processCommandAndResetClient' returns 1 if the client is dead. It does this
by checking whether server.current_client is NULL. On script timeout, Redis will re-enter
'processCommandAndResetClient', and when it finishes it will set server.current_client
to NULL. This later causes it to falsely return 1 and think that the client that
sent the timed-out script is dead (making Redis stop reading from the client buffer).
Add publish channel permissions check in processCommand.
processCommand didn't check publish channel permissions, so we could
queue a publish command in a transaction; but when the transaction was executed,
it would fail with -NOPERM.
We also unify the keys/commands/channels permissions checks together in
ACLCheckAllPerm. Remove pubsubCheckACLPermissionsOrReply from
publishCommand/subscribeCommand/psubscribeCommand. Always
check permissions in processCommand/execCommand/
luaRedisGenericCommand.
* SLOWLOG didn't record anything for blocked commands because the client
was reset and argv was already empty. There was a fix for this issue
specifically for modules; now it works for all blocked clients.
* The original command argv (before being re-written) was also reset
before adding the slowlog entry on behalf of the blocked command.
* Latency monitor is now updated regardless of the slowlog flags of the
command or its execution (their purpose is to hide sensitive info from
the slowlog, not hide the fact the latency happened).
* Latency monitor now uses real_cmd rather than c->cmd (which may be
different if the command got re-written, e.g. GEOADD)
Changes:
* Unify shared code between slowlog insertion in call() and
updateStatsOnUnblock(), hopefully preventing future bugs from happening
due to the latter being overlooked.
* Reset CLIENT_PREVENT_LOGGING in resetClient rather than after command
processing.
* Add a test for SLOWLOG and BLPOP
Notes:
- real_cmd == c->lastcmd, except inside MULTI and Lua.
- blocked commands never happen in these cases (MULTI / Lua)
- real_cmd == c->cmd, except for when the command is rewritten (e.g.
GEOADD)
- blocked commands (currently) are never rewritten
- other than the client's CLIENT_PREVENT_LOGGING flag and the
execution flag CMD_CALL_SLOWLOG, another case where we want to
avoid the slowlog is AOF loading (specifically CMD_CALL_SLOWLOG will
be off when executed from execCommand that runs from an AOF)
The corrupt-dump-fuzzer test found a case where access to a corrupt
stream would have caused access to uninitialized memory;
now it'll panic instead.
The issue was a stream that says it has more than 0
records, but looking for the max ID came back empty handed.
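A standalone sketch of the guard (struct and helper names invented):

    #include <assert.h>
    #include <stdint.h>

    typedef struct { uint64_t ms, seq; } stream_id;
    typedef struct { uint64_t length; /* entries the header claims */ } stream;

    /* Hypothetical lookup: returns 0 when no entry can be found. */
    int stream_find_max_id(const stream *s, stream_id *out);

    /* Panic (here: assert) instead of touching uninitialized memory when
     * the header and the actual entries disagree. */
    static void validate_stream(const stream *s) {
        stream_id max;
        if (s->length != 0)
            assert(stream_find_max_id(s, &max) && "corrupt stream: no max id");
    }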
p.s. when sanitize-dump-payload is used, this corruption is detected,
and the RESTORE command is gracefully rejected.