Skip to content

Latest commit

 

History

History
312 lines (205 loc) · 14.3 KB

README.md

File metadata and controls

312 lines (205 loc) · 14.3 KB

redis-async

Build Status

Experimental asynchronous Redis client for Clojure.

Available from Clojars:

Clojars Project

Project status

The version on Clojars is a beta.

Design goals

There are two primary design goals. First, to be async, using core.async, for the purposes of being easily used in an application already heavily based around that. Second, to be as close to the Redis API as much as possible; it is a low-level API, but a very useful one.

Features

  • All redis commands, including blocking commands and pub/sub commands.
  • Multiplex many concurrent requests onto a single Redis connection.
  • Implicit pipelining.
  • Support for transactions.

Missing features

  • Support for clustering

How to use

Simple interactive example

(require '[redis-async.core :as redis-async])
(require '[redis-async.client :as client])

;; Build a connection pool

(def p (redis-async/make-pool {:hostname "localhost" :port 6379}))

;; Set a value

(<!! (client/set p "KEY1" "VALUE-A"))

;; Get a value

(<!! (client/get p "KEY1"))

;; Will show "VALUE-A"

Each Redis command has an equivalent function in redis-async.client. These can be called passing the connection pool as the first parameter.

The connection pool

Rather than a traditional connection pool as a means of allowing multiple threads access to the database via multiple connections, this library will multiplex Redis commands from many threads into a single queue (pipelining if possible). However some commands require a dedicated connecton, these are explained in more detail below.

To create a connection pool, call make-pool in redis-async.core passing in a map of connection details (host, port, password (if any), and database (defaults to 0)). An empty map will default to localhost and the default Redis port.

To clean-up a connection pool at the end, call close-pool in redis-async.core.

This library does not enforce the use of any component systems, but the above was designed to painlessly be used by them.

Implementation challenges

There's a number of differences of philosophy that crop up implementing a Redis client in Clojure. One example is differences in the definition of a 'string', in Redis this means 'byte-array' essentially; however most use-cases for a Redis client would expect to use Strings rather than byte arrays. But, converting byte arrays to String silently would break a number of edge cases (e.g. the DUMP and RESTORE commands, the conversion would subtly alter the string in such a way it couldn't be restored).

To overcome this, an intermediate format is used. Using the raw core.async operations on a channel will return the intermediate format from which a byte-array could be extracted; but in most cases, when high-level defaults apply, there are alternative channel operations that do sensible defaults.

Client functions

Each function that implements a Redis command returns a channel, from which the result of that command can be read. These can be read like any other core.async channel, or one of the convenience functions/macros can be used instead; the main difference between the convenience options and anything else is that they ensure conventions are in place (e.g. it allows a Redis operation to return nil, usually you cannot send nil through a core.async channel).

Exceptions to the rule

The vast majority of Redis commands are simple request/response commands. There are a number however with behave differently, in that they either: a) return an arbitrary/infinite number of results, or b) are blocking. These are implemented seperately with slightly different semantics.

MONITOR

The monitor command will return an infinite (until you tell it to stop) sequence of all activity on a Redis server.

The monitor function returns a channel, if call succeeds this yields a vector containing two channels. The first channel contains each monitored command, on a line-by-line basis; the second is used to stop monitoring, closing this channel will stop.

BLPOP, BRPOP, and BRPOPLPUSH

All these commands are blocking, the Redis server will wait until data becomes available to return. While you may think blocking operations are an anethema to an async client, it doesn't have to be so.

The Redis protocol defines that such operations will mean a connection cannot be used for any other purpose while it's blocked. So the implementation of these blocking commands cannot use the single, shared, connection; users should therefore be aware that by using these commands will result in this library creating more connections. Such additional connections will be reused where appropriate, the number created therefore will be the maximum of the number of concurrent blocking connections.

On the other hand, with an async client, such "blocking" operations will never block a thread so can be used for triggering/co-ordinating asynchronous code. For example:

redis-async.client> (def p (make-pool {}))
#'redis-async.client/p
redis-async.client> (a/go (let [[list msg] (<! (blpop p "LIST1" 0))] (println "*** Got" msg "from" list)))
#<ManyToManyChannel clojure.core.async.impl.channels.ManyToManyChannel@49bc5015>
redis-async.client> ;; time passes
redis-async.client> (rpush p "LIST1" "Some Data")
#<ManyToManyChannel clojure.core.async.impl.channels.ManyToManyChannel@155de840>

;; Std-out then shows:
*** Got Some Data from LIST1
Pub/sub

Redis has a publish/subscribe mechanism. Once again the semantics for these commands differ from the default. Once subscribed to a (Redis) channel, the connection will contain multiple messages that are sent to that channel; therefore connections that are used for pub/sub need to be kept apart from those normally used.

However there are similarities, a single connection can alter the channels that it is subscribed to, allowing a single connection to be used the library can route messages.

subscribe and psubscribe each subscribe to one or more (Redis) channels, and returns a single (core.async) channel which will contain every message sent to those subscribed channels.

unsubscribe and punsubscribe does the opposite and unsubscribed from the (Redis) channel, closing the (core.async) channel as it does so.

Transactions

Redis transactions are supported using the with-transaction macro in redis-async.core. Any code contained within the body a with-transaction block will be submitted as a transaction. This means the response to individual commands will be an acknowledgement, the actual results of the whole block are returned at the end.

Other client functions

The convenience functions for dealing with channels follow the same naming convention as core.async namely using a single ! for those which work in a go block, and a double !! for those outside of a go block.

<! and <!! simply read a value from a channel, but will substitute nils correctly, this means that you cannot use nil to understand when the end of the channel has been reached, you need to know how many values to expect, which is trivial in most circumstances.

wait! and wait!! throw away the value, but can be used to ensure that the action has happened before proceeding.

faf is "fire-and-forget", just move on. This will ensure the channel is fully consumed, but doesn't wait.

Scripting support

A useful, and arguably underused, feature of Redis is it's scripting support. The built-in low-level commands EVAL, EVALSHA, SCRIPT LOAD, etc. are all present and work as you would expect. But redis-async also contains higher-level support to make working with server-side scripts easier.

The defscript macro will generate a function given a Lua script. This function will ensure that the script is loaded onto the Redis server (once per instance of a connection pool), calling the function will execute an EVALSHA on that script.

Example:

(defscript test-script
  "redis.call('incr', KEYS[1])
  return redis.call('get', KEYS[1])")

;; then calling it, like so:

redis-async.client> (<!! (test-script p ["NAME-OF-KEY"]))
"1"

For scripts too long/unweildy to be defined in-line, the defscript can be combined with a utility function from to load a script from the class-path, allowing Lua scripts to be bundled with the application.

(defscript test-script (from "path-to-package/test-script.lua"))

Error handling

Error handling is currently rudimentary. If an unrecoverable error occurs (e.g. server being unavailable), all open channels will close and new commands may be rejected. In this case the pool should be closed and a new one created.

The roadmap includes improving this to automatically retry/reconnect.

Why not just use ...?

TBC: list of similarities and differences with other libraries

Implicit pipelining

Unlike most other Redis clients, redis-async will automatically pipeline multiple commands issued within a similar frame of time into a single request/response cycle. It does so in a way that does not harm the worst case, but significantly improves the best case.

For instance, many commands in immediate succession, without waiting for results, will be sent together:

(require '[redis-async.core :as redis-async])
(require '[redis-async.client :as client])

(def p (redis-async/make-pool {})) ;; hostname and port defaults will connect to a local Redis.

(let [c1 (client/set p "X" "TEST")
      c2 (client/set p "Y" "TEST2")
      c3 (client/get p "X")
      c4 (client/get p "Y")]
  (println (client/<!! c3))
  (println (client/<!! c4)))

This prints the expected outcome:

TEST
TEST2

The data sent between client and server, as captured by ngrep is as follows:

T 127.0.0.1:55817 -> 127.0.0.1:6379 [AP]
*3..$3..SET..$1..X..$4..TEST..*3..$3..SET..$1..Y..$5..TEST2..*2..$3..GET..$1..X..*2..$3..GET..$1
..Y..
##
T 127.0.0.1:6379 -> 127.0.0.1:55817 [AP]
+OK..+OK..$4..TEST..$5..TEST2..
##

However, should their be a gap in commands due to: waiting for a response to a previous command, or doing something else entirely; then the commands will be grouped into multiple request/response cycles. For example:

(let [c1 (client/set p "X" 0)
      c2 (client/incr p "X")
      c3 (client/set p "Y" (client/<!! c2))
      c4 (client/get p "Y")]
  (println (client/<!! c4)))

This prints the single digit 1, but ngrep shows how it's split the commands into two requests at the point where there's a need to wait for the rest of a previous command:

T 127.0.0.1:55817 -> 127.0.0.1:6379 [AP]
*3..$3..SET..$1..X..$1..0..*2..$4..INCR..$1..X..
##
T 127.0.0.1:6379 -> 127.0.0.1:55817 [AP]
+OK..:1..
##
T 127.0.0.1:55817 -> 127.0.0.1:6379 [AP]
*3..$3..SET..$1..Y..$1..1..*2..$3..GET..$1..Y..
##
T 127.0.0.1:6379 -> 127.0.0.1:55817 [AP]
+OK..$1..1..
##

Performance

The 0.3.x series of releases have significantly improved performance compared to earlier versions; and starting with 0.3.2 performance is very good, being as-fast and usual faster than the competition.

Version 0.3.2 was tested against Carmine version 2.11.1. The test was the same as the one used in the JRESP performance tests, specifically 1,000,000 SET commands. These are based on tests originally developed for Stefan Wille's test for a Redis client in Crystal.

The test was repeated ten times for each.

redis-async obtained the following times:

Took: 2825.58ms
Took: 2193.04ms
Took: 2141.28ms
Took: 2108.30ms
Took: 2156.06ms
Took: 2028.24ms
Took: 2061.57ms
Took: 2282.01ms
Took: 2200.28ms
Took: 2264.24ms

Carmine obtained the following times:

Took: 4785.35ms
Took: 4530.83ms
Took: 3599.26ms
Took: 2668.91ms
Took: 3854.12ms
Took: 2801.31ms
Took: 3691.85ms
Took: 2642.10ms
Took: 3697.83ms
Took: 2703.41ms

Compared to Carmine, it appears that redis-async elapsed time is between 15 to 40% shorter.

DISCLAIMER: this is one arbitrary, and not very realistic test. These tests should be replaced with more typical usage patterns.

It is also worth comparing these results with the performance results obtained from JRESP alone (the library that redis-async uses), the same test was performed by JRESP in approximately 1200ms.

Testing

To run tests lein test. Please not this requires a Redis instance running on localhost and the default Redis port. Also, please note, this will trash anything in database 1 on that instance.

Revision history

  • 0.1.0 - first feature-complete release (i.e. implementation of all Redis commands, pipelining, transactions, pub/sub, etc., only exception: cluster support).
  • 0.1.1 - additional high-level scripting functionality.
  • 0.1.2 - fixed inconsistencies dealing with Error responses, and also making sure that the first error is reported when dealing with script compilation errors.
  • 0.2.0 - Uses JRESP for serialising RESP.
  • 0.3.0 - Significant performance/stability improvements achieved by moving connection pooling/pipelining/etc. into the JRESP layer.
  • 0.3.1 and 0.3.2 - Performance improvements.

Still to-do

  1. Test coverage.
  2. Documentation (especially around edge-cases, e.g. lost connection).
  3. Cluster support.
  4. More realistic scenarios for performance testing.

License

Copyright 2015 Ben Ashford

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.