You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
134 lines
6.4 KiB
134 lines
6.4 KiB
# gRPC (Core) Compression Cookbook
|
|
|
|
## Introduction
|
|
|
|
This document describes compression as implemented by the gRPC C core. See [the
|
|
full compression specification](compression.md) for details.
|
|
|
|
### Intended Audience
|
|
|
|
Wrapped languages developers, for the purposes of supporting compression by
|
|
interacting with the C core.
|
|
|
|
## Criteria for GA readiness
|
|
|
|
1. Be able to set compression at [channel](#per-channel-settings),
|
|
[call](#per-call-settings) and [message](#per-message-settings) level.
|
|
In principle this API should be based on _compression levels_ as opposed to
|
|
algorithms. See the discussion [below](#level-vs-algorithms).
|
|
1. Have unit tests covering [the cases from the
|
|
spec](https://github.com/grpc/grpc/blob/master/doc/compression.md#test-cases).
|
|
1. Interop tests implemented and passing on Jenkins. The two relevant interop
|
|
test cases are
|
|
[large_compressed_unary](https://github.com/grpc/grpc/blob/master/doc/interop-test-descriptions.md#large_compressed_unary)
|
|
and
|
|
[server_compressed_streaming](https://github.com/grpc/grpc/blob/master/doc/interop-test-descriptions.md#server_compressed_streaming).
|
|
|
|
## Summary Flowcharts
|
|
|
|
The following flowcharts depict the evolution of a message, both _incoming_ and
|
|
_outgoing_, irrespective of the client/server character of the call. Aspects
|
|
still not symmetric between clients and servers (e.g. the [use of compression
|
|
levels](https://github.com/grpc/grpc/blob/master/doc/compression.md#compression-levels-and-algorithms))
|
|
are explicitly marked. The in-detail textual description for the different
|
|
scenarios is described in subsequent sections.
|
|
|
|
## Incoming Messages
|
|
|
|

|
|
|
|
## Outgoing Messages
|
|
|
|

|
|
|
|
## Levels vs Algorithms
|
|
|
|
As mentioned in [the relevant discussion on the spec
|
|
document](https://github.com/grpc/grpc/blob/master/doc/compression.md#compression-levels-and-algorithms),
|
|
compression _levels_ are the primary mechanism for compression selection _at the
|
|
server side_. In the future, it'll also be at the client side. The use of levels
|
|
abstracts away the intricacies of selecting a concrete algorithm supported by a
|
|
peer, on top of removing the burden of choice from the developer.
|
|
As of this writing (Q2 2016), clients can only specify compression _algorithms_.
|
|
Clients will support levels as soon as an automatic retry/negotiation mechanism
|
|
is in place.
|
|
|
|
## Per Channel Settings
|
|
|
|
Compression may be configured at channel creation. This is a convenience to
|
|
avoid having to repeatedly configure compression for every call. Note that any
|
|
compression setting on individual [calls](#per-call-settings) or
|
|
[messages](#per-message-settings) overrides channel settings.
|
|
|
|
The following aspects can be configured at channel-creation time via channel arguments:
|
|
|
|
#### Disable Compression _Algorithms_
|
|
|
|
Use the channel argument key
|
|
`GRPC_COMPRESSION_CHANNEL_ENABLED_ALGORITHMS_BITSET` (from
|
|
[`grpc/impl/codegen/compression_types.h`](https://github.com/grpc/grpc/blob/master/include/grpc/impl/codegen/compression_types.h)),
|
|
takes a 32 bit bitset value. A set bit means the algorithm with that enum value
|
|
according to `grpc_compression_algorithm` is _enabled_.
|
|
For example, `GRPC_COMPRESS_GZIP` currently has a numeric value of 2. To
|
|
enable/disable GZIP for a channel, one would set/clear the 3rd LSB (eg, 0b100 =
|
|
0x4). Note that setting/clearing 0th position, that corresponding to
|
|
`GRPC_COMPRESS_NONE`, has no effect, as no-compression (a.k.a. _identity_) is
|
|
always supported.
|
|
Incoming messages compressed (ie, encoded) with a disabled algorithm will result
|
|
in the call being closed with `GRPC_STATUS_UNIMPLEMENTED`.
|
|
|
|
#### Default Compression _Level_
|
|
|
|
**(currently, Q2 2016, only applicable for server side channels. It's ignored
|
|
for clients.)**
|
|
Use the channel argument key `GRPC_COMPRESSION_CHANNEL_DEFAULT_LEVEL` (from
|
|
[`grpc/impl/codegen/compression_types.h`](https://github.com/grpc/grpc/blob/master/include/grpc/impl/codegen/compression_types.h)),
|
|
valued by an integer corresponding to a value from the `grpc_compression_level`
|
|
enum.
|
|
|
|
#### Default Compression _Algorithm_
|
|
|
|
Use the channel argument key `GRPC_COMPRESSION_CHANNEL_DEFAULT_ALGORITHM` (from
|
|
[`grpc/impl/codegen/compression_types.h`](https://github.com/grpc/grpc/blob/master/include/grpc/impl/codegen/compression_types.h)),
|
|
valued by an integer corresponding to a value from the `grpc_compression_level`
|
|
enum.
|
|
|
|
## Per Call Settings
|
|
|
|
### Compression **Level** in Call Responses
|
|
|
|
The server requests a compression level via initial metadata. The
|
|
`send_initial_metadata` `grpc_op` contains a `maybe_compression_level` field
|
|
with two fields, `is_set` and `compression_level`. The former must be set when
|
|
actively choosing a level to disambiguate the default value of zero (no
|
|
compression) from the proactive selection of no compression.
|
|
|
|
The core will receive the request for the compression level and automatically
|
|
choose a compression algorithm based on its knowledge about the peer
|
|
(communicated by the client via the `grpc-accept-encoding` header. Note that the
|
|
absence of this header means no compression is supported by the client/peer).
|
|
|
|
### Compression **Algorithm** in Call Responses
|
|
|
|
**Server should avoid setting the compression algorithm directly**. Prefer
|
|
setting compression levels unless there's a _very_ compelling reason to choose
|
|
specific algorithms (benchmarking, testing).
|
|
|
|
Selection of concrete compression algorithms is performed by adding a
|
|
`(GRPC_COMPRESS_REQUEST_ALGORITHM_KEY, <algorithm-name>)` key-value pair to the
|
|
initial metadata, where `GRPC_COMPRESS_REQUEST_ALGORITHM_KEY` is defined in
|
|
[`grpc/impl/codegen/compression_types.h`](https://github.com/grpc/grpc/blob/master/include/grpc/impl/codegen/compression_types.h)),
|
|
and `<algorithm-name>` is the human readable name of the algorithm as given in
|
|
[the HTTP2 spec](https://github.com/grpc/grpc/blob/master/doc/PROTOCOL-HTTP2.md)
|
|
for `Message-Encoding` (e.g. gzip, identity, etc.). See
|
|
[`grpc_compression_algorithm_name`](https://github.com/grpc/grpc/blob/master/src/core/lib/compression/compression.c)
|
|
for the mapping between the `grpc_compression_algorithm` enum values and their
|
|
textual representation.
|
|
|
|
## Per Message Settings
|
|
|
|
To disable compression for a specific message, the `flags` field of `grpc_op`
|
|
instances of type `GRPC_OP_SEND_MESSAGE` must have its `GRPC_WRITE_NO_COMPRESS`
|
|
bit set. Refer to
|
|
[`grpc/impl/codegen/compression_types.h`](https://github.com/grpc/grpc/blob/master/include/grpc/impl/codegen/compression_types.h)),
|