- May 31, 2024
-
-
Antoine R. Dumont authored
Prior to this commit, a multiplexer instance would, during its initialization, check the configuration of all its backends and also check whether they can write, even when that was not needed. Since a multiplexer can have read-only backends, that check would simply fail to start the service. This evolves the objstorage to grow a check_write attribute (hence parametric through the configuration file), and adapts the multiplexer constructor accordingly so it checks the configuration of each backend according to its respective configuration. Refs. #4746 Refs. swh/infra/sysadm-environment#5334
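A hypothetical configuration sketch of what this enables; apart from check_write, which this entry names, the key names and values are assumptions about the usual multiplexer layout, not taken from this commit:
```
# Illustrative only: a multiplexer with one backend marked as read-only,
# so the write check is skipped for it at service startup.
config = {
    "cls": "multiplexer",
    "objstorages": [
        {"cls": "pathslicing", "root": "/srv/objects", "slicing": "0:2/2:4"},
        # assumed read-only backend: do not require write access on startup
        {"cls": "http", "url": "https://objstorage.example.org/", "check_write": False},
    ],
}
```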
-
- May 29, 2024
-
-
Nicolas Dandrimont authored
Some objstorages are configured read-only implicitly by way of permissions, so `check_config` raises a PermissionError instead of returning False. Handle that situation explicitly. Also handle RemoteExceptions with a warning, to support the old situation where PermissionError was not properly wrapped by the RPC layer.
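A minimal sketch of that handling, assuming a check_config(check_write=True) call on the backend; the helper name and the broad except clause standing in for RemoteException are illustrative:
```
import logging

logger = logging.getLogger(__name__)

def backend_is_writable(storage) -> bool:
    """Return whether the backend accepts writes, treating permission
    failures as "read-only" rather than as a fatal error."""
    try:
        return storage.check_config(check_write=True)
    except PermissionError:
        # Backend made read-only by filesystem/bucket permissions.
        return False
    except Exception as exc:
        # Stand-in for a RemoteException from older RPC layers that did not
        # translate PermissionError properly; warn and assume read-only.
        logger.warning("check_config failed on %r: %s", storage, exc)
        return False
```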
-
Nicolas Dandrimont authored
`check_config` raises a PermissionError under some circumstances, and it's nicer to be able to handle that directly than to have to unwrap a RemoteException.
-
- May 23, 2024
-
-
Nicolas Dandrimont authored
The `__del__` method gets called on object teardown in most cases. Specifically, it can be called when `__init__` fails, in which case not all attributes are guaranteed to be set. Closes #4744.
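A minimal sketch of the defensive pattern this implies; the class and attribute names are illustrative, not the actual objstorage code:
```
class FragileStorage:
    def __init__(self, path):
        self.handle = open(path, "rb")  # may raise before self.handle exists

    def __del__(self):
        # __del__ can run even if __init__ raised part-way through, so do
        # not assume attributes set in __init__ are present.
        handle = getattr(self, "handle", None)
        if handle is not None:
            handle.close()
```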
-
Nicolas Dandrimont authored
Under some circumstances (e.g. network instability, service restarts, etc.), one of the object storages backing a multiplexer instance might be temporarily unavailable. Instead of immediately raising an exception, temporarily disable the backend for further accesses and attempt reading the object from other backends. Raise a specific `NoBackendsLeftError` if all the backends have been disabled due to a transient exception.
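A rough sketch of the fallback logic described here, under the assumption that transient errors look like ConnectionError; the real implementation and its re-enabling policy may differ:
```
class NoBackendsLeftError(Exception):
    """All backends have been disabled by transient errors."""

def get_from_backends(backends, disabled: set, obj_id: bytes) -> bytes:
    for backend in backends:
        if backend in disabled:
            continue
        try:
            return backend.get(obj_id)
        except ConnectionError:
            # Transient failure: disable this backend for later calls and
            # fall through to the next one.
            disabled.add(backend)
    raise NoBackendsLeftError(f"no backend left to read {obj_id.hex()}")
```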
-
Nicolas Dandrimont authored
-
Nicolas Dandrimont authored
-
David Douard authored
Add a simple timing statsd probe for these methods. For MultiplexerObjStorage.get(), also send a counter metric indicating which backend returned the object. The counter has a 'backend' tag with the name of the backend that returned the object, and a 'backend_number' tag, so it is easier to figure out how many fallbacks had to be tried to actually retrieve a content object. Note that it does not report timing probes for each backend, since these metrics are already reported by the backends themselves.
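A sketch of what such a counter call could look like, assuming swh.core.statsd's module-level client and a `name` attribute on backends (matching the May 23 entry about naming backends); the metric name is illustrative, not verified against the code:
```
from swh.core.statsd import statsd  # assumed module-level statsd client

def record_backend_hit(backend, backend_number: int) -> None:
    # One increment per successful get(), tagged so dashboards can show both
    # which backend answered and how deep in the fallback chain it sits.
    statsd.increment(
        "swh_objstorage_multiplexer_backend_hit_total",
        tags={"backend": backend.name, "backend_number": str(backend_number)},
    )
```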
-
- May 22, 2024
-
-
David Douard authored
It used to do nothing, which is inconsistent with the behavior of the HTTPReadOnlyObjStorage. Adapt the multiplexer a bit to handle this properly:
- keep a list of RW storage threads separate from the full list of backend storage threads,
- make the decision using a call to check_config(check_write=True) for each of the backend storages,
- adapt the API/behavior of the check_config() method a bit so it returns False when called with check_write=True for a RO objstorage,
- for the multiplexer, make check_config(check_write=True) return True if at least one backend objstorage is green (see the sketch below),
- remove references to deleted filters in the multiplexer docstring.
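A condensed sketch of the check_config aggregation rule from the last two points; the all() behavior for check_write=False is an assumption:
```
def multiplexer_check_config(backends, check_write: bool) -> bool:
    results = [b.check_config(check_write=check_write) for b in backends]
    if check_write:
        # Writable as soon as one backend accepts writes.
        return any(results)
    # Assumed: for read checks, every backend must be correctly configured.
    return all(results)
```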
-
David Douard authored
Add an 'Error' suffix so there is no confusion, especially with regard to the new ReadOnlyProxyObjStorage proxy objstorage class (even if it's not the same name, it can be very confusing).
-
David Douard authored
Move the implementation to swh/objstorage/multiplexer.py, since there is nothing left in swh/objstorage/multiplexer/ but this class. Make the constructor able to instantiate the encapsulated objstorages so we can get rid of the _construct_multiplexer_objstorage factory function.
-
David Douard authored
Get rid of the generic ObjStorageFilter complexity, since we only have a single filter. This also allows getting rid of the _construct_filtered_objstorage() factory function in factory.py; the latter is only deprecated for now for backward compatibility, and uses the new ReadOnlyObjstorageProxy class instead.
-
David Douard authored
This will allow using this name to add statsd probes, e.g. counting which backend has been successful in a multiplexed backend.
-
- May 05, 2024
-
-
Nicolas Dandrimont authored
Gunicorn access logs on RPC backends are pretty noisy by default, but grouping successful requests into merged logs would make that more bearable. To use it, stick this in the gunicorn logging config:
```
filters:
  throttle_accesslog:
    (): swh.objstorage.backends.winery.gunicorn.ThrottledAccessLog
    interval: 60
    status_codes: [200]
handlers:
  gunicorn.access:
    level: INFO
    filters: [throttle_accesslog]
    handlers: [<your handler>]
```
-
Nicolas Dandrimont authored
Since recent (?) gunicorn versions, it looks like worker_exit is called rather than worker_int on graceful shutdowns.
-
- May 04, 2024
-
-
Nicolas Dandrimont authored
`CREATE TABLE x (LIKE y INCLUDING ALL)` doesn't replicate the options from the template table, so do it by hand.
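For illustration, a sketch of the workaround, assuming a psycopg2-style cursor and shard tables created from the shard_template table mentioned in the Apr 24 entries; the autovacuum option is a placeholder for whatever options the template actually carries:
```
def create_shard_table(cur, name: str) -> None:
    # LIKE ... INCLUDING ALL copies columns, defaults, constraints and
    # indexes, but not the storage parameters set on the template table,
    # so re-apply those explicitly.
    cur.execute(f'CREATE TABLE "{name}" (LIKE shard_template INCLUDING ALL)')
    cur.execute(f'ALTER TABLE "{name}" SET (autovacuum_enabled = false)')
```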
-
- Apr 25, 2024
-
-
Nicolas Dandrimont authored
We run winery writers in active/active mode on two hosts, but only usually route requests to the host which is currently managing the primary database. When the writer fails over, shards would keep being locked in WRITING mode, even though they're idle. Implement a RW shard idle timeout with a watchdog thread which gets pinged every time an object is added. The timeout defaults to 5 minutes, but can be adjusted with a configuration variable.
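A minimal sketch of such a watchdog, assuming a callback that releases the shard; the names and the use of threading.Timer are illustrative:
```
import threading
from typing import Callable, Optional

class IdleTimeoutWatchdog:
    def __init__(self, on_idle: Callable[[], None], timeout: float = 300.0):
        self.on_idle = on_idle          # e.g. release the RW shard lock
        self.timeout = timeout          # 5 minutes by default
        self._timer: Optional[threading.Timer] = None

    def ping(self) -> None:
        """Called every time an object is added: restart the countdown."""
        if self._timer is not None:
            self._timer.cancel()
        self._timer = threading.Timer(self.timeout, self.on_idle)
        self._timer.daemon = True
        self._timer.start()

    def stop(self) -> None:
        if self._timer is not None:
            self._timer.cancel()
```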
-
Nicolas Dandrimont authored
-
Nicolas Dandrimont authored
The init()/uninit() pattern was pretty confusing. Instead, use an on_shutdown hook on the main storage, that calls down to the relevant components' hooks, and can be called explicitly, or by gunicorn when it shuts down a worker.
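A schematic view of the hook cascade, with made-up component names (the actual winery components differ):
```
class Component:
    def on_shutdown(self) -> None:
        """Release resources; safe to call more than once."""

class Storage:
    def __init__(self, components: list[Component]):
        self.components = components

    def on_shutdown(self) -> None:
        # Called explicitly, or from gunicorn's worker-exit hook.
        for component in self.components:
            component.on_shutdown()
```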
-
Nicolas Dandrimont authored
This test would always return False, even if the file was full of zeros.
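For reference, a correct version of such a check could look like this sketch: stream the file and return True only if every byte is zero.
```
def file_is_all_zeros(path: str, chunk_size: int = 1 << 20) -> bool:
    with open(path, "rb") as f:
        while chunk := f.read(chunk_size):
            if chunk.count(0) != len(chunk):
                return False
    return True
```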
-
Nicolas Dandrimont authored
Now that we have connections to a single database, we don't need this complexity anymore.
-
- Apr 24, 2024
-
-
Nicolas Dandrimont authored
The semantics of creating and locking an arbitrary shard to record that it has been mapped make little sense.
-
Nicolas Dandrimont authored
Now that databases are merged, we can add objects in a single transaction instead of bouncing around with a 2-phase commit.
-
Nicolas Dandrimont authored
-
Nicolas Dandrimont authored
Instead of winery managing multiple databases by itself, creating quite a complex database management lifecycle, simplify things drastically by using the common SWH database management scaffolding. Instead of separate databases, read-write shards now use separate tables within the same database, created from the `shard_template` template table. The throttler tables are also merged into the main database. We completely drop the `shard_dsn` argument to all winery components.
-
Nicolas Dandrimont authored
When a packing operation is interrupted, the image can be left with stale contents. Clean these stale contents up by zeroing the image (either with blkdiscard or with ftruncate, as appropriate) before writing the objects to the image.
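A sketch of the cleanup step, assuming block devices are discarded with blkdiscard and regular files are zeroed by truncation; the real selection logic lives in the pool implementations:
```
import subprocess

def clean_image(path: str, size: int, is_block_device: bool) -> None:
    if is_block_device:
        # Mapped RBD image: discard every block so stale contents disappear.
        subprocess.run(["blkdiscard", path], check=True)
    else:
        # File-backed image: truncate away old contents, then restore size.
        with open(path, "r+b") as f:
            f.truncate(0)
            f.truncate(size)
```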
-
Nicolas Dandrimont authored
`rbd image create` fails when the image already exists, so match that behavior in the file-backed pool emulator.
-
Nicolas Dandrimont authored
These operations can be restarted from scratch, so it's much better to release the lock on the affected shards when the operations fail.
-
- Apr 23, 2024
-
-
Nicolas Dandrimont authored
-
Nicolas Dandrimont authored
Some ceph clusters (for instance, the one spawned by the ceph/demo docker image) require some options to be passed to rbd device map. Add support for configuring these options in all relevant functions.
-
Nicolas Dandrimont authored
This is useful to use a manually managed pool for all tests, e.g. when using a toy ceph cluster generated by the ceph demo image. Example usage (needs /etc/ceph to be empty!):
```
docker run -d --net=host -v /etc/ceph:/etc/ceph -e MON_IP=192.168.1.201 -e CEPH_PUBLIC_NETWORK=192.168.1.0/24 -e CEPH_DEMO_UID=test-user quay.io/ceph/demo
sudo chmod -R a+r /etc/ceph
ceph config set global mon_allow_pool_size_one true
ceph osd pool create winery-test-shards replicated --size=1 --yes-i-really-mean-it
ceph osd pool create winery-test-shards-data replicated --size=1 --yes-i-really-mean-it
ceph osd pool application enable winery-test-shards rbd
ceph osd pool application enable winery-test-shards-data rbd
CEPH_HARDCODE_POOL=winery-test-shards RBD_MAP_OPTIONS=ms_mode=prefer-secure pytest swh -k 'winery and not bench_real'
```
-
Nicolas Dandrimont authored
These were copy-and-pasted from the rbd function, generating nonsensical names.
-
Nicolas Dandrimont authored
-
Nicolas Dandrimont authored
-
- Apr 19, 2024
-
-
Nicolas Dandrimont authored
When an addition has been interrupted, a tmpfile can be lying around. A lot of pathslicing objstorages are legacy, read-only partitions, so removing these bogus files is not an option. Instead of breaking iteration, just print a warning and keep on going.
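A sketch of the iteration behavior, with an assumed "tmp" prefix for leftover files; the actual naming scheme may differ:
```
import logging
import os

logger = logging.getLogger(__name__)

def iter_object_files(root: str):
    for dirpath, _dirnames, filenames in os.walk(root):
        for filename in filenames:
            if filename.startswith("tmp"):
                # Leftover from an interrupted addition on a read-only
                # partition: warn and keep iterating instead of failing.
                logger.warning("Ignoring leftover temporary file %s/%s",
                               dirpath, filename)
                continue
            yield os.path.join(dirpath, filename)
```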
-
- Apr 18, 2024
-
-
Nicolas Dandrimont authored
Instead of failing fast when getting an ObjCorruptedError, try to read from other objstorages (and report the error in logging). Should work around Sentry issue SWH-INDEXER-YQ
-
- Apr 02, 2024
-
-
Antoine Lambert authored
Wrap object storage backends common code related to compressing an object to store in a utility method of the base ObjStorage class.
-
Antoine Lambert authored
Wrap object storage backends common code related to decompressing a stored object in a utility method of the base ObjStorage class, in order to centralize error handling. Catch possible exceptions decompressors might raise and raise an ObjCorruptedError exception in such cases. Related to swh-scrubber#4694.
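A minimal sketch of that centralization, using zlib as the example compressor; the base class handles several compression formats and the names here are illustrative:
```
import zlib

class ObjCorruptedError(Exception):
    """A stored object failed an integrity check."""

def decompress(data: bytes) -> bytes:
    try:
        return zlib.decompress(data)
    except zlib.error as exc:
        # Any decompressor failure is reported as object corruption.
        raise ObjCorruptedError(f"decompression failed: {exc}") from exc
```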
-
Antoine Lambert authored
Implement object check in base ObjStorage class instead of duplicating that code in every object storage backend. Add an ObjCorruptedError exception raised when an object corruption is detected. Improve exception messages. Normalize imports of exception classes. Related to swh-scrubber#4694.
-
- Mar 29, 2024
-
-
vlorentz authored
-