- Feb 10, 2025
-
-
David Douard authored
The quick-start section was a bit outdated.
-
- Sep 10, 2024
-
-
Nicolas Dandrimont authored
Make the throttler window of measurement a module-level constant, and clean up stale rows that are twice as old as the window (10 minutes) instead of one month old.
-
Nicolas Dandrimont authored
All throttler queries filter these tables using ranges of the updated column, so adding a block range index makes these queries much more efficient. Ref. swh/infra/sysadm-environment#5388
-
- Aug 30, 2024
-
-
Antoine Lambert authored
-
- Aug 27, 2024
-
-
David Douard authored
-
- Jul 11, 2024
-
-
Jérémy Bobbio (Lunar) authored
We don’t want to see a full stack trace, as it will very likely be noise since it is a transient error. We still add a new line with the exception itself, though.
-
- Jul 02, 2024
-
-
David Douard authored
and slightly improve a few points.
-
Nicolas Dandrimont authored
This makes the multiplexer raise a proper ReadOnlyObjStorageError when calling one of the write endpoints, and avoids calling check_config on every backend at __init__ time in that situation, which would prevent us from initializing the multiplexer if one of the backends times out.
-
- Jun 04, 2024
-
-
David Douard authored
Add support for an "retry" configuration entry that sets up and configure an urllib3 Retry HTTPAdapter.
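For illustration, a minimal sketch of what such a "retry" entry could translate to, assuming the values are passed straight to urllib3's Retry and mounted on a requests HTTPAdapter (the keys shown are illustrative, not the exact schema):
```
# Hedged sketch: turn a hypothetical "retry" config entry into an urllib3
# Retry mounted on a requests HTTPAdapter. Keys are illustrative assumptions.
import requests
from requests.adapters import HTTPAdapter
from urllib3.util.retry import Retry

retry_config = {  # hypothetical contents of the "retry" configuration entry
    "total": 3,
    "backoff_factor": 0.5,
    "status_forcelist": [413, 429, 503],
}

session = requests.Session()
adapter = HTTPAdapter(max_retries=Retry(**retry_config))
session.mount("http://", adapter)
session.mount("https://", adapter)
```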
-
- May 31, 2024
-
-
Nicolas Dandrimont authored
The request that we were making actually fails with a container shared access signature; it only works for account shared access signatures. Account shared access signatures give much more access than needed, so only parse the URL to (lightly) check that it is valid.
-
- May 29, 2024
-
-
Nicolas Dandrimont authored
Some objstorages are configured read-only implicitly by way of permissions, so `check_config` raises a PermissionError instead of returning False. Handle that situation explicitly. Also handle RemoteExceptions with a warning, to support the old situation where PermissionError was not properly wrapped by the RPC layer.
-
Nicolas Dandrimont authored
`check_config` raises a PermissionError under some circumstances, and it's nicer to be able to handle that directly than to have to unwrap a RemoteException.
-
- May 23, 2024
-
-
Nicolas Dandrimont authored
The `__del__` method gets called on object teardown in most cases. Specifically, it can be called when `__init__` fails, in which case not all attributes are guaranteed to be set. Closes #4744.
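For illustration, a minimal sketch of the defensive pattern this implies (not the actual class):
```
# __del__ may run even if __init__ raised before setting attributes,
# so guard attribute access with getattr().
class FileHandle:
    def __init__(self, path):
        self.f = open(path)  # may raise before self.f is ever set

    def __del__(self):
        f = getattr(self, "f", None)  # attribute may not exist if __init__ failed
        if f is not None:
            f.close()
```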
-
Nicolas Dandrimont authored
Under some circumstances (e.g. network instability, service restarts, etc.), one of the object storages backing a multiplexer instance might be temporarily unavailable. Instead of immediately raising an exception, temporarily disable the backend for further accesses and attempt reading the object from other backends. Raise a specific `NoBackendsLeftError` if all the backends have been disabled due to a transient exception.
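A rough sketch of the read path this describes, with simplified per-backend cool-downs (names other than NoBackendsLeftError are illustrative, and the real multiplexer logic is more involved):
```
import time

class NoBackendsLeftError(Exception):
    """All backends were disabled by transient errors."""

def multiplexer_get(backends, obj_id, disabled_until, cooldown=60.0):
    # skip backends that were recently disabled by a transient error
    now = time.monotonic()
    available = [b for b in backends if disabled_until.get(id(b), 0) <= now]
    if not available:
        raise NoBackendsLeftError(f"no backend left to read {obj_id!r}")
    last_exc = None
    for backend in available:
        try:
            return backend.get(obj_id)
        except (ConnectionError, TimeoutError) as exc:  # transient errors only
            disabled_until[id(backend)] = now + cooldown  # disable temporarily
            last_exc = exc
    raise NoBackendsLeftError(f"all backends failed for {obj_id!r}") from last_exc
```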
-
Nicolas Dandrimont authored
-
Nicolas Dandrimont authored
-
David Douard authored
Add a simple timing statsd probe for these methods. For MultiplexerObjStorage.get(), also send a counter metric indicating which backend returned the object. The counter has a 'backend' tag with the name of the backend that returned the object, and a 'backend_number' tag so it's easier to figure out how many fallbacks had to be tried before a content object was actually retrieved. Note that it does not report timing probes for each backend, since these metrics are already reported by the backends themselves.
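For illustration, a hedged sketch assuming a DogStatsD-style client with tag support (the metric names, the datadog client and the backend.name attribute are assumptions, not the actual swh.objstorage probe):
```
from datadog import statsd  # any tag-aware statsd client would do

def multiplexer_get(backends, obj_id):
    with statsd.timed("swh_objstorage_multiplexer_get_seconds"):  # assumed metric name
        for backend_number, backend in enumerate(backends):
            try:
                obj = backend.get(obj_id)
            except Exception:
                continue  # fall back to the next backend
            # count which backend served the object and how many fallbacks it took
            statsd.increment(
                "swh_objstorage_multiplexer_backend_hits",
                tags=[f"backend:{backend.name}", f"backend_number:{backend_number}"],
            )
            return obj
        raise KeyError(obj_id)
```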
-
- May 22, 2024
-
-
David Douard authored
It used to do nothing, which is inconsistent with the behavior of the HTTPReadOnlyObjStorage. Adapt the multiplexer a bit to handle this properly (sketched below):
- keep a list of RW storage threads separate from the full list of backend storage threads;
- make the decision using a call to check_config(check_write=True) on each of the backend storages;
- adapt the API/behavior of the check_config() method a bit so it returns False when called with 'check_write=True' on a read-only objstorage;
- for the multiplexer, check_config(check_write=True) returns True if at least one backend objstorage is green;
- remove references to the deleted filters in the multiplexer docstring.
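A hedged sketch of the check_config() semantics listed above (the read-path rule shown is an assumption; only the check_write=True behaviour is described in the commit):
```
def multiplexer_check_config(backends, check_write=False):
    statuses = [backend.check_config(check_write=check_write) for backend in backends]
    if check_write:
        # the multiplexer is writable as soon as one backend accepts writes
        return any(statuses)
    # assumption: for reads, require every backend to pass its check
    return all(statuses)
```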
-
David Douard authored
Add an 'Error' suffix so there is no confusion, especially with regard to the new ReadOnlyProxyObjStorage proxy objstorage class (even though it's not the same name, it can easily be confused).
-
David Douard authored
Move the implementation to swh/objstorage/multiplexer.py since there is nothing left in swh/objstorage/multiplexer/ but this class. Make the constructor able to instantiate the encapsulated objstorages so we can get rid of the _construct_multiplexer_objstorage factory function.
-
David Douard authored
Get rid of the generic ObjStorageFilter complexity, since we only have a single filter. This also allows getting rid of the _construct_filtered_objstorage() factory function in factory.py; the latter is only deprecated for now for backward compatibility, but uses the new ReadOnlyObjstorageProxy class instead.
-
David Douard authored
This will allow using this name to add statsd probes, e.g. counting which backend has been successful in a multiplexed backend.
-
- May 05, 2024
-
-
Nicolas Dandrimont authored
Gunicorn access logs on RPC backends are pretty noisy by default, but grouping successful requests into merged logs would make that more bearable. To use it, stick this in the gunicorn logging config:
```
filters:
  throttle_accesslog:
    (): swh.objstorage.backends.winery.gunicorn.ThrottledAccessLog
    interval: 60
    status_codes: [200]
handlers:
  gunicorn.access:
    level: INFO
    filters: [throttle_accesslog]
    handlers: [<your handler>]
```
-
Nicolas Dandrimont authored
Since recent (?) gunicorn versions, it looks like worker_exit is called rather than worker_int on graceful shutdowns.
-
- May 04, 2024
-
-
Nicolas Dandrimont authored
`CREATE TABLE x (LIKE y INCLUDING ALL)` doesn't replicate the options from the template table, so do it by hand.
-
- Apr 25, 2024
-
-
Nicolas Dandrimont authored
We run winery writers in active/active mode on two hosts, but usually only route requests to the host which is currently managing the primary database. When the writer fails over, shards could keep being locked in WRITING mode even though they are idle. Implement a RW shard idle timeout with a watchdog thread which gets pinged every time an object is added. The timeout defaults to 5 minutes, but can be adjusted with a configuration variable.
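For illustration, a hedged sketch of such a watchdog (class and callback names are illustrative, not the actual winery code):
```
import threading

class IdleTimeoutWatchdog:
    def __init__(self, timeout, on_idle):
        self.timeout = timeout      # e.g. 300 seconds (5 minutes) by default
        self.on_idle = on_idle      # called when no write happened for `timeout`
        self._timer = None
        self._lock = threading.Lock()

    def ping(self):
        """Reset the countdown; call this every time an object is added."""
        with self._lock:
            if self._timer is not None:
                self._timer.cancel()
            self._timer = threading.Timer(self.timeout, self.on_idle)
            self._timer.daemon = True
            self._timer.start()

    def stop(self):
        with self._lock:
            if self._timer is not None:
                self._timer.cancel()
                self._timer = None
```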
-
Nicolas Dandrimont authored
-
Nicolas Dandrimont authored
The init()/uninit() pattern was pretty confusing. Instead, use an on_shutdown hook on the main storage, that calls down to the relevant components' hooks, and can be called explicitly, or by gunicorn when it shuts down a worker.
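A minimal sketch of how such a hook can be wired from a gunicorn config file (the objstorage attribute lookup is a hypothetical placeholder for however the app exposes it):
```
# gunicorn.conf.py server hook, called when a worker is about to exit
def worker_exit(server, worker):
    objstorage = getattr(worker.app, "objstorage", None)  # hypothetical attribute
    if objstorage is not None:
        objstorage.on_shutdown()
```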
-
Nicolas Dandrimont authored
This test would always return False, even if the file was full of zeros.
-
Nicolas Dandrimont authored
Now that we have connections to a single database, we don't need this complexity anymore.
-
- Apr 24, 2024
-
-
Nicolas Dandrimont authored
The semantics of creating and locking an arbitrary shard to record that it has been mapped make little sense.
-
Nicolas Dandrimont authored
Now that databases are merged, we can add objects in a single transaction instead of bouncing around with a 2-phase commit.
-
Nicolas Dandrimont authored
-
Nicolas Dandrimont authored
Instead of winery managing multiple databases by itself, creating quite a complex database management lifecycle, simplify things drastically by using the common SWH database management scaffolding. Instead of separate databases, read-write shards now use separate tables within the same database, created from the `shard_template` template table. The throttler tables are also merged into the main database. We completely drop the `shard_dsn` argument to all winery components.
-
Nicolas Dandrimont authored
When a packing operation is interrupted, the image can be left with stale contents. Clean these stale contents up by zeroing the image (either with blkdiscard or with ftruncate, as appropriate) before writing the objects to the image.
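A rough illustration of that cleanup (the path handling and helper are hypothetical): blkdiscard zeroes a block device, ftruncate handles a file-backed image:
```
import os
import stat
import subprocess

def zero_image(path: str, size: int) -> None:
    # hypothetical helper, not the actual winery code
    mode = os.stat(path).st_mode
    if stat.S_ISBLK(mode):
        # zero-fill the block device rather than just discarding
        subprocess.run(["blkdiscard", "--zeroout", path], check=True)
    else:
        # regular file: truncate to zero, then grow back to the expected size
        with open(path, "r+b") as f:
            f.truncate(0)
            f.truncate(size)
```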
-
Nicolas Dandrimont authored
`rbd image create` fails when the image already exists, so match that behavior in the file-backed pool emulator.
-
Nicolas Dandrimont authored
These operations can be restarted from scratch, so it's much better to release the lock on the affected shards when the operations fail.
-
- Apr 23, 2024
-
-
Nicolas Dandrimont authored
-
Nicolas Dandrimont authored
Some ceph clusters (for instance, the one spawned by the ceph/demo docker image) require some options to be passed to rbd device map. Add support for configuring these options in all relevant functions.
-
Nicolas Dandrimont authored
This is useful to use a manually managed pool for all tests, e.g. when using a toy ceph cluster generated by the ceph demo image. Example usage (needs /etc/ceph to be empty!):
```
docker run -d --net=host -v /etc/ceph:/etc/ceph -e MON_IP=192.168.1.201 \
  -e CEPH_PUBLIC_NETWORK=192.168.1.0/24 -e CEPH_DEMO_UID=test-user quay.io/ceph/demo
sudo chmod -R a+r /etc/ceph
ceph config set global mon_allow_pool_size_one true
ceph osd pool create winery-test-shards replicated --size=1 --yes-i-really-mean-it
ceph osd pool create winery-test-shards-data replicated --size=1 --yes-i-really-mean-it
ceph osd pool application enable winery-test-shards rbd
ceph osd pool application enable winery-test-shards-data rbd

CEPH_HARDCODE_POOL=winery-test-shards RBD_MAP_OPTIONS=ms_mode=prefer-secure pytest swh -k 'winery and not bench_real'
```
-