writecache: Improve flushing scheme for badger #641
Close #568
Signed-off-by: Anton Nikiforov <an.nikiforov@yadro.com>
Force-pushed from 47cfd47377 to bcb90c455b
@ -23,1 +23,4 @@
flushCh chan *objectSDK.Object
// scheduled4Flush contains objects scheduled for flush via flushCh
// helps to avoid multiple flushing of one object
scheduled4Flush map[oid.Address]any
If it is a set, it is better to use struct{}, not any, as struct{} has zero size and thus could be optimized by the compiler.
Thanks, updated.
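For illustration, a minimal self-contained sketch (string keys stand in for oid.Address, which is not imported here) of why struct{} is the usual choice for set values:

package main

import (
	"fmt"
	"unsafe"
)

// scheduled is a set: struct{} values take no space, while any values are
// two machine words each even when they hold nil.
type scheduled map[string]struct{}

func main() {
	fmt.Println(unsafe.Sizeof(struct{}{})) // 0
	fmt.Println(unsafe.Sizeof(any(nil)))   // 16 on 64-bit platforms

	s := scheduled{}
	s["object-address"] = struct{}{}
	_, ok := s["object-address"] // membership check
	fmt.Println(ok)              // true
}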
@ -38,0 +58,4 @@
if kv.StreamDone {
return nil
}
if c.scheduled >= flushBatchSize {
Flush batch size was there in bbolt because long View transactions prevented the database from growing in size. In badger we may not need this.
The reason I decided to keep it is to allow changing the writecache mode. If the writecache is under pressure and we need to change its mode for any reason, that only becomes possible once all objects have been scheduled. That may take some time, and the external call may fail by timeout.
Is that acceptable for us? Or is it better to interrupt after some time?
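For context, a rough sketch of how the gate is intended to work (the collector type, field names and flushBatchSize follow the diff; the rest is an assumption, and the exact Send signature depends on the badger version):

// processKVList is meant to be called from the badger stream's Send callback
// with the already-decoded KV list.
func (c *collector) processKVList(list *pb.KVList) error {
	for _, kv := range list.Kv {
		if kv.StreamDone {
			return nil
		}
		if c.scheduled >= flushBatchSize {
			// Enough objects queued for this cycle: cancel the stream context so
			// Orchestrate returns soon and a pending mode change is not blocked
			// behind a full pass over the database.
			c.cancel()
			return nil
		}
		// ... unmarshal kv.Value and hand the object over to flushCh ...
		c.scheduled++
	}
	return nil
}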
@ -38,0 +62,4 @@
c.cancel()
return nil
}
if got, want := len(kv.Key), len(cid.ID{})+len(oid.ID{}); got != want {
len(internalKey{})?
Yeah, this is better, fixed.
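A possible shape of that suggestion (sizes are an assumption: container and object IDs are 32-byte SHA-256 values in frostfs-sdk-go):

const idSize = 32 // assumed size of both cid.ID and oid.ID

// internalKey is the concatenation of a container ID and an object ID.
type internalKey [2 * idSize]byte

func keyLenOK(key []byte) bool {
	// reads better than len(cid.ID{})+len(oid.ID{}) and stays correct
	// if the key type ever changes
	return len(key) == len(internalKey{})
}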
@ -38,0 +70,4 @@
c.processed++
obj := objectSDK.New()
var val []byte
val = append(val, kv.Value...)
val := bytes.Clone (or slice.Copy)?
Updated, I've chosen append because badger uses it in CopyValue.
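Both spellings make an independent copy; a small sketch of the two options (the reason for copying at all is an assumption: the buffer backing kv.Value is expected to be reused after the callback returns):

// copyValue returns an owned copy of a value taken from the stream.
func copyValue(v []byte) []byte {
	return append([]byte(nil), v...) // what the PR keeps, mirroring badger's CopyValue
	// equivalent since Go 1.20: return bytes.Clone(v)
}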
@ -38,0 +74,4 @@
if err = obj.Unmarshal(val); err == nil {
addr := objectCore.AddressOf(obj)
c.cache.scheduled4FlushMtx.RLock()
_, ok := c.cache.scheduled4Flush[addr]
Correct me if I am wrong: the purpose of this map is to prevent flushing the same object twice?
Right. It is possible that the background routine which stores the data hangs, and on the next iteration the same object would be scheduled again. When using a stream it is impossible to resume from a particular key, there is no such API. Also, the retrieved keys are sorted lexicographically, not in the order they were put into the db.
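A sketch of the check-and-set this map enables (scheduleOnce is a hypothetical helper; field names follow the diff, and the read/insert pair is safe only because Send runs in a single goroutine, as noted below):

// scheduleOnce queues obj for flushing unless its address is already queued.
func (c *cache) scheduleOnce(addr oid.Address, obj *objectSDK.Object) bool {
	c.scheduled4FlushMtx.RLock()
	_, ok := c.scheduled4Flush[addr]
	c.scheduled4FlushMtx.RUnlock()
	if ok {
		return false // already sitting in flushCh, do not queue it twice
	}
	c.scheduled4FlushMtx.Lock()
	c.scheduled4Flush[addr] = struct{}{}
	c.scheduled4FlushMtx.Unlock()
	// The flush worker is expected to delete addr from the map once the object
	// has actually been persisted, so it can be scheduled again later if needed.
	c.flushCh <- obj
	return true
}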
@ -38,0 +91,4 @@
return nil
}
} else {
c.cache.log.Debug(fmt.Sprintf("error unmarshal: %s", err))
Warn or Error (and probably increase the shard error counter): log.Error(logs.Message, zap.Error(err))
Actually, if we increase the error counter, there is no need to log.
Agree, removed the log entry.
@ -129,3 +146,1 @@
c.modeMtx.RUnlock()
return
}
ctx, cancel := context.WithCancel(context.TODO())
Background? Or provide a context to flushSmallObjects?
I think it is better to switch to the global context in a separate task, #642. It requires a lot of refactoring, because we need to change the signature of the Init method.
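A hypothetical sketch of the direction suggested for #642 (names are assumptions):

// flushSmallObjects would derive its context from one supplied by the caller
// (ultimately from Init), so shutting the shard down cancels in-flight
// streaming instead of relying on context.TODO().
func (c *cache) flushSmallObjects(ctx context.Context) {
	ctx, cancel := context.WithCancel(ctx)
	defer cancel()
	// ... build the stream and call stream.Orchestrate(ctx) as in the diff below ...
}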
@ -134,2 +151,2 @@
if count == 0 {
c.modeMtx.RUnlock()
stream := c.db.NewStream()
// Logic within Send method can expect single threaded execution.
can expect or expects?
All calls to Send are done by a single goroutine.
@ -136,0 +151,4 @@
stream := c.db.NewStream()
// Logic within Send method can expect single threaded execution.
stream.Send = coll.Send
if err := stream.Orchestrate(ctx); err != nil {
Does it exit after all objects in the database have been streamed through?
Yes, as opposed to Subscribe.
@ -141,3 +162,2 @@
c.log.Debug(logs.WritecacheTriedToFlushItemsFromWritecache,
zap.Int("count", count),
zap.String("start", base58.Encode(lastKey[:])))
zap.Int("scheduled", coll.scheduled), zap.Int("processed", coll.processed))
GC interval is 1 minute by default currently. It may be beneficial to run it after each flush cycle instead. What do you think? @TrueCloudLab/storage-core-committers @TrueCloudLab/storage-core-developers
If you mean badger's GC, I think that will be useful here once we have scheduled some objects for deletion. We're flushing every second, could that be a problem?
I don't think there has to be any correlation between badger GC and flush.
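If value-log GC were tied to the flush cycle, a minimal sketch could look like this (assuming the badger v4 API; RunValueLogGC rewrites at most one value-log file per call):

// runValueLogGC keeps collecting until badger reports there is nothing left
// worth rewriting (ErrNoRewrite) or another error occurs.
func runValueLogGC(db *badger.DB) {
	for {
		if err := db.RunValueLogGC(0.5); err != nil {
			return // badger.ErrNoRewrite or a real error ends this round
		}
	}
}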
Force-pushed from bcb90c455b to 4c4968f95d
Force-pushed from 4c4968f95d to 7a3c157537
@ -136,0 +153,4 @@
"error during flushing object from wc: %s", err))
}
c.modeMtx.RUnlock()
if coll.scheduled == 0 {
Please explain this line. I don't understand why the flush ends if no objects are scheduled?
It is possible that a few objects still exist in the db while already being scheduled for flush.
To prevent iterating over already scheduled objects again, we need to check the scheduled counter, not the processed one.
This loop runs on a timer every second.
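A rough sketch of that logic (newCollector and streamOnce are hypothetical names standing in for the collector setup and the Orchestrate call from the diff above):

for {
	coll := newCollector(c)
	if err := streamOnce(ctx, c, coll); err != nil {
		return err
	}
	if coll.scheduled == 0 {
		// Nothing new was queued: whatever is left in the db is either already
		// sitting in flushCh or the cache is empty, so stop and let the
		// one-second timer start the next pass.
		return nil
	}
}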
Force-pushed from 7a3c157537 to 1129564e7a