node: Implement Lock\Delete
requests for EC object #1147
Reference: TrueCloudLab/frostfs-node#1147
No description provided.
Signed-off-by: Anton Nikiforov an.nikiforov@yadro.com
8be17871f4 to fc92eb7b5f
Sanity tests pass?
Yes.
@ -362,3 +362,1 @@
default:
commonCmd.ExitOnErr(cmd, "failed to get raw object header: %w", err)
case err == nil:
if err == nil {
Why have you replaced the switch with an else-if chain?

Because the previous implementation was only for SplitInfo, I thought it would be more readable. Reverted the switch back.

IMO it is exactly the opposite: else-if is fine once, but a switch is less verbose for multiple branches.

@ -369,0 +383,4 @@
if err != nil {
commonCmd.ExitOnErr(cmd, "failed to create object id: %w", err)
}
chain = append(chain, objID)
So we add each chunk to the tombstone/lock? That is a problem, because chunks may be missing (with split it cannot be the case, as it would mean DL; with EC it is OK).
Oh, thanks, that was from previous implementation, removed.
Does the new implementation still pass sanity tests?
Yes, they are executed each time a sensitive part of the code is changed.
@ -174,0 +201,4 @@
zap.Stringer("addr", addr),
zap.String("err", err.Error()),
zap.String("trace_id", tracingPkg.GetTraceID(ctx)))
continue
There is an error here that we ignore. What happens to this yet-to-be-removed chunk?
It will be removed by the remover at the next iteration. The behavior is the same as for a complex object, see deleteChildren().

@ -63,3 +64,3 @@
b.ResetTimer()
for i := 0; i < b.N; i++ {
ok, err := e.exists(context.Background(), addr)
ok, _, err := e.exists(context.Background(), addr, oid.Address{})
The interface is confusing: we have two parameters of the same type with different meanings. What about accepting shard.ExistsPrm?

In this case we need to make the fields of ExistsPrm public, are you OK with that?

Implemented in a separate commit.
@ -223,0 +235,4 @@
e.iterateOverUnsortedShards(func(h hashedShard) (stop bool) {
ld, err := h.Shard.GetLocked(ctx, addr)
if err != nil {
e.reportShardError(h, "can't get object's lockers", err, zap.Stringer("addr", addr),
This is a log message, so it should be a const. Also, why didn't the linter fail? cc @achuprov
Thanks, updated.
@ -88,0 +96,4 @@
return err
}
for _, locker := range lockers {
err = e.lock(ctx, addr.Container(), locker, []oid.ID{addr.Object()})
Do we lock an object before we have put it? It seems like a problem, because this lock record can persist indefinitely.
I didn't catch the problem. Here we persist the lock for a chunk before the put, because we need to prevent gc from removing it. This is the reconstruction scenario, when we need to put a chunk on the node: if there is no lock for a chunk, gc will inhume it.

The problem is atomicity: lock -> CRASH -> put, and now we have some garbage lock records which will (?) be removed eventually. We could do it atomically in put instead; this would also ensure we put the info on the same shard.

As a result of the discussion, we need to move gc to the storage engine level. Created #1151 for tracking.

@ -477,0 +498,4 @@
offset := 0
for offset < len(val) {
if bytes.Equal(objKey, val[offset:offset+objectKeySize]) {
val = append(val[:offset], val[offset+objectKeySize:]...)
val is received from getFromBucket. Is it taken from bbolt or freshly allocated? Bbolt prohibits changing values in some cases.

According to the doc, val should be valid for the life of the transaction. Let's clone it.

This line is more important: "// The returned memory is owned by bbolt and must never be modified; writing to this memory might corrupt the database."

That line is from the newest version. Looks like we need to update bbolt.

@ -526,3 +526,3 @@
log.Debug(logs.ShardHandlingExpiredTombstonesBatch, zap.Int("number", len(tssExp)))
s.expiredTombstonesCallback(ctx, tssExp)
if len(tssExp) > 0 {
To be clear: is this an optimization or a functional change?
It is an optimization: with an empty batch we do nothing but take a lock on the metabase, because the callback calls db.boltDB.Update(...).

Approved accidentally, please disregard.

Fixed gopls-run; resolved the issue reported by gopls.

@ -221,3 +224,3 @@
make gopls-install; \
fi
@if [[ $$(find . -type f -name "*.go" -print | xargs $(GOPLS_VERSION_DIR)/gopls check | tee /dev/tty | wc -l) -ne 0 ]]; then \
$(GOPLS_VERSION_DIR)/gopls check $(SOURCES) 2>&1 >$(GOPLS_TEMP_FILE)
Was there any problem with the previous implementation (pipe instead of temp file)?
We are unable to use tee /dev/tty for security reasons. If we replace it with ... check 2>&1 | tee | wc -l, there is no output on error.

998d6a86d7 to 2ea7b6331d
2ea7b6331d to 13f6770f25
13f6770f25 to 9ba4a97276
9ba4a97276 to 2a4f637861