rclone

Author	SHA1	Message	Date
Aleksandar Janković	5470d34740	backend/s3: use low-level-retries as the number of SDK retries Amazon S3 is built to handle different kinds of workloads. In rare cases where S3 is not able to scale for whatever reason users will face status 500 errors. Main mechanism for handling these errors are retries. Amount of needed retries varies for each different use case. This change is making retries for s3 backend configurable by using --low-level-retries option.	2020-02-24 16:43:44 +01:00
Maciej Zimnoch	ac9cb50fdb	backend/s3: use memory pool for buffer allocations Currently each multipart upload allocated his own buffers, which after file upload was garbaged. Next files couldn't leverage already allocated memory which resulted in inefficent memory management. This change introduces backend memory pool keeping memory chunks which can be used during object operations. Fixes #3967	2020-02-24 13:32:32 +01:00
Michał Matczuk	e75c1f70bb	backend/s3: Added 500 as retryErrorCode The error code 500 Internal Error indicates that Amazon S3 is unable to handle the request at that time. The error code 503 Slow Down typically indicates that the requests to the S3 bucket are very high, exceeding the request rates described in Request Rate and Performance Guidelines. Because Amazon S3 is a distributed service, a very small percentage of 5xx errors are expected during normal use of the service. All requests that return 5xx errors from Amazon S3 can and should be retried, so we recommend that applications making requests to Amazon S3 have a fault-tolerance mechanism to recover from these errors. https://aws.amazon.com/premiumsupport/knowledge-center/http-5xx-errors-s3/	2020-02-12 11:43:18 +00:00
Michał Matczuk	19a4d74ee7	backend/s3: Fail fast multipart upload When a part upload request fails error is returned and gCtx is cancelled. This does not prevent from other parts being tried. They immediately fail due to a canceled context, but are retried by rclone anyway... Example AWS debug output ``` ----------------------------------------------------- 2020/02/11 14:12:17 DEBUG: Retrying Request s3/UploadPart, attempt 4 2020/02/11 14:12:17 DEBUG: Request s3/UploadPart Details: ---[ REQUEST POST-SIGN ]----------------------------- PUT /backuptest-rclone/huge/file.db?partNumber=11&uploadId=190939b4-3c43-4b98-ac11-92303e3f11b0 HTTP/1.1 Host: 192.168.100.99:9000 User-Agent: aws-sdk-go/1.23.8 (go1.13.1; linux; amd64) Content-Length: 5242880 Authorization: AWS4-HMAC-SHA256 Credential=miniouser/20200211/us-east-1/s3/aws4_request, SignedHeaders=content-length;content-md5;expect;host;x-amz-content-sha256;x-amz-date, Signature=3fc03a01f651cec09b05290459e9ceb26db9a8aa00c4e1b16e8cf5617eb81da8 Content-Md5: XzY+DlipXwbL6bvGYsXftg== Expect: 100-Continue X-Amz-Content-Sha256: c036cbb7553a909f8b8877d4461924307f27ecb66cff928eeeafd569c3887e29 X-Amz-Date: 20200211T131217Z Accept-Encoding: gzip ----------------------------------------------------- http://192.168.100.99:9000/backuptest-rclone/huge/file.db?partNumber=11&uploadId=190939b4-3c43-4b98-ac11-92303e3f11b0 2020/02/11 14:12:17 DEBUG: Response s3/UploadPart Details: ---[ RESPONSE ]-------------------------------------- HTTP/1.1 500 InternalServerError Content-Length: 0 ----------------------------------------------------- UploadPartWithContext() error InternalError: We encountered an internal error. Please try again status code: 500, request id: , host id: 2020/02/11 14:12:18 DEBUG ERROR: Request s3/UploadPart: ---[ REQUEST DUMP ERROR ]----------------------------- context canceled ------------------------------------------------------ UploadPartWithContext() error RequestCanceled: request context canceled caused by: context canceled 2020/02/11 14:12:20 DEBUG ERROR: Request s3/UploadPart: ---[ REQUEST DUMP ERROR ]----------------------------- context canceled ------------------------------------------------------ UploadPartWithContext() error RequestCanceled: request context canceled caused by: context canceled 2020/02/11 14:12:22 DEBUG ERROR: Request s3/UploadPart: ---[ REQUEST DUMP ERROR ]----------------------------- context canceled ------------------------------------------------------ UploadPartWithContext() error RequestCanceled: request context canceled caused by: context canceled ``` This adds a fail fast behaviour in case the context was cancelled.	2020-02-12 11:40:34 +00:00
Nick Craig-Wood	90377f5e65	s3: Specify that Minio supports URL encoding in listings Thanks to @harshavardhana for pointing this out See #3934 for background	2020-02-09 12:03:20 +00:00
Dave Koston	9f99c20232	s3: Add StackPath Object Storage Support	2020-01-31 16:05:44 +00:00
Nick Craig-Wood	bafe7d5a73	backends: move encoding definitions from fs/encodings	2020-01-16 14:40:36 +00:00
Nick Craig-Wood	3c620d521d	backend: adjust backends to have encoding parameter Fixes #3761 Fixes #3836 Fixes #3841	2020-01-16 14:40:36 +00:00
Nick Craig-Wood	b6e86b2c7f	s3: fix missing x-amz-meta-md5chksum headers for multipart uploads This reverts "s3: fix DisableChecksum condition" which introduced the problem. This reverts commit `c05bb63f96`. The code was correct as it stands - the comment was incorrect and this commit updates it. See: https://forum.rclone.org/t/s3-upload-md5-check-sum/13706	2020-01-07 19:39:39 +00:00
Tennix	15d19131bd	s3: use aws web identity role provider	2020-01-05 19:49:31 +00:00
Nick Craig-Wood	9d993e584b	s3: force path style bucket access to off for AWS deprecation AWS are deprecating path style bucket access so rclone should stop using it by default for this provider. This change shouldn't break any workflows as all AWS endpoints support virtual hosted style lookups of buckets. It may even improve performance. See: https://aws.amazon.com/blogs/aws/amazon-s3-path-deprecation-plan-the-rest-of-the-story/	2020-01-05 17:53:45 +00:00
Nick Craig-Wood	7242c7ce95	s3: fix multipart upload uploading 0 length files This regression was introduced by the recent re-write of the s3 multipart upload code.	2020-01-05 12:32:55 +00:00
Nick Craig-Wood	7e6fac8b1e	s3: re-implement multipart upload to fix memory issues There have been quite a few reports of problems with the multipart uploader using too much memory and not retrying possible errors. Before this change the multipart uploader used the s3manager abstraction in the AWS SDK. There are numerous bug reports of this using up too much memory. This change re-implements a much simplified version of the s3manager code specialized for rclone's purposes. This should use much less memory and retry chunks properly. See: https://forum.rclone.org/t/memory-usage-s3-alike-to-glacier-without-big-directories/13563 See: https://forum.rclone.org/t/copy-from-local-to-s3-has-high-memory-usage/13405 See: https://forum.rclone.org/t/big-file-upload-to-s3-fails/13575	2020-01-03 22:19:28 +00:00
Thomas Kriechbaumer	584e705c0c	s3: introduce list_chunk option for bucket listing The S3 ListObject API returns paginated bucket listings, with "MaxKeys" items for each GET call. The default value is 1000 entries, but for buckets with millions of objects it might make sense to request more elements per request, if the backend supports it. This commit adds a "list_chunk" option for the user to specify a lower or higher value. This commit does not add safe guards around this value - if a user decides to request a too large list, it might result in connection timeouts (on the server or client). In AWS S3, there is a fixed limit of 1000, some other services might have one too. In Ceph, this can be configured in RadosGW.	2020-01-02 12:15:01 +00:00
Outvi V	db1c7f9ca8	s3: Add new region Asia Patific (Hong Kong)	2020-01-02 11:10:48 +00:00
Nick Craig-Wood	0ecb8bc2f9	s3: fix url decoding of NextMarker - fixes #3799 Before this patch we were failing to URL decode the NextMarker when url encoding was used for the listing. The result of this was duplicated listings entries for directories with >1000 entries where the NextMarker was a file containing a space.	2019-12-12 13:33:30 +00:00
Nick Craig-Wood	0d10640aaa	s3: add --s3-copy-cutoff for size to switch to multipart copy Before this change we used the same (relatively low limits) for server side copy as we did for multipart uploads. It doesn't make sense to use the same limits since no data is being downloaded or uploaded for a server side copy. This change introduces a new parameter --s3-copy-cutoff to control when the switch from single to multipart server size copy happens and defaults it to the maximum 5GB. This makes server side copies much more efficient. It also fixes the erroneous error when trying to set the modification time of a file bigger than 5GB. See #3778	2019-12-03 10:37:55 +00:00
Nick Craig-Wood	f4746f5064	s3: fix multipart copy - fixes #3778 Before this change multipart copies were giving the error Range specified is not valid for source object of size This was due to an off by one error in the range source introduced in `7b1274e29a` "s3: support for multipart copy"	2019-12-03 10:37:55 +00:00
Aleksandar Janković	c05bb63f96	s3: fix DisableChecksum condition	2019-12-02 15:15:59 +00:00
Nick Craig-Wood	9b5308144f	s3: Reduce memory usage streaming files by reducing max stream upload size Before this change rclone would allow the user to stream (eg with rclone mount, rclone rcat or uploading google photos or docs) 5TB files. This meant that rclone allocated 4 * 525 MB buffers per transfer which is way too much memory by default. This change makes rclone use the configured chunk size for streamed uploads. This is 5MB by default which means that rclone can stream upload files up to 48GB by default staying below the 10,000 chunks limit. This can be increased with --s3-chunk-size if necessary. If rclone detects that a file is being streamed to s3 it will make a single NOTICE level log stating the limitation. This fixes the enormous memory usage. Fixes #3568 See: https://forum.rclone.org/t/how-much-memory-does-rclone-need/12743	2019-11-09 15:55:19 +00:00
Aleksandar Jankovic	4b20afa94a	backend/s3: fix ExpiryWindow value ExpiryWindow accepts duration but it was set to value 3. This changes it to 3 * time.Minute since default is 5 min.	2019-11-05 13:55:55 +00:00
Nick Craig-Wood	ab895390f4	s3: fix nil pointer reference if no metadata returned for object Fixes #3651 Fixes #3652	2019-10-25 13:45:47 +01:00
庄天翼	7b1274e29a	s3: support for multipart copy Fixes #2375 Fixes #3579	2019-10-04 16:49:06 +01:00
Aleksandar Jankovic	6b55b8b133	s3: add option for multipart failiure behaviour This is needed for resuming uploads across different sessions.	2019-10-02 16:49:16 +01:00
Nick Craig-Wood	6e053ecbd0	s3: only ask for URL encoded directory listings if we need them on Ceph This works around a bug in Ceph which doesn't encode CommonPrefixes when using URL encoded directory listings. See: https://tracker.ceph.com/issues/41870	2019-09-30 22:00:24 +01:00
Fabian Möller	33f129fbbc	s3: use lib/encoder Co-authored-by: Nick Craig-Wood <nick@craig-wood.com>	2019-09-30 22:00:24 +01:00
Nick Craig-Wood	a8adce9c59	s3: fix encoding for control characters - Fixes #3345	2019-09-30 22:00:24 +01:00
Anthony Rusdi	899f285319	s3: fix signature v2_auth headers When used with v2_auth = true, PresignRequest doesn't return signed headers, so remote dest authentication would be fail. This commit copying back HTTPRequest.Header to headers. Tested with RiakCS v2.1.0. Signed-off-by: Anthony Rusdi <33247310+antrusd@users.noreply.github.com>	2019-09-21 14:38:51 +01:00
Nick Craig-Wood	25786cafd3	s3: fix SetModTime on GLACIER/ARCHIVE objects and implement set/get tier - Read the storage class for each object - Implement SetTier/GetTier - Check the storage class on the object before using SetModTime This updates the fix in `1a2fb52` so that SetModTime works when you are using objects which have been migrated to GLACIER but you aren't using GLACIER as a storage class. Fixes #3522	2019-09-14 09:18:55 +01:00
Nick Craig-Wood	66c23723e3	Add context to all http.NewRequest #3257 When we drop support for go1.12 we can use http.NewRequestWithContext	2019-09-09 23:27:07 +01:00
Nick Craig-Wood	6f16588123	s3,b2,googlecloudstorage,swift,qingstor,azureblob: fixes after code review #3421 - change the interface of listBuckets() removing dir parameter and adding context - add makeBucket() and use in place of Mkdir("") - this fixes some corner cases in Copy/Update - mark all the listed buckets OK in ListR Thanks to @yparitcher for the review.	2019-08-22 23:06:59 +01:00
Nick Craig-Wood	eaaf2ded94	s3: make all operations work from the root #3421	2019-08-17 10:30:41 +01:00
Nick Craig-Wood	e502be475a	azureblob/b2/dropbox/gcs/koofr/qingstor/s3: fix 0 length files In `0386d22cc9` we introduced a test for 0 length files read the way mount does. This test failed on these backends which we fix up here.	2019-08-06 15:18:08 +01:00
Nick Craig-Wood	57d5de6fba	build: fix up package paths after repo move git grep -l github.com/ncw/rclone \| xargs -d'\n' perl -i~ -lpe 's\|github.com/ncw/rclone\|github.com/rclone/rclone\|g' goimports -w `find . -name \*.go`	2019-07-28 18:47:38 +01:00
Matti Niemenmaa	a6dca4c13f	s3: Add INTELLIGENT_TIERING storage class For Intelligent-Tiering: https://aws.amazon.com/s3/storage-classes/#Unknown_or_changing_access	2019-07-01 18:17:48 +01:00
Aleksandar Jankovic	f78cd1e043	Add context propagation to rclone - Change rclone/fs interfaces to accept context.Context - Update interface implementations to use context.Context - Change top level usage to propagate context to lover level functions Context propagation is needed for stopping transfers and passing other request-scoped values.	2019-06-19 11:59:46 +01:00
Philip Harvey	1a2fb52266	s3: make SetModTime work for GLACIER while syncing - Fixes #3224 Before this change rclone would fail with Failed to set modification time: InvalidObjectState: Operation is not valid for the source object's storage class when attempting to set the modification time of an object in GLACIER. After this change rclone will re-upload the object as part of a sync if it needs to change the modification time. See: https://forum.rclone.org/t/suspected-bug-in-s3-or-compatible-sync-logic-to-glacier/10187	2019-06-03 15:28:19 +01:00
Robert Marko	5ccc2dcb8f	s3: add config info for Wasabi's EU Central endpoint Wasabi has a EU Central endpoint for a couple months now, so add it to the list. Signed-off-by: Robert Marko <robimarko@gmail.com>	2019-05-15 13:35:55 +01:00
Nick Craig-Wood	b68c3ce74d	s3: suppport S3 Accelerated endpoints with --s3-use-accelerate-endpoint Fixes #3123	2019-05-02 14:00:00 +01:00
Manu	6e86526c9d	s3: add support for "Glacier Deep Archive" storage class - fixes #3088	2019-04-11 10:21:41 +01:00
Fabian Möller	61616ba864	pacer: make pacer more flexible Make the pacer package more flexible by extracting the pace calculation functions into a separate interface. This also allows to move features that require the fs package like logging and custom errors into the fs package. Also add a RetryAfterError sentinel error that can be used to signal a desired retry time to the Calculator.	2019-02-16 14:38:07 +00:00
Nick Craig-Wood	73f0a67d98	s3: Update Dreamhost endpoint - fixes #2974	2019-02-13 21:10:43 +00:00
Fabian Möller	a0d4c04687	backend: fix misspellings	2019-02-07 19:51:03 +01:00
weetmuts	96f6708461	s3: add aws endpoint eu-north-1	2019-02-03 12:17:15 +00:00
Nick Craig-Wood	e31578e03c	s3: Auto detect region for buckets on operation failure - fixes #2915 If an incorrect region error is returned while using a bucket then the region is updated, the session is remade and the operation is retried.	2019-01-27 21:22:49 +00:00
Nick Craig-Wood	39f5059d48	s3: add --s3-bucket-acl to control bucket ACL - fixes #2918 Before this change buckets were created with the same ACL as objects. After this change, the user can set just --s3-acl to set the ACL of buckets and objects, or use --s3-bucket-acl as well to have a different ACL used for bucket creation. This also logs at INFO level the creation and deletion of buckets.	2019-01-18 15:12:11 +00:00
Nick Craig-Wood	1318c6aec8	s3: Add Alibaba OSS to integration tests and fix storage classes	2019-01-12 20:41:47 +00:00
Nick Craig-Wood	ff0b8e10af	s3: Support Alibaba Cloud (Aliyun) OSS The existing s3 backend passed all integration tests with OSS provided `force_path_style = false`. This makes sure that is so and adds documentation and configuration for OSS. Thanks to @luolibin for their work on the OSS backend which we ended up not needing. Fixes #1641 Fixes #1237	2019-01-12 17:28:04 +00:00
William Cocker	8575abf599	s3: add GLACIER storage class Fixes #923	2018-12-06 21:53:05 +00:00
Nick Craig-Wood	d99ffde7c0	s3: change --s3-upload-concurrency default to 4 to increase perfomance #2772 Increasing the --s3-upload-concurrency to 4 (from 2) gives an additional 45% throughput at the cost of 10MB extra memory per transfer. After testing the upload perfoc	2018-12-02 17:58:34 +00:00

1 2

84 commits