netris-cdc-file-transfer

mirror of https://github.com/nestriness/cdc-file-transfer.git synced 2026-03-17 09:53:06 +02:00

Author	SHA1	Message	Date
Lutz Justen	991f61cc4d	Improve create_release workflow (#20 ) Modifies the create_release workflow in 2 ways: - It only runs now if something is pushed to main. - It creates a tagged release if a tag is pushed. To create a tagged release, run e.g. git tag -a v0.1.0 -m "Release 0.1.0" git push origin v0.1.0	2022-11-25 12:02:54 +01:00
Lutz Justen	9d7eee35bd	Convert src dir to full path (#21 ) The asset stream manager requires a full path. With this CL, you can stream from e.g. ".".	2022-11-25 12:01:29 +01:00
Lutz Justen	fac559b1be	Improve readme (#19 ) Improve readme This CL adds - a history section with references to Stadia - benchmarks - animated gifs with demos - a troubleshooting section - and more info about cdc_stream	2022-11-24 13:33:29 +01:00
Lutz Justen	21a1b37787	Add workflow to create a release (#11 ) Adds a workflow that creates a "latest" release with a description listing all changes and a zip with built binaries (for Windows and Ubuntu 20.04).	2022-11-22 12:14:14 +01:00
wurwunchik	b2c011cc0d	Add new flags to asset stream manager (#10 ) - Add --config-file option defining Json configuration file for asset stream manager - Add log_dir flag for log folder - Remove unused functions from SdkUtils - Fix build issue in cdc_fuse_fs	2022-11-22 12:05:48 +01:00
Lutz Justen	0252d51cc0	Add actions for building and testing (#8 ) * Add a Github action for building and testing On Windows, -- -//third_party/... doesn't seem to work, so add all test directories manually. Also run the tests_*. We run only fastbuild tests here, since the opt tests will be run in the release workflow. Also fix a number of compilation and test issues found along the way.	2022-11-21 23:22:09 +01:00
Lutz Justen	bccc025945	[cdc_stream] Append errors from netstart to status (#9 ) So far, errors from the remote netstat process would only be logged in the asset stream service, for instance when SSH auth failed. However, the errors were not shown to the client, and that's the most important thing. Also adds some feedback to cdc_stream in case of success.	2022-11-21 09:07:05 +01:00
Lutz Justen	269fb2be45	[cdc_stream] Add a CLI client to start/stop asset streaming sessions (#4 ) Implements the cdc_stream client and adjusts asset streaming in various places to work better outside of a GGP environment. This CL tries to get quoting for SSH commands right. It also brings back the ability to start a streaming session from asset_stream_manager. Also cleans up Bazel targets setup. Since the sln file is now in root, it is no longer necessary to prepend ../ to relative filenames to make clicking on errors work.	2022-11-18 10:59:42 +01:00
ljusten	ca84d3dd2e	[cdc_fuse_fs] Fix various issues (#6 ) Fixes a couple of issues with the FUSE: - Creates the mount directory if it does not exist. This assumes the mount dir to be the last arg. Ideally, we'd parse the command line and then create the directory, but unfortunately fuse_parse_cmdline already verifies that the dir exists. - Expands the cache_dir (e.g. ~). - Fixes a compile issue in manifest_iterator.	2022-11-17 14:01:59 +01:00
chrschng	76bbdb01bb	Merge dynamic manifest updates to Github (#7 ) This change introduces dynamic manifest updates to asset streaming. Asset streaming describes the directory to be streamed in a manifest, which is a proto definition of all content metadata. This information is sufficient to answer `stat` and `readdir` calls in the FUSE layer without additional round-trips to the workstation. When a directory is streamed for the first time, the corresponding manifest is created in two steps: 1. The directory is traversed recursively and the inode information of all contained files and directories is written to the manifest. 2. The content of all identified files is processed to generate each file's chunk list. This list is part of the definition of a file in the manifest. * The chunk boundaries are identified using our implementation of the FastCDC algorithm. * The hash of each chunk is calculated using the BLAKE3 hash function. * The length and hash of each chunk is appended to the file's chunk list. Prior to this change, when the user mounted a workstation directory on a client, the asset streaming server pushed an intermediate manifest to the gamelet as soon as step 1 was completed. At this point, the FUSE client started serving the virtual file system and was ready to answer `stat` and `readdir` calls. In case the FUSE client received any call that required file contents, such as `read`, it would block the caller until the server completed step 2 above and pushed the final manifest to the client. This works well for large directories (> 100GB) with a reasonable number of files (< 100k). But when dealing with millions of tiny files, creating the full manifest can take several minutes. With this change, we introduce dynamic manifest updates. When the FUSE layer receives an `open` or `readdir` request for a file or directory that is incomplete, it sends an RPC to the workstation about what information is missing from the manifest. The workstation identifies the corresponding file chunker or directory scanner tasks and moves them to the front of the queue. As soon as the task is completed, the workstation pushes an updated intermediate manifest to the client which now includes the information to serve the FUSE request. The queued FUSE request is resumed and returns the result to the caller. While this does not reduce the required time to build the final manifest, it splits up the work into smaller tasks. This allows us to interrupt the current work and prioritize those tasks which are required to handle an incoming request from the client. While this still takes a round-trip to the workstation plus the processing time for the task, an updated manifest is received within a few seconds, which is much better than blocking for several minutes. This latency is only visible when serving data while the manifest is still being created. The situation improves as the manifest creation on the workstation progresses. As soon as the final manifest is pushed, all metadata can be served directly without having to wait for pending tasks.	2022-11-16 11:20:32 +01:00
ljusten	23fcd5ef1d	Improve cdc_fuse_fs and path (#2 ) Improve cdc_fuse_fs and path Improves the error handling in path so that std:error_codes are not assumed to be of system category, and also that their messages are displayed. Also improves debug messages in GameletComponent.	2022-11-15 12:53:02 +01:00
ljusten	9fdccb3548	Remove GGP dependencies from CDC RSync (#1 ) * Remove dependencies of cdc_sync from GGP Allows overriding the SSH and SCP commands via command line flags. Hence, strict host checking, SSH config etc. can be removed since it is passed in by command line flags for GGP. Also deploys cdc_rsync_server to ~/.cache/cdc_file_transfer/ and creates that dir if it does not exist. * Tweak RemoteUtil Replaces localhost: by //./ in the workaround for scp since localhost: had two disadvantages: 1) It required 2 gnubby touches for gLinux and 2) it didn't work for ggp. //./ works for both. Also tweaks quoting, which didn't quite work for ggp. * Don't check remote ports in cdc_rsync Turns off checking remote ports in PortManager. In the future, the server should return available ports after failing to connect to the provided port. Since now the first remote connection is running cdc_rsync_server, the timeout check has to be done when running that process. * Remove now-unused kInstancePickerNotAvailableInQuietMode enum * Add more details to the readme * [cdc_rsync] Accept [user@]host:destination Removes the --ip command line argument and assumes user/host are passed in along with the destination, so it works in the same way as other popular tools. * [ggp_rsync] Combine server deploy commands Combines two chmod and one mv command into one ssh command. This makes deploy a bit quicker, especially if each ssh command involves touching your gnubby. * Remove GGP specific stuff from VS build commands * [cdc_rsync] Get rid of cdc_rsync.dll Compile the CDC RSync client as a static library instead. This removes quite a bit of boiler plate and makes string handling easier since we can now pass std::strings instead of const chars. Also fixes an issue where we were sometimes trying to assign nullptr to std::strings, which is forbidden. * Allow specifying ssh/scp commands with env vars * Rename GgpRsync* to CdcRsync* * Merge ggp_rsync_cli into ggp_rsync * [cdc_rsync] Refactor cdc_rsync.cc/h Merges cdc_rsync.cc/h with main.cc and CdcRsyncClient since code is closer to where it's being used and should be more readable.	2022-11-15 12:48:09 +01:00
Christian Schneider	4326e972ac	Releasing the former Stadia file transfer tools The tools allow efficient and fast synchronization of large directory trees from a Windows workstation to a Linux target machine. cdc_rsync* support efficient copy of files by using content-defined chunking (CDC) to identify chunks within files that can be reused. asset_stream_manager + cdc_fuse_fs support efficient streaming of a local directory to a remote virtual file system based on FUSE. It also employs CDC to identify and reuse unchanged data chunks.	2022-11-03 10:39:10 +01:00

1 2

63 Commits