@ozika
Last active August 31, 2022 12:26
Setting up datalad dataset (core steps)
  • add locking

requirements

create data set from a folder

datalad create -f [folder]

to ignore a folder you can use

.gitignore
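As a minimal sketch of the .gitignore approach, assuming a hypothetical folder named scratch/ that should stay untracked (the folder and file names are illustrative and not part of these notes). A DataLad dataset is a Git repository, so plain git check-ignore can confirm the rule before running datalad save:

```shell
# Work in a throwaway directory so nothing real is touched.
ds=$(mktemp -d)
cd "$ds"
git init -q .

# Hypothetical folder that should not be tracked, plus one file that should.
mkdir scratch
echo "temporary output" > scratch/tmp.txt
echo "raw data" > data.txt

# Ignore the whole folder (the trailing slash matches directories only).
echo "scratch/" > .gitignore

# check-ignore prints the path and exits 0 when the path is ignored.
ignored=$(git check-ignore scratch/tmp.txt)
# data.txt is not ignored, so check-ignore prints nothing and exits 1.
tracked=$(git check-ignore data.txt || true)

echo "ignored: $ignored"
echo "check-ignore output for data.txt: '$tracked'"
```

In a real dataset the .gitignore file itself would normally be saved like any other file, so the rule travels with the dataset.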

add a folder to be tracked

datalad save -m "added a folder to tracking"

add a remote (example of GIN)

  • create the repo on GIN

datalad siblings add -d . --name gin --url git@gin.g-node.org:/ozika/pain_learning_raw_dataset.git

push data to remote

datalad push -f all --to gin

before 0.15:

datalad publish -f --to gin --transfer-data all

see how much data is present locally

datalad status --annex all

update info from remote

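datalad update only fetches information about changes on the remote; it does not change your local files. To see which new files exist, check out the origin/main branch.

to also merge the changes into the local branch (file content is still retrieved with datalad get):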

datalad update -s origin --merge

before 0.15:

datalad update -s origin --merge ALLOWED (ALLOWED is a placeholder for the merge mode, e.g. any or ff-only)
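The difference between fetching and merging can be sketched with plain Git, which DataLad builds on. This is only an illustration under stated assumptions: git fetch stands in roughly for datalad update without --merge, the sibling is assumed to be called origin with a main branch, and the repositories and file names (remote.git, a, b, first.txt, new.txt) are hypothetical. Instead of switching to origin/main, it lists the files that exist there but not yet locally:

```shell
# Throwaway identity and workspace so the sketch is self-contained.
export GIT_AUTHOR_NAME="Example" GIT_AUTHOR_EMAIL="example@example.org"
export GIT_COMMITTER_NAME="Example" GIT_COMMITTER_EMAIL="example@example.org"
work=$(mktemp -d)
cd "$work"

# A bare repository plays the role of the remote sibling.
git init -q --bare remote.git
git -C remote.git symbolic-ref HEAD refs/heads/main

# Clone "a" publishes the first file.
git clone -q remote.git a 2>/dev/null
git -C a symbolic-ref HEAD refs/heads/main
echo "one" > a/first.txt
git -C a add first.txt
git -C a commit -q -m "first file"
git -C a push -q origin main

# Clone "b" is the local copy that will fall behind.
git clone -q remote.git b

# Clone "a" publishes a new file afterwards.
echo "two" > a/new.txt
git -C a add new.txt
git -C a commit -q -m "new file"
git -C a push -q origin main

# Fetch only: origin/main moves, the working tree of "b" does not.
git -C b fetch -q origin
new_files=$(git -C b diff --name-only --diff-filter=A HEAD origin/main)
echo "on origin/main but not yet local: $new_files"

# Merge (fast-forward only) to bring the new file into the working tree.
git -C b merge -q --ff-only origin/main
ls b
```

Listing the difference this way avoids changing branches, which may be convenient when the working tree has unsaved changes.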

get files from remote

datalad get . (gets all files)

datalad get file1 file2 (gets specific files)

remove local files after they have been used

datalad drop .

clone (install) data set

datalad clone [url]
