Update README.md
jirivorel authored Nov 29, 2023
1 parent 342727f commit 616b7f7
Showing 1 changed file with 53 additions and 0 deletions.
53 changes: 53 additions & 0 deletions README.md
@@ -20,6 +20,7 @@ Information given in this course is current as of 30th November 2023.
* [System of software modules](#system-of-software-modules)
* [Raw reads and quality control](#raw-reads-and-quality-control)
* [Data manipulation](#data-manipulation)
* [Download via terminal](#download-via-terminal)

# Introduction

@@ -150,6 +151,7 @@ In Metacentrum, we lack the visual interface typical for desktop computers. Inst
| `rmdir` | Removes an empty directory. |
| `rm -r` | Removes directory with its content. |
| `touch` | Creates a file without content. |
| `scp` | Copies files between computers over SSH. |
| `.` or `./` | Refers to the present location. |
| `..` or `../` | Refers to the parent directory. |

@@ -446,13 +448,64 @@ exit
# Data manipulation

From the previous chapter, we have two types of data (reads and files with quality information). We will use them to practise how to:
- download the data from MetaCentrum to the local computer.
- upload the data to MetaCentrum.
- transfer the data between storages.
- back up the data.

> [!IMPORTANT]
> The MetaCentrum documentation describes in detail [how to manipulate data effectively](https://docs.metacentrum.cz/data/data-within/).

First, we will download the results of the quality check step, that is, copy them from the MetaCentrum storage `plzen1` to the local computer.

In general, small files and folders can be downloaded or uploaded through the frontend servers. For bigger volumes of data, it is recommended to [access storage servers directly](https://docs.metacentrum.cz/data/data-within/#large-data-handling). A list of all MetaCentrum storage servers is available [here](https://wiki.metacentrum.cz/wiki/NFS4_Servery).

## Download via terminal

The easiest way to download data from the remote server is via a terminal. Let's execute a few commands and discuss how they differ.

```shell
scp [email protected]:Illumina_raw_SRR24321378_1_fastqc.html .
# alternatively:
scp [email protected]:Illumina_raw_SRR24321378_1_fastqc.html .
```
```shell
scp [email protected]:./Illumina_raw_SRR24321378_2_fastqc.html .
# alternatively:
scp [email protected]:./Illumina_raw_SRR24321378_2_fastqc.html .
```

The general syntax with a path is `scp user_name@server_name:/path/to/any/file /path/where/to/save/it/on/my/computer`: the source comes first and the destination second. `scp` is a traditional Linux command with [many tutorials on how to use it](https://linuxize.com/post/how-to-use-scp-command-to-securely-transfer-files/).
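
The source-then-destination rule can be tried safely with a dry run. The sketch below only prints `scp` commands instead of executing them, so it needs no MetaCentrum account. The helper `scp_command` is hypothetical, and `user_name`, `server_name`, and all paths are placeholders.

```shell
#!/usr/bin/env bash
# Dry-run sketch: print scp commands instead of executing them.
# scp_command is a hypothetical helper; all names and paths are placeholders.

# Usage: scp_command download|upload USER SERVER REMOTE_PATH LOCAL_PATH
scp_command() {
    local direction="$1" user="$2" server="$3" remote_path="$4" local_path="$5"
    case "$direction" in
        download) printf 'scp %s@%s:%s %s\n' "$user" "$server" "$remote_path" "$local_path" ;;
        upload)   printf 'scp %s %s@%s:%s\n' "$local_path" "$user" "$server" "$remote_path" ;;
        *)        printf 'unknown direction: %s\n' "$direction" >&2; return 1 ;;
    esac
}

# Remote file -> current local directory (source first, destination second).
scp_command download user_name server_name /path/to/any/file .
# Local file -> remote directory (same rule, with the roles swapped).
scp_command upload user_name server_name /path/on/the/server/ my_file.txt
```

Printing the command first is a cheap way to check the direction of a transfer before any data is moved.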

```shell
scp [email protected]:Illumina_raw_SRR24321378_\*_fastqc.html .
# alternatively:
scp [email protected]:Illumina_raw_SRR24321378_\*_fastqc.html .
```
```shell
scp -r [email protected]:ont_outdir .
# alternatively:
scp -r [email protected]:ont_outdir .
```
```shell
scp [email protected]:ONT_raw_SRR24321377.fastq .
# alternatively:
scp [email protected]:ONT_raw_SRR24321377.fastq .
```
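
The backslash in `\*` stops the local shell from expanding the wildcard, so `scp` receives the pattern unchanged and the matching happens against the remote files. The sketch below shows the difference locally with `printf` in a temporary directory. No connection is made, and the file names are only stand-ins for the ones used above.

```shell
#!/usr/bin/env bash
# Local demonstration of wildcard escaping; nothing is transferred.
workdir=$(mktemp -d)
cd "$workdir" || exit 1
touch Illumina_raw_SRR24321378_1_fastqc.html Illumina_raw_SRR24321378_2_fastqc.html

# Unescaped: the local shell replaces the pattern with the matching local files.
unescaped=$(printf '%s\n' Illumina_raw_SRR24321378_*_fastqc.html)

# Escaped: the pattern is passed on literally, as scp would receive it.
escaped=$(printf '%s\n' Illumina_raw_SRR24321378_\*_fastqc.html)

printf 'unescaped:\n%s\n' "$unescaped"
printf 'escaped:\n%s\n' "$escaped"

# Clean up the temporary directory.
cd - >/dev/null || exit 1
rm -r "$workdir"
```

Quoting the whole remote argument in single quotes has the same effect as the backslash.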

> [!IMPORTANT]
> **The directory structure on NFS4 storages and frontend servers is not identical!**
> ```shell
> $ ssh [email protected]
> $ pwd
> /storage/plzen1/home/vorel
> $ ls
> Illumina_raw_SRR24321378_1.fastq Illumina_raw_SRR24321378_2.fastq
>
> $ sftp [email protected]
> sftp> pwd
> Remote working directory: /home/vorel
> sftp> ls
> Illumina_raw_SRR24321378_1.fastq Illumina_raw_SRR24321378_1_fastqc.html
> ```
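
Because the two views differ, a path copied from a frontend session has to be rewritten before it is used on the storage server. This is a minimal sketch, assuming the mapping seen in the example (the frontend prefix `/storage/plzen1` corresponds to the root of the storage server) holds for everything below the home directory. `to_storage_path` is a hypothetical helper, not a MetaCentrum tool.

```shell
#!/usr/bin/env bash
# to_storage_path is a hypothetical helper, not part of MetaCentrum.
# Assumption: /storage/<name>/... on a frontend is /... on the storage server,
# as in the example above (/storage/plzen1/home/vorel -> /home/vorel).
to_storage_path() {
    local frontend_path="$1"
    local rest
    case "$frontend_path" in
        /storage/*/*)
            rest="${frontend_path#/storage/}"   # e.g. plzen1/home/vorel/...
            printf '/%s\n' "${rest#*/}"         # e.g. /home/vorel/...
            ;;
        *)
            printf 'not a /storage/<name>/ path: %s\n' "$frontend_path" >&2
            return 1
            ;;
    esac
}

to_storage_path /storage/plzen1/home/vorel/ont_outdir
```

It is worth confirming the result with `pwd` and `ls` on the storage server, as in the note above, before relying on it in a transfer.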
