GEN220 High Throughput Data Analysis
On the UNIX command line. Go into your bigdata folder. If you have not used the cluster before then you will be in the gen220 project. Or you may have your own lab bigdata folder.
See Text Editors in UNIX While it takes a few steps to install and setup, VisualStudio is a great resource you can edit on your local machine but saves changes on HPCC. See https://hpcc.ucr.edu/manuals/hpc_cluster/selected_software/vscode/
This should work
cd ~/bigdata
but if it doesn’t
cd /bigdata/gen220/$USER # will go into your bigdata folder for the class
But if you already had an account on the cluster then
cd /bigdata/$GROUP/$USER # should work since $USER is your login and $GROUP is your primary lab group
# you can see what groups you are in by typing
groups
For your homework:
mkdir -p ~/bigdata/gen220/homework and then cd ~/bigdata/gen220/homeworkgit clone https://github.com/biodataprog/GEN220_data.git
Now you want to make a folder for your work for this classThe steps look like this:
# you can make a folder for GEN220
mkdir gen220
go into that folder
cd gen220
# now use git to checkout the class data folder
git clone https://github.com/biodataprog/GEN220_data.git
# now go into this folder
cd GEN220_data
Look around in the folder. Go into the tabular folder where I’ve stored some tab or comma delimited data. You will later need to copy a file from this folder into your homework folder.
git clone git@github.com:biodataprog/2025-hw1-YOURGITHUBID
OR for the https will need to create a token as your password)
git clone https://github.com/biodataprog/2025-hw1-YOURGITHUBID.git
Go into your folder (cd 2025-hw1-YOURGITHUBID).
filesize.sh; you can do this in jupyter on web, you can edit on the command line with nano, vi, or emacs, or you can use visual studio tunnel./filecount.sh).git commit and then git push
```git commit -m “This is a homework 1 solution” filesize.sh
git push ```
threatened-species.csv.gz file - see info here HW1 or you can just run the included ./setup.sh script to download. but also encourage you to practice with cp command.answer1.sh:
du or ls -lcut, sort, uniqgit commit -m 'a message' and git push to save the changes to github.