cluster:software
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
cluster:software [2024/11/11 20:27] – mcloughlin | cluster:software [2024/11/14 14:47] (current) – external edit 127.0.0.1 | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====== Cluster Software ====== | + | ======Cluster Software====== |
The list of currently installed software on the cluster. If you wish to have additional software installed, please email econcluster@umd.edu. | The list of currently installed software on the cluster. If you wish to have additional software installed, please email econcluster@umd.edu. | ||
- | //Currently updating to reflect new OS refresh. --2024/ | + | ^Software ^Version ^Terminal |
- | + | | GCC | 11.4.1 | gcc | | |
- | ^Software ^Version ^Command ^ | + | |
| Matlab | R2023a | matlab | | | Matlab | R2023a | matlab | | ||
| Python | 3.9 | python | | | Python | 3.9 | python | | ||
+ | | Python | 3.11 | python3.11 | | ||
| R | 4.4.2 | R | | | R | 4.4.2 | R | | ||
| Stata | 18 MP8 | stata-mp | | | Stata | 18 MP8 | stata-mp | | ||
- | |||
- | =====Matlab===== | ||
- | The current version of Matlab is R2023a. Use " | ||
=====Python===== | =====Python===== | ||
- | The current version of Python is 3.9. Use "python" | + | |
+ | To run a pre-written | ||
+ | |||
+ | ==== Installing Libraries via VENV ==== | ||
+ | |||
+ | To install a library that doesn' | ||
+ | < | ||
+ | after creating, activate the environment | ||
+ | < | ||
+ | (you should now see the environment on the far left of the terminal line). After that, you can simply install any library using pip from the command line | ||
+ | < | ||
=====R===== | =====R===== | ||
- | The current version | + | ====Batch Mode==== |
+ | You can run an R file in batch mode with < | ||
+ | To run your R command in the background, see [[cluster: | ||
+ | |||
+ | ====Installing Packages===== | ||
+ | To install an R package, type in the interactive mode < | ||
+ | |||
+ | ====Introduction to R==== | ||
+ | The following section comes initially from an introductory talk on R given by Paul Bailey in February 2011. The data used in the examples is located at [[http:// | ||
+ | |||
+ | ===R Background=== | ||
+ | * Based on Bell Labs S | ||
+ | * Open source software | ||
+ | * Large group of contributors | ||
+ | * Most R code is written in R | ||
+ | * Computationally intensive code written in FORTRAN or C | ||
+ | * Datasets, matrices are native types | ||
+ | * Easy, customizable graphics | ||
+ | |||
+ | ===R Pros=== | ||
+ | * Free | ||
+ | * Easy to get a sense of what is going on with data | ||
+ | * Excellent at simulation | ||
+ | * Interfaces with lots of other software (i.e. WINBUGS, SQL) | ||
+ | |||
+ | ===R Cons=== | ||
+ | * Uses RAM to store data | ||
+ | * Support mainly via listserves | ||
+ | * Difficult to get started | ||
+ | |||
+ | ===Read in Data=== | ||
+ | * Some type specific methods, and a general method < | ||
+ | |||
+ | ===Getting Help=== | ||
+ | * You can use the following | ||
+ | * To search for text in help text use the following command: < | ||
+ | |||
+ | ===Summary=== | ||
+ | * Getting summaries is easy: '' | ||
+ | * You can also focus on one variable | ||
+ | < | ||
+ | summary(dat$num_child) | ||
+ | table(dat$num_child) | ||
+ | </ | ||
+ | |||
+ | ===Subset Data=== | ||
+ | * When you reference something with '' | ||
+ | < | ||
+ | dat.lf <- dat[dat$emp %in% c(" | ||
+ | dat.hs <- dat.lf[dat.lf$educ==39, | ||
+ | </ | ||
+ | |||
+ | ===Linear Models=== | ||
+ | * The **lm** function fits linear models with a formula: | ||
+ | < | ||
+ | lm1 <- lm(weekly_earn ~ age + year, | ||
+ | summary(lm1) | ||
+ | </ | ||
+ | * You can also treat a variable as a factor: | ||
+ | < | ||
+ | dat$yearf <- as.factor(dat$year) | ||
+ | lm2 <- lm(weekly_earn ~ age + yearf, | ||
+ | summary(lm2) | ||
+ | </ | ||
+ | * And change constraints: | ||
+ | < | ||
+ | contrasts(dat$yearf) <- " | ||
+ | lm3 <- lm(weekly_earn ~ age + yearf, | ||
+ | summary(lm3) | ||
+ | </ | ||
+ | |||
+ | ===Aggregate=== | ||
+ | * Allows you to create summary statistics for groups | ||
+ | * First argument is what you want to summarize | ||
+ | * Second argument is what you want to group by | ||
+ | * Their argument is what to do to the groups | ||
+ | < | ||
+ | agg.hs <- aggregate(dat.hs$emps, | ||
+ | </ | ||
+ | * Results names a little odd. | ||
+ | * | ||
+ | ===Merge=== | ||
+ | * Groups two datasets by shared columns | ||
+ | < | ||
+ | merged <- merge(data.a, | ||
+ | </ | ||
+ | * Lots of options for this one | ||
+ | |||
+ | ===Parallel=== | ||
+ | Some basic info can be found at the [[http:// | ||
+ | |||
+ | You can also use [[http:// | ||
+ | |||
+ | ^^OpenMPI^MPICH2^ | ||
+ | |Before anything (installation or usage)|> | ||
+ | |Installation| R> | ||
+ | |||
+ | A good intro guide is [[http:// | ||
+ | |||
+ | ===Other functions=== | ||
+ | * '' | ||
+ | * '' | ||
+ | * '' | ||
+ | * [[http:// | ||
=====Stata===== | =====Stata===== | ||
- | The current version of Stata is 18 MP8. Use "stata-mp" | + | ====Batch Mode===== |
+ | You can run a .do file in batch mode with | ||
+ | < | ||
+ | stata-mp | ||
+ | </ | ||
+ | |||
+ | To allow your do-file to continue running when you log off from your terminal, preface the command with "nohup" | ||
+ | < | ||
+ | nohup stata-mp -b do dofile.do & | ||
+ | </ | ||
+ | |||
+ | For more information on how to run your Stata command | ||
+ | |||
+ | ==== Temporary Files ==== | ||
+ | By default, Stata saves tempfiles (from -tempfile- or -preserve-) | ||
+ | < | ||
+ | export STATATMP=$HOME/ | ||
+ | </ | ||
+ | |||
+ | One reason you might want to do this is that files are removed from / | ||
+ | |||
+ | ==== Installing Extra Packages ==== | ||
+ | If you are using extra packages on your home/work computer and need them installed on the cluster, you can install them via ssc: | ||
+ | |||
+ | < | ||
+ | ssc install outreg | ||
+ | </ | ||
+ | |||
+ | You will then have a folder installed within your home directory called " |
cluster/software.1731356836.txt.gz · Last modified: 2024/11/11 20:27 by mcloughlin