AML 612 Fall 2015/Spring 2016: Stochastic Methods

Posted on October 1, 2015 by Sherry Towers

[In these modules, students will become familiar with basic computational methods for stochastic modeling. We will cover stochastic modeling of epidemics and biological processes using Markov Chain Monte Carlo (MCMC), Stochastic Differential Equations (SDE’s), and Agent Based Models (ABMs, aka Individual Based Models, IBMs). Stochastic methods are very useful for many different things, so we’ll also discuss other applications throughout the course. Along the way, we will discuss many other things that are critical to your future success as a researcher. How to do literature searches and build an annotated bibliography, how to organize your work, good coding practices, how to write a good research paper, and how to give a good presentation. There is no required textbook for this course. However, I *highly* recommend Modeling Infectious Diseases in Humans and Animals by Keeling and Rohani. It is a great introductory- to medium-level book on modeling methods (including stochastic modeling). Another book that is good, at a medium- to advanced-level, is An Introduction to Stochastic Processes with Applications to Biology, by Linda Allen. NIMBios also has a good web page with information related to stochastic modelling. In addition, a really nice introductory exposition on the topic of stochastic modelling by Priscilla Greenwood and Luis Gordillo can be found here]

test of info box

Posted on June 25, 2015 by Sherry Towers

This is an example of some content

This is my content

This is another example of some content.

And more.

Basics of LaTeX and BibTeX for students in mathematical biology/epidemiology

Posted on June 23, 2015 by Sherry Towers

Most journals in the field of mathematical epidemiology/biology require that you submit manuscripts in LaTeX format.

LaTeX is a word processing language, in which you create a document that contains directives to the latex compiler to produce a final document. LaTeX provides many capabilities not available in Word; perhaps most important to mathematicians, LaTeX enables typesetting of complex equations that would be difficult, if not impossible in Word.

In addition, LaTeX has a reference management package called BibTeX that easily enables citation of references within your document.

In this module, we will discuss some simple examples of document formatting in LaTeX, and describe how to include figures and tables in the document, and cite references.

Continue reading →

Good work habits towards being a more effective researcher

Posted on June 9, 2015 by Sherry Towers

Over the years I have developed some habits as I work that help to make me more efficient as a researcher. Of students that I’ve seen struggle in their doctoral studies, they are always lacking several of the habits in this list. Of students who excel, they have all (or nearly all of these habits) either because they were mentored in them, or somehow figured it out themselves.

In assigned homework, students will be expected to conform to good coding and plotting practices, and to submit an annotated bibliography in bibtex format if literature searches are required.

Many of these habits overlap with the list of good work habits in jobs in the private sector. Start using these practices now, and reap many benefits as you go along 🙂

Back up your files
Sharing information in the cloud
Organize your work
Use good coding practices
Make descriptive plots
Motive and Objective
Motive and Objective
MOTIVE AND OBJECTIVE
Background reading, and documenting the published literature on a topic
Publish your work in a timely fashion

Continue reading →

MTBI 2015 summer lectures

Posted on June 9, 2015 by Sherry Towers

[In these modules, students will become familiar with basic concepts that will enable them to fit the parameters of a mathematical model, such that the model gives a good description of a data set. Sources of data useful to mathematical epidemiology will be discussed, including online sources of data, and how to extract data sources from the literature using programs like DataThief.

Goodness-of-fit statistics will be discussed, as will computational methods for finding model parameters that optimize the goodness-of-fit statistic. In particular, in the examples we will focus on fitting the parameters of compartmental models of disease dynamics to epidemic data.

In passing along the way, we will discuss how to do literature searches, how to build an annotated bibliography in bibtex, how to come up with a solid research question, how to organize your work, and how to write a good research paper (essentially, many of the skills needed to excel at research!)]

Part I

Homework #1

Continue reading →

Wedgie poll

Posted on May 7, 2015 by Sherry Towers

Some text [wedgie id=”554aff5178f2801400009038″]

Some more
and more

Protected: AML 610 Spring 2015 part II: simulating science fairs

Posted on February 19, 2015 by Sherry Towers

Protected: AML 610 Spring 2015: Science Fair judging and the problem of obtaining accurate project rankings based on incomplete information

Posted on February 19, 2015 by Sherry Towers

Protected: Class publication project: R scripts and C++ code for running on ASURE

Posted on November 17, 2014 by Sherry Towers

How to scp and ssh without prompting for password

Posted on November 17, 2014 by Sherry Towers

When you use scp to copy files from Unix machine to Unix machine, it asks you for your password. Ditto when you use ssh. When copying a lot of files, this can get tedious. Continue reading →

Example using Negative Binomial likelihood for model parameter optimization

Posted on November 12, 2014 by Sherry Towers

In this past module, we discussed using the Pearson chi-squared statistic to determine the best-fit parameters of an SIR model to influenza B data from the 2007-08 Midwest flu season. In this module, we will discuss how to find the best-fit parameters using the Negative Binomial likelihood instead.

Continue reading →

Correcting for over-dispersion when using Pearson chi-squared

Posted on November 10, 2014 by Sherry Towers

In this past module, we discussed the various merits and applicability of the Least Squares, Pearson chi-square, Poisson likelihood, and Negative Binomial likelihood statistics.

And in this past module we discussed how we can use the graphical Monte Carlo method (aka fmin plus a half method) to determine the one-std deviation confidence interval on our parameter hypotheses when using a likelihood statistic, and we also discussed how the Least Squares and Pearson chi-square statistics can be converted to likelihood statistics.

Continue reading →

Submitting jobs to the ASU A2C2 ASURE batch computing system

Posted on November 2, 2014 by Sherry Towers

The AML610 Fall 2014 course has received an allocation of 10,000 CPU hours on the A2C2 Asure batch computing system. Students in the course have received an email from me describing how to sign up for an Asure account under this allocation.

Continue reading →

Another example C++ program to fit model parameters to data

Posted on November 1, 2014 by Sherry Towers

In a past module, we examined how we could use methods in the R deSolve to fit the parameters of an SIR model to confirmed cases of influenza B in the Midwest region during the 2007-2008 flu season (the data were obtained from the CDC). In that module, we used a Least Squares goodness-of-fit estimator. Continue reading →

An example C++ program to fit model parameters: using XSEDE

Posted on October 29, 2014 by Sherry Towers

In Homework#7 of the 2014 fall AML610 course, we discussed an example where we had a time series of data for a disease vector, V, that can spread disease to a human population. Once the humans catch the disease, they recover after 1/gamma days and moved to the recovered compartment.

Continue reading →

A C++ class for numerically solving ODE’s

Posted on October 15, 2014 by Sherry Towers

In previous modules, we have described how to use methods in the R deSolve library to numerically solve systems of ordinary differential equations, like the SIR model. The default algorithm underlying the functions in the deSolve library is 4th order Runge-Kutta method, which involves an iterative process to obtain approximate numerical solutions to the differential equations. Euler’s method is an even simpler method that can be used to estimate solutions to ODE’s, but 4th order Runge-Kutta is a higher order method that is more precise. Continue reading →

Using the NSF XSEDE batch computing system

Link

Logging in to NSF XSEDE Stampede

Before starting this module, all students in AML610 were asked to apply for an XSEDE user portal account by going to this link and filling in the form, and choosing a username. Continue reading →

Protected: AML610 Fall 2104 project list

Posted on October 9, 2014 by Sherry Towers

Estimating parameter confidence intervals when using the Monte Carlo optimization method

Posted on September 23, 2014 by Sherry Towers

[In this module, students will become familiar with estimating parameter confidence intervals when using the Monte Carlo method to estimate the best-fit parameters of a mathematical model to data.]

Introduction
Local curvature of likelihood GoF statistic close to minimum, and relationship to parameter uncertainties
Estimating parameter uncertainties with the Hessian matrix
A more simple method for estimating parameter uncertainties: the fmin plus a half method (aka graphical Monte Carlo method)
Using the graphical Monte Carlo method with Least Squares
Constructing 95% Confidence Intervals
Example to show that graphical Monte Carlo method actually works

Continue reading →

How to download an R script from the internet and run it

Posted on September 2, 2014 by Sherry Towers

While you can input commands interactively into the R window, it is often more convenient to create a file (usually with a .R extension) that contains all the R code, and then ask R to run (aka: source) the commands in the file.

In the file short_test.R, I have put the R code to do a loop, and print out the numbers one through ten. To run this script, first you need to create a folder on your computer that we will refer to as your working directory… Have this folder off of your root directory (C: directory in windows, and off of your base user directory in other platforms), and call this folder short_test_dir

Now, in R, you can use the setwd() command to change to that working directory (which tells R that from now on you want it to look only in that directory for files)

For windows, type at the R command prompt

setwd("C:/short_test_dir")

and for Linux or Max OSX type

setwd("~/short_test_dir")

(the twiddle ~ means your home directory). If you get an error at this point, it is because you did not properly create the folder short_test_dir under your home folder (or the C: directory in Windows), or you made a spelling mistake, or you forgot one or both quotes.

Now, a problem with Windows is that .R and data files downloaded from the internet tend to be saved with a .txt extension, and it is annoying to constantly have to remove it. In addition, web browsers running on Windows seem to usually assume that any text file you are looking at on the internet must be HTML, so when downloading such files it puts HTML prefaces at the top, which prevent the files from being run in R. To get around these problems, it is usually easiest to download the files using the R function download.file().

Thus, for windows, at the R command line type

download.file("http://www.sherrytowers.com/short_test.R","short_test.R")

and for Linux and Mac OSX type

download.file("http://www.sherrytowers.com/short_test.R","short_test.R")

If you get an error at this point, you’ve made a spelling mistake in the URL, or in the local directory name. Or you’ve forgotten one or more quotes.

Now, to run the file, type at the R command prompt

source("short_test.R")

You should see the numbers 1 through 10 being printed to the screen.

If you are creating your own .R file, you need to make sure that (particularly for Windows) a .txt is not appended to the end of it.

Now, using a text editor like Notepad (Windows) or TextEdit (Mac OSX), or whatever text editor you feel comfortable with, change the short_test.R file to print the numbers 11 to 100, but in a line, with each number separated by a space (rather than in a column). Save the file, then make sure you can run the new file from the R command line.

You will be expected to have completed this exercise on your own before class, and to be adept at downloading, editing, and running R scripts.

Polymatheia

Author Archives: Sherry Towers

AML 612 Fall 2015/Spring 2016: Stochastic Methods

test of info box

Basics of LaTeX and BibTeX for students in mathematical biology/epidemiology

Good work habits towards being a more effective researcher

MTBI 2015 summer lectures

Wedgie poll

Protected: AML 610 Spring 2015 part II: simulating science fairs

Protected: AML 610 Spring 2015: Science Fair judging and the problem of obtaining accurate project rankings based on incomplete information

Protected: Class publication project: R scripts and C++ code for running on ASURE

How to scp and ssh without prompting for password

Example using Negative Binomial likelihood for model parameter optimization

Correcting for over-dispersion when using Pearson chi-squared

Submitting jobs to the ASU A2C2 ASURE batch computing system

Another example C++ program to fit model parameters to data

An example C++ program to fit model parameters: using XSEDE

A C++ class for numerically solving ODE’s

Using the NSF XSEDE batch computing system

Link

Protected: AML610 Fall 2104 project list

Estimating parameter confidence intervals when using the Monte Carlo optimization method

How to download an R script from the internet and run it