An Ubuntu VPS for Django
Three years ago I wrote a guide to building a VPS web server for serving sites in a PHP environment. That setup served me well for some time, but most of the sites I run now – including this one – are now written in Python. Earlier this year I built another web server to reflect this. It’s similar to before; I still use Ubuntu and I still like to serve pages with nginx. But PHP has been replaced with Python, and many of the packages used to build the environment have changed as a result. As with the last time, I decided to compile my notes into a guide, both for my own reference and in case anyone else would like to duplicate it. So far, the server has proven to be fast and efficient. It serves Python using uWSGI, uses a PostgreSQL database, and includes a simple mail server provided by Postfix. I think it’s a good setup for serving simple Django-based websites.
Basic Setup
As with last time, I recommend following Slicehost’s basic server setup article. It discusses user administration, SSH security, and firewalls. I no longer use Slicehost as my VPS provider, but I find that Slicehost’s articles provide an excellent base regardless of the host.
Packages
Packages should upgraded immediately to address any known security vulnerabilities.
1 2 |
|
After the repositories have been updated, I install some essential packages.
1 |
|
Build-essential includes necessary tools to compile programs. I am incapable of using a computer that does not have screen on it, so that gets installed too. The third package, dnsutils, is optional, but includes dig
which is useful for troubleshooting DNS issues.
DenyHosts
Slicehost’s setup article recommends turning off password authentication in SSH, forcing users to login with keys only. I use keys whenever I can, but I appreciate the option of being able to login to my server from any computer, when I may or may not have my SSH key with me. So I leave password authentication enabled. This presents the possibility of brute-force attacks. Enter DenyHosts. DenyHosts, which I have discussed previously attempts to protect against SSH attacks by banning hosts after a certain number of failed login attempts. When password authentication is enabled, running DenyHosts is a smart move.
1 2 |
|
Personalize the Environment
I use update-alternatives
to set my default editor to vim.
1 |
|
All of my personal configuration files are kept in a github repository. I’ll check out that repository into ~/src
and install the files.
1 2 3 4 5 6 7 8 9 10 11 12 |
|
Time
The next step is to set the server’s timezone and sync the clock with NTP.
1 2 |
|
Rootkits
There’s no reason not to run both chkrootkit and rkhunter to check for rootkits and related vulnerabilities.
chrkrootkit
Slicehost has an excellent article for setting up and using chkrootkit.
Later on I’ll be installing some Python development packages. One of them creates a directory called /usr/lib/pymodules/python2.6/.path
, which sets off a warning in chkrootkit. Part of chkrootkit’s desgin philosophy is to not include any whitelists: if chkrootkit finds something that it doesn’t like, you’re going to hear about it. I have cron run chkrootkit nightly and I want to receive any warnings, but I don’t want to receive the same false positive every morning in my inbox.
The solution is to create a file that contains chkrootkit’s warning. I call that file whitelist
and store it in the same directory as chkrootkit. When chkrootkit is run, any output is redirected to a file. That file is compared to the whitelist using diff
and the output of that – if any – is then read. At the end, the file containing chkrootkit’s output is deleted so that the working directory is ready for the next run. The effect is that I only hear warnings from chkrootkit that I have not explicit whitelisted. All of this can be accomplished in a single crontab entry.
1 |
|
rkhunter
I’m sure it doesn’t surprise you that I’m going to recommend reading Slicehost’s article on rkhunter.
Unlike chkrootkit, rkhunter does allow for whitelists. On a clean Ubuntu 10.04 system, I find that I need to whitelist a few items in the config.
1 2 3 4 5 6 7 |
|
The script that my cronjob runs is slightly different from the one demonstrated in the Slicehost article. Their script executes a few commands, groups the output together, and sends it to mail
to email the system administrator. This results in daily emails, regardless of whether rkhunter finds any warnings or not. My script is simpler and does not result in so many messages.
1 2 3 4 |
|
The version check and update commands both have the -q
switch, which disables any output – I don’t care to know whether rkhunter updated itself or not. The final line actually executes the scan. Notice that there’s no reference to mail
. This script does not send any messages. The reason for that is that rkhunter itself provides the mail functionality. Inside of /etc/rkhunter.conf
there is a MAIL-ON-WARNING
variable. As long as the machine has an smtp server on it (which I’ll get to later in this guide), simply filling in this variable will result in any warnings being emailed to the system administrator.
Web Server
With the basics complete, it’s time to start serving something! In my previous article I covered serving a PHP-based Wordpress site via FastCGI and nginx. This time around the stack will be different: nginx, uWSGI, Python, and Django.
A few basic packages will help flesh out the server’s Python development environment:
1 |
|
uWSGI
Installing uWSGI is a simple matter of compiling it and moving the resulting binary into one of your system’s bin directories.
1 2 3 4 5 6 |
|
nginx
The nginx package in Ubuntu’s official repositories is always notoriously outdated. It used to be you had to compile the server from source, but there is now an Ubuntu PPA for the latest stable versions. As described by the nginx wiki, all that is needed is to add the PPA to your sources.list
and apt-get
away!
1 2 3 |
|
Python and Django
If you do Python development and haven’t heard of virtualenv, it is well worth reading up on. It allows the user to create an isolated, virtual Python environment for each project. This helps immensely when developing (or serving) multiple projects on a single machine. Needless to say, I consider it to be a required package.
Install
I’ll be installing virtualenv and virtualenvwrapper (a set of scripts to facilitate working with virtual environments). I also prefer pip over easy_install
for managing Python packages.
1 2 3 |
|
Setup a virtual environment
Virtual environments can be stored wherever you fancy. For now, I keep them in a hidden folder in my home directory. For these examples, I’ll setup an environment called myproject
.
1 2 3 |
|
Notice the --no-site-packages
switch. That tells virtualenv
to create this environment without any of the Python packages already installed, creating a completely fresh, clean environment. The --distribute
switch causes the new virtual environment to be setup with distribute, a replacement for the old and rather broken setuptools.
All that’s needed to get virtualenvwrapper
up and running is to add two lines to your .bashrc
and re-source the file.
1 2 3 4 5 6 |
|
We can now use commands like workon
to ease the process of activating a certain environment.
I’ll go ahead and install yolk in the environment to help manage packages.
1 2 3 |
|
The last command will cause yolk
to list all packages installed in the environment. Try deactivating the environment and then running yolk
again.
1 2 3 |
|
‘yolk’ wasn’t found, because it was only installed within the virtual environment. Neat!
Install Django
Finally, it’s time to install Django! The process is simple enough.
1 2 |
|
And that’s it!
The Python Imaging Library is likely to be needed for any Django project. I installed it in the beginning of this section, but because I used the --no-site-packages
when creating my virtual environment, it is not available for use within the project. To fix that, I’ll just link the package in. I also previously installed psyopg2, which Python will need to communicate with my PostgreSQL database, so I’ll link that in as well. psyopg2
depends on mx, which was also previously installed but still must be made available in the environment.
1 2 3 4 5 |
|
That wasn’t too painful!
Create a Django project
While I’m still in the virtual environment, I’ll go ahead and create a new Django project. The project will have the same name as the environment: myproject
. For this tutorial, I’ll stick with the precedence set by the Slicehost tutorials and use demo
as the name of both my user and group on the server.
I like to keep my sites in the /srv/
directory. I structure them so that the code that runs the site, any public files, logs, and backups are all stored in separate sub-directories.
1 2 3 4 5 6 7 8 9 |
|
Notice that the logs
and public
directories were chown
ed to the www-data group. That is the name of the user and group that nginx will run as. The web server will need permissions to write to those locations.
Save Requirements
With the environment setup and all the necessary packages installed, now is a good time to tell pip to freeze all the packages and their versions. I keep this file in a deploy
folder in my project.
1 2 |
|
Now, if I needed to recreate the virtual environment somewhere else, I could just tell pip to install all the packages from that file.
1 |
|
Configure uWSGI
Now that I have something to serve, I’ll configure uWSGI to serve it. The first step is to create a configuration file for the project. I call mine wsgi.py
and store it in /srv/myproject.com/code/myproject/
. It appends the current directory to the Python path, specifies the Django settings file for the project, and registers the WSGI handler.
1 2 3 4 5 6 7 8 9 |
|
With that done, the next step is to decide how uWSGI should be run. I’m going to use Ubuntu’s upstart to supervise the service. I keep the upstart script in my project’s deploy/
directory.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 |
|
Sadly, upstart doesn’t seem to recognize links. Rather than linking the config file into /etc/init/
, I have to copy it.
1 |
|
Configure nginx
Nginx’s configuration is pretty straight-forward. If you’ve never configured the server before, Slicehost’s articles can set you down the right path. My own nginx config looks something like this:
user www-data www-data;
worker_processes 4;
pid /var/run/nginx.pid;
events {
worker_connections 768;
use epoll;
}
http {
##
# Basic Settings
##
sendfile on;
tcp_nopush on;
tcp_nodelay on;
keepalive_timeout 30;
types_hash_max_size 2048;
# server_tokens off;
# server_names_hash_bucket_size 64;
# server_name_in_redirect off;
include /etc/nginx/mime.types;
default_type application/octet-stream;
##
# Logging Settings
##
access_log /var/log/nginx/access.log;
error_log /var/log/nginx/error.log;
##
# Gzip Settings
##
gzip on;
gzip_disable "msie6";
gzip_types text/plain text/css application/json application/x-javascript text/xml application/xml application/xml+rss text/javascript;
gzip_proxied any;
gzip_comp_level 2;
# gzip_vary on;
# gzip_buffers 16 8k;
# gzip_http_version 1.1;
##
# Virtual Host Configs
##
include /etc/nginx/conf.d/*.conf;
include /etc/nginx/sites-enabled/*;
}
I keep the virtual host config for my project inside the project’s code/deploy/
directory. A basic virtual host for a Django project would looks like this:
server {
listen 80;
server_name www.myproject.com;
rewrite ^/(.*) http://myproject.com/$1 permanent;
}
server {
listen 80;
server_name myproject.com;
access_log /srv/myproject.com/logs/access.log;
error_log /srv/myproject.com/logs/error.log;
location /media {
root /srv/myproject.com/public/;
}
location /static {
root /srv/myproject.com/public/;
}
location / {
uwsgi_pass unix:///var/run/myproject.com.sock;
include uwsgi_params;
}
}
To install and enable the virtual host, I’ll link the configuration file first to the nginx sites-available
directory, and then link that link to the sites-enabled
directory.
1 2 |
|
SSL
If you need to encrypt communications, Linode has a tutorial on using both self-signed certificates and commercial certificates with nginx.
Fire it Up
Nginx should be set to talk to uWSGI, which should be set to talk to the Django project. Time for a test run!
1 2 |
|
memcached
Django has a very good built-in cache framework. I like to take advantage of it with a memory-based backend: namely, memcached. It’s fast, efficient, and easy to setup.
All that’s needed is to install memcached on the server, followed by the Python API python-memcached.
1 2 3 |
|
The default configuration file in Ubuntu lives at /etc/memcached.conf
. I usually stick with the defaults, but sometimes end up changing the port that memchached runs on or the amount of memory it is allowed to use.
logrotate
With the web server more-or-less complete, I like to setup logrotate to manage the logs in my project’s directory. Once again, Slicehost has an excellent introduction to logrotate and an example config for virtual hosts.
I maintain a configuration file for each of the domains being served by the machine. The file for a domain lives in – you guessed it – the associated project’s deploy/
folder. Each contains two entries: one for the nginx virtual host and one for the uWSGI instance. The reason for this is that each config block needs a postrotate
section to restart the associated server after the logs have been rotated. I don’t want nginx to be restarted everytime a uWSGI log is rotated, and I don’t want uWSGI restarted everytime an nginx log is rotated.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 |
|
This file is linked in to the /etc/logrotate.d/
directory. Logrotate will automatically include any file in that directory inside its configuration.
1 |
|
Database Server
A web server isn’t much use without a database these days. I use PostgreSQL.
Install
1 |
|
Configure
PostgreSQL has some unique terminology and ways of doing things. When I first set it up for the first time, having coming from a MySQL background, not everything was completely straightforward. As usual, Slicehost has a number of articles that will provide a foundation.
In the /etc/postgresql/8.4/main/postgresql.conf
file, I uncomment the following two lines:
track_counts = on
autovacuum = on
Then restart the database server.
1 |
|
After that I’ll change the password for the postgres user and the postgres database role.
1 2 3 4 |
|
To allow local socket connections to the database using passwords, I open up /etc/postgresql/8.4/main/pg_hba.conf
and find the following line:
local all all ident
Which I then change to:
local all all md5
After which another restart is in order.
1 |
|
Create a database
The next step is to create a user (or role, in PostgreSQL’s parlance) and database for the project. I use the same name for both.
1 2 |
|
After that, I should be able to connect.
1 |
|
Import the Database
If I’m restoring a previous database from a backup, now would be the time to import the backup.
1 |
|
And now Django should be able to connect!
The basic server is setup and secure. Django, uWSGI, nginx and PostgreSQL are all running and getting along swimmingly. At this point, many people would be done, but I also like to have a minimal mail server.
Mail Server
Most of my domains use Google Apps, so I don’t need a full-blown mail server. I do want programs and scripts to be able to send mail, and I prefer not to do so through an external SMTP server – I’d rather just deal with having sendmail running on my own box. And I do have a few domains that do not use Google Apps. They have one or two aliases associated with them, so the server needs to receive messages for those domains and forward them off to an external address. If any of this sounds vaguely familiar, it’s because it’s the same thing I detailed last time. My setup now is the same as then, so I won’t repeat any of it here.
For a more detailed explanation of running Postfix, you can read the Slicehost articles.
A Note on Git
I use Git to keep track of the code for all my projects. (If you’re new to Git, you ought to skim the Git Reference or Everyday GIT With 20 Commands Or So). To manage websites, I create a repository of the directory with the code that runs the site (in this case, /srv/myproject.com/code/
) and another empty, bare repository to work as a hub. With a post-update
and post-commit
, the end result is an excellent web workflow:
- A copy of the hub can be checked out on a local machine for development. Whenever a change is committed, a simple
git push
will push the code to the web server and automatically make it live. - Changes can be made on the server in the actual live website directory. (This is not a best practice, but I do it more often than I should probably admit.) Whenever a change is committed, it is automatically pushed to the hub, so that a simple
git pull
is all that’s needed on the development machine to update its repository.
A more detailed explanation of this workflow is at Joe Maller’s blog.
To start, I need to create a repository for the new project I created in this tutorial. And, since this is a new server, I need to give Git my name and email address to record with every commit.
1 2 3 4 |
|
Before adding the files, I create a .gitignore
file in the repository root to tell Git to ignore compiled Python files.
$ vim .gitignore
*.pyc
Now I add all the files to the repository, confirm that it worked, and commit the files.
1 2 3 |
|
I create the bare hub directory directly along side the projects code/
.
1 2 3 4 |
|
With the hub created, I need to add it as the remote for the main repository and push the master branch to it.
1 2 3 4 |
|
Now the hub needs a post-update
script so that every time something is pushed to it, that change is automagically pulled into the live website directory.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 |
|
And the live website directory requires a post-commit
script so that every time something is committed inside of it, that change is automagically pushed to the hub.
1 2 3 4 5 6 7 8 9 10 |
|
All that’s left is to check out the hub onto the development machine – my laptop, in this case!
1 2 3 |
|
To test things out, we can add a file to the repository on the development machine.
1 2 3 4 5 |
|
Now go back to the server, and the file should be there! To test things the other way around, I’ll delete the file from the live repository.
1 2 3 4 5 |
|
And once again to the development machine:
1 2 3 |
|
No more test! It’s pretty dandy.
Restoring
If I was building a new server and restoring a project from an old server, I would simply mirror the old hub and then clone that in the live directory.
1 2 3 |
|
Resources
Prior to building this server, I was new to a lot of this – particularly, uWSGI and virtualenv. The following tutorials helped me a good deal in putting together the perfect setup for my needs.
- A Primer on virtualenv by Chris Scott
- A web-focused Git workflow by Joe Maller
- Deployment with uWSGI and nginx by Zachary Voase
- Django on uWSGI and Nginx by Brandon Konkle
- Django, Nginx and uWSGI in production by Jeremy Bowers
- Notes on using pip and virtualenv with Django by Eliot
- Presentation: pip and virtualenv by Rich Leland
- Provisioning a new Ubuntu server for Django by Brandon Konkle
- Running Django with Nginx and uWSGI by Simon Westphahl
- virtualenvwrapper Command Reference
- Working with virtualenv by Arthur Koziel