Installing CKAN from package

This section describes how to install CKAN from package. This is the quickest and easiest way to install CKAN, but it requires Ubuntu 18.04 (Python 2) or 20.04 (Python 3 or Python 2) 64-bit. If you’re not using any of these Ubuntu versions, or if you’re installing CKAN for development, you should follow Installing CKAN from source instead.

At the end of the installation process you will end up with two running web applications, CKAN itself and the DataPusher, a separate service for automatically importing data to CKAN’s DataStore extension. Additionally, there will be a process running the worker for running Background jobs. All these processes will be managed by Supervisor.

For Python 3 installations, the minimum Python version required is 3.6.

  • Ubuntu 20.04 includes Python 3.8 as part of its distribution
  • Ubuntu 18.04 includes Python 3.6 as part of its distribution

Host ports requirements:

Service Port Used for
NGINX 80 Proxy
uWSGI 8080 Web Server
uWSGI 8800 DataPusher
Solr/Jetty 8983 Search
PostgreSQL 5432 Database
Redis 6379 Search

1. Install the CKAN package

On your Ubuntu system, open a terminal and run these commands to install CKAN:

  1. Update Ubuntu’s package index:

    sudo apt update
    
  2. Install the Ubuntu packages that CKAN requires (and ‘git’, to enable you to install CKAN extensions):

    sudo apt install -y libpq5 redis-server nginx supervisor
    

    Note

    If you want to install CKAN 2.9 running on Python 2 for backwards compatibility, you need to also install the Python 2 libraries:

    # On Ubuntu 18.04
    sudo apt install python2 libpython2.7
    
    # On Ubuntu 20.04
    sudo apt install libpython2.7
    
  3. Download the CKAN package:

    • On Ubuntu 18.04:

      wget http://packaging.ckan.org/python-ckan_2.9-bionic_amd64.deb
      
    • On Ubuntu 20.04, for Python 3 (recommended):

      wget http://packaging.ckan.org/python-ckan_2.9-py3-focal_amd64.deb
      
    • On Ubuntu 20.04, for Python 2:

      wget http://packaging.ckan.org/python-ckan_2.9-py2-focal_amd64.deb
      
  4. Install the CKAN package:

    • On Ubuntu 18.04:

      sudo dpkg -i python-ckan_2.9-bionic_amd64.deb
      
    • On Ubuntu 20.04, for Python 3:

      sudo dpkg -i python-ckan_2.9-py3-focal_amd64.deb
      
    • On Ubuntu 20.04, for Python 2:

      sudo dpkg -i python-ckan_2.9-py2-focal_amd64.deb
      

2. Install and configure PostgreSQL

Tip

You can install PostgreSQL and CKAN on different servers. Just change the sqlalchemy.url setting in your /etc/ckan/default/ckan.ini file to reference your PostgreSQL server.

Note

The commands mentioned below are tested for Ubuntu system

Install PostgreSQL, running this command in a terminal:

sudo apt install -y postgresql
orphan:

Note

If you are facing a problem in case postgresql is not running, execute the command sudo service postgresql start

Check that PostgreSQL was installed correctly by listing the existing databases:

sudo -u postgres psql -l

Check that the encoding of databases is UTF8, if not you might find issues later on with internationalisation. Since changing the encoding of PostgreSQL may mean deleting existing databases, it is suggested that this is fixed before continuing with the CKAN install.

Next you’ll need to create a database user if one doesn’t already exist. Create a new PostgreSQL user called ckan_default, and enter a password for the user when prompted. You’ll need this password later:

sudo -u postgres createuser -S -D -R -P ckan_default

Create a new PostgreSQL database, called ckan_default, owned by the database user you just created:

sudo -u postgres createdb -O ckan_default ckan_default -E utf-8

Note

If PostgreSQL is run on a separate server, you will need to edit postgresql.conf and pg_hba.conf. On Ubuntu, these files are located in etc/postgresql/{Postgres version}/main.

Uncomment the listen_addresses parameter and specify a comma-separated list of IP addresses of the network interfaces PostgreSQL should listen on or ‘*’ to listen on all interfaces. For example,

listen_addresses = 'localhost,192.168.1.21'

Add a line similar to the line below to the bottom of pg_hba.conf to allow the machine running the web server to connect to PostgreSQL. Please change the IP address as desired according to your network settings.

host    all             all             192.168.1.22/32                 md5

Edit the sqlalchemy.url option in your CKAN configuration file (/etc/ckan/default/ckan.ini) file and set the correct password, database and database user.

3. Install and configure Solr

Tip

You can install Solr and CKAN on different servers. Just change the solr_url setting in your /etc/ckan/default/ckan.ini /etc/ckan/default/production.ini file to reference your Solr server.

Install Solr, running this command in a terminal:

sudo apt install -y solr-tomcat
orphan:

CKAN uses Solr as its search platform, and uses a customized Solr schema file that takes into account CKAN’s specific search needs. Now that we have CKAN installed, we need to install and configure Solr.

Note

These instructions explain how to deploy Solr using the Tomcat web server, but CKAN doesn’t require Tomcat - you can deploy Solr to another web server, such as Jetty, if that’s convenient on your operating system.

  1. Change the default port Tomcat runs on (8080) to the one expected by CKAN. To do so change the following line in the /etc/tomcat9/server.xml file (tomcat8 in older Ubuntu versions):

    From:
    
    <Connector port="8080" protocol="HTTP/1.1"
    
    
    To:
    
    <Connector port="8983" protocol="HTTP/1.1"
    

    Note

    This is not required by CKAN, you can keep the default Tomcat port or use a different one, just make sure to update the solr_url setting in your CKAN configuration file accordingly.

  2. Replace the default schema.xml file with a symlink to the CKAN schema file included in the sources.

    sudo mv /etc/solr/conf/schema.xml /etc/solr/conf/schema.xml.bak
    sudo ln -s /usr/lib/ckan/default/src/ckan/ckan/config/solr/schema.xml /etc/solr/conf/schema.xml
    
  3. Now restart Solr (use tomcat8 on older Ubuntu versions):

    sudo service tomcat9 restart
    

    Check that Solr is running by opening http://localhost:8983/solr/

    Note

    On Ubuntu 18.04 and older, instead of the Solr UI you may see an Internal Server Error page with a message containing:

    java.io.IOException: Cannot create directory: /var/lib/solr/data/index
    

    This is caused by a bug and you need to run some extra commands to fix it:

    sudo mv /etc/systemd/system/tomcat9.d /etc/systemd/system/tomcat9.service.d
    sudo systemctl daemon-reload
    sudo service tomcat9 restart
    
  4. Finally, change the solr_url setting in your CKAN configuration file (/etc/ckan/default/ckan.ini) to point to your Solr server, for example:

    solr_url=http://127.0.0.1:8983/solr
    

4. Update the configuration and initialize the database

  1. Edit the CKAN configuration file (/etc/ckan/default/ckan.ini) to set up the following options:

    site_id

    Each CKAN site should have a unique site_id, for example:

    ckan.site_id = default
    
    site_url

    Provide the site’s URL. For example:

    ckan.site_url = http://demo.ckan.org
    
  2. Initialize your CKAN database by running this command in a terminal:

    sudo ckan db init
    
  3. Optionally, setup the DataStore and DataPusher by following the instructions in DataStore extension.

  4. Also optionally, you can enable file uploads by following the instructions in FileStore and file uploads.

5. Start the Web Server and restart Nginx

Reload the Supervisor daemon so the new processes are picked up:

sudo supervisorctl reload

After a few seconds run the following command to check the status of the processes:

sudo supervisorctl status

You should see three processes running without errors:

ckan-datapusher:ckan-datapusher-00   RUNNING   pid 1963, uptime 0:00:12
ckan-uwsgi:ckan-uwsgi-00             RUNNING   pid 1964, uptime 0:00:12
ckan-worker:ckan-worker-00           RUNNING   pid 1965, uptime 0:00:12

If some of the processes reports an error, make sure you’ve run all the previous steps and check the logs located in /var/log/ckan for more details.

Restart Nginx by running this command:

sudo service nginx restart

6. You’re done!

Open http://localhost in your web browser. You should see the CKAN front page, which will look something like this:

../../_images/9.png

You can now move on to Getting started to begin using and customizing your CKAN site.

Note

The default authorization settings on a new install are deliberately restrictive. Regular users won’t be able to create datasets or organizations. You should check the Organizations and authorization documentation, configure CKAN accordingly and grant other users the relevant permissions using the sysadmin account.