Unfortunatly, installing pebl is not a one-stop process although it should take no longer than a few minutes. If you experience any problems, please contact me at abhikshah@gmail.com.
Pebl is known to run on Linux and Mac OSX and should also run on Windows and any platform that supports Python 2.5 and numpy. Pebl depends on the following packages:
- Numpy: tested with version 1.0.4 but should work with other recent versions
- Pydot: tested with version 1.0.2
- Matplotlib (optional): tested with version 0.91.2 but should work with other recent versions
- Graphviz (optional): any recent version of dot and neato
- IPython1 (optional): curently, require r2815 from svn trunk
- boto (optional): any recent version
Check what version of Python you have with:
python --version
You can download Python 2.5 from http://python.org/download
Note
Pebl requires Python 2.5 or greater and will not run under earlier versions.
easy_install lets you install python packages and their requirements in one easy step. Unfortunately, easy_install is not distributed with python and needs to be installed separately.
Run easy_install –help to make sure that it is in your path. On unix-type systems, it is usually installed in /usr/bin/ and on Windows in C:\Python2.5\Scripts\
Numpy can be tricky to install because it require C and Fortran compilers and several libraries. You can try installing it from source using easy_install:
easy_install numpy
If that doesn’t work or if you’d rather install using a binary package, consult http://www.scipy.org/Download.
You can now install (or upgrade) pebl and it’s required dependencies using easy_install:
easy_install pebl
Note
The current version of Pebl is 1.0.2
If you have downloaded the source code for Pebl (or installing pebl from svn), you can run sudo python setup.py install from pebl’s root directory instead.
To test pebl installation, you can run the automated unit tests by first install nose:
easy_install nose
and then running nose from the command line, in any directory:
nosetests -v pebl.test
If you haven’t installed all the optional dependencies, certain tests will fail but the name of the tests should make it clear whether it’s a normal feature or an optional one.
Certain features of pebl require additional dependencies. You only need to install these if you need the optional features. Without the optional features, you can load data, learn networks and output networks in dot format, although not as an image. If installing pebl on a cluster, the optional features are only necessary on the controller or the machine used interactively by the user.
Pebl uses Graphviz to visualize networks and matplotlib for plotting. If using Pebl on a cluster, these don’t need to be installed on the worker nodes.
Finally, install simplejson using easy_install:
easy_install simplejson
simplejson sometimes fails to install on Windows. In that case, you can use a binary package:
easy_install http://pebl-project.googlecode.com/files/simplejson-1.7.3-py2.5-win32.egg
To test the html-report feature, run:
nosetests -v pebl.test.test_result:TestHtmlReport
Apple XGrid is a grid solution for Apple computers that lets desktops and dedicated servers be used in a computational grid.
The Xgrid task controller only runs on platforms where XGrid is available (currently, only Mac OSX). Pebl uses the PyXG package to access the XGrid controller:
easy_install PyXg
To test the XGrid feature, find the location where pebl was installed by easy_install and navigate to the src/pebl/test.manual directory. There, create a file called xgridconfig.txt and include the relevant configuration parameters. The file should look like:
[xgrid]
controller = your.controller.com
password = yourpassword
grid = gridnumber
pollinterval = 30
Then, execute the test_xgrid.py test:
python test_xgrid.py xgridconfig.txt
Pebl will create and run 10 short learners over the XGrid. It will print some information about submitting and retrieving data from the Xgrid controller and finally print a list of machines where the tasks ran.
IPython1 is the next version of the popular IPython shell that also includes an interactive, clustering solution. Pebl can use IPython1 to execute learners in parallel.
You will need to install both IPython and IPython1. IPython1 is in active development and pebl requires a specific version. Once, IPython1 is oficially released, we will use that package:
easy_install ipython
svn co -r 2815 http://ipython.scipy.org/svn/ipython/ipython1/trunk ipython1-2815
cd ipython1-2815
sudo python setup.py install
To test the IPython1 feature, make sure that IPython1’s ipcluster is in the path and run:
nosetests -v pebl.test.test_taskcontroller:TestIPython1TC
The test will create three local IPython1 engines and run tasks on them. The test cannot, unfortunately, terminate the engines and that needs to be done manually. Simply run ‘ps’ and terminate the appropriate processes.
Amazon EC2 is an on-demand cloud computing solution from Amazon. It allows users to rent computing power on an as-needed basis. Pebl can reserve, create, use and terminate EC2 instances automatically. More information available at http://aws.amazon.com/ec2/
Before you can use Amazon’s EC2 platform, you need to register with Amazon and create the required authentication credentials. Consult the Getting Started Guide at Amazon for further information.
Pebl uses the boto package to interact with EC2:
easy_install boto
Also see the instructions above for installing dependencies for the IPython1 Task Controller (which is required by the EC2 Task Controller).
To test the EC2 feature, find the location where pebl was installed by easy_install and navigate to the src/pebl/test.manual directory. There, create a file called ec2config.txt and include the relevant configuration parameters. The file should look like:
[EC2]
access_key = YOUR_AMAZON_ACCESSKEY
secret_access_key = YOUR_AMAZON_SECRET_ACCESS_KEY
ami = ami-66a3470f # or any AMI with pebl, IPython1 and svn installed
key_name = keyname-to-use
credential = ~/foo/private-key-to-use
Then, execute the test_ec2.py test:
python test_ec2.py ec2config.txt
Pebl will reserve and use 3 EC2 instances, upgrade the version of pebl installed, run 10 shorts tasks on them and terminate the instances when finished. If all works sucessfully, Pebl will print the hostnames of the machines used.
Pebl developers will also need nose for testing and sphinx for generating html documentation:
easy_install nose
easy_install sphinx