Examples ======== Examples are king. Example 1 - A simple pull ------------------------- Fetch SWAMID metadata, split it up into EntityDescriptor elements and store each as a separate file in /tmp/swamid. .. code-block:: yaml - load: - http://md.swamid.se/md/swamid-2.0.xml - select - publish: "/tmp/swamid-2.0.xml" - stats This is a simple example in 3 steps: load, select, store and stats. Each of these commands operate on a metada repository that starts out as empty. The first command (load) causes a URL to be downloaded and the SAML metadata found there is stored in the metadata repository. The next command (select) creates an active document (which in this case consists of all EntityDescriptors in the metadata repository). Next publish is called which causes the active document to be stored in an XML file. Finally the stats command prints out some information about the metadata repository. This is essentially a 1-1 operation: the metadata loaded is stored in a local file. Next we'll look at a more complex example that involves filtering and transformation. Example 2 - Grab the IdPs from edugain -------------------------------------- Grab edugain metadata, select the IdPs (using an XPath expression), run it through the built-in 'tidy' XSL stylesheet (cf below) which cleans up some known problems, sign the result and write the lot to a file. .. code-block:: yaml - load: - http://mds.edugain.org edugain-signer.crt - select: "http://mds.edugain.org!//md:EntityDescriptor[md:IDPSSODescriptor]" - xslt: stylesheet: tidy.xsl - finalize: cacheDuration: PT5H validUntil: P10D - sign: key: sign.key cert: sign.crt - publish: /tmp/edugain-idp.xml - stats In this case the select (which uses an xpath in this case) picks the EntityDescriptors that contain at least one IDPSSODescriptor - in other words all IdPs. The xslt command transforms the result of this select using an xslt transformation. The finalize command sets cacheDuration and validUntil (to 10 days from the current date and time) on the EntitiesDescriptor element which is the result of calling select. The sign command performs an XML-dsig on the EntitiesDescriptor. For reference the 'tidy' xsl is included with pyFF and looks like this: .. code-block:: xml Example 3 - Use an XRD file --------------------------- Sometimes it is useful to keep metadata URLs and signing certificates used for validation in a separate file and pyFF supports XRD-files for this purpose. Modify the previous example to look like this: .. code-block:: yaml - load: - links.xrd - select: "!//md:EntityDescriptor[md:IDPSSODescriptor]" - xslt: stylesheet: tidy.xsl - sign: key: sign.key cert: sign.crt - publish: /tmp/idp.xml - stats Note that in this case the select doesn't include the http://mds.edugain.org prefix before the '!'-sign. This causes the xpath to operate on all source URLs, rather than just the single source http://mds.edugain.org . It wdould have been possible to call select with multiple arguments, each using a different URL from the file links.xrd which contains the following: .. code-block:: xml http://md.swamid.se/md/swamid-2.0.xml SWAMID MIIDdTCCAl0CBEY7EskwDQYJKoZIhvcNAQEEBQAwfzELMAkGA1UEBhMCU0UxEjAQ BgNVBAgTCVN0b2NraG9sbTESMBAGA1UEBxMJU3RvY2tob2xtMREwDwYDVQQKEwhT V0FNSS5zZTEPMA0GA1UECxMGU1dBTUlEMSQwIgYDVQQDExtTV0FNSUQgbWV0YWRh dGEgc2lnbmVyIHYxLjEwHhcNMDcwNTA0MTEwMjMzWhcNMTcwNTAxMTEwMjMzWjB/ MQswCQYDVQQGEwJTRTESMBAGA1UECBMJU3RvY2tob2xtMRIwEAYDVQQHEwlTdG9j a2hvbG0xETAPBgNVBAoTCFNXQU1JLnNlMQ8wDQYDVQQLEwZTV0FNSUQxJDAiBgNV BAMTG1NXQU1JRCBtZXRhZGF0YSBzaWduZXIgdjEuMTCCASIwDQYJKoZIhvcNAQEB BQADggEPADCCAQoCggEBAM6wXN3pVCo98SACS6JCHjSlWj83oNL/Ct+a9hmAx1NZ SKg7lnEJYwWBvzJt5o/47jRQbGm94a45Yy5LVoXq4XyCKINhMxSwbRROvr8Hw6tg P1Z9dk5Jjejvus3gyaH3+EuEyP4aIjTlgmHDwW6HOv/m/4bOXSHB4Pisn7aocqU7 kjpOn1f0cGodWOgGO4tP7KXs6ndcLhIkW+e/B80WEr0kocuc/pvx+aLuKSkttk/A fP1DFs5sqX31RXQKGrB/uEEYVv1Qvneig+RXGSbqk2Tab3BcLE/Cjnfi9Q9cH/jR eL/YSSafGtl+EBgXKszxjMtELhiEWsL9RrMu1HUkBusCAwEAATANBgkqhkiG9w0B AQQFAAOCAQEAkXaa61gp/lkEDNRFc0bzH3ZyoUFgol64F1zdAwBS3xnsCkTnAXt3 p452daEyz+0UR5J/BruMOyvR57w1m7ckVnx/sAgRgaD6gQlUWehjKPEsx8o5iDfO 5R1V5Rn2o7+0VuIJDDObEAtMwqn2Nk6TTzsUVfz5y9nUQAxBz3EqXnnSgRwqSwRF yiVkpVfwtUHIolAf6O2N9Fg1jqoqt4mQCOyRZpD0/5SRYESTY6TJjTmvoh+zOPlI yEiw+Zrl/FWjXtBnRnz8AVT5NRzYiMHdbTHs0Fh6elsb5b9gTBo7j6+t36m7oo2K DaWWpMWvuWHugEqvIAXDCI/HzTbbiWm9NQ== InCommon InCommon Metadata The structure of the file should be fairly self-evident. Only links with @rel="urn:oasis:names:tc:SAML:2.0:metadata" will be parsed. If a KeyInfo with a X509Certificate element (usual base64-encoded certificate format) then this certificate is used to validate signature on the donwloaded SAML metadata. Note that while 'load' supports validation based on certificate fingerprint the XRD format does not and you will have to include Base64-encoded certificates if you want validation to work. Example 4 - Sign using a PKCS#11 module --------------------------------------- Fetch SWAMID metadata (and validate the signature using a certificate matching the given SHA1 fingerprint), select the Identity Providers, tidy it up a bit and sign with the key with the label 'signer' in the PKCS#11 module /usr/lib/libsofthsm.so. If a certificate is found in the same PKCS#11 object, that certificate is included in the Singature object. .. code-block:: yaml - load: - http://md.swamid.se/md/swamid-2.0.xml 12:60:D7:09:6A:D9:C1:43:AD:31:88:14:3C:A8:C4:B7:33:8A:4F:CB - select: "!//md:EntityDescriptor[md:IDPSSODescriptor]" - xslt: stylesheet: tidy.xsl - sign: key: pkcs11:///usr/lib/libsofthsm.so/signer - publish: /tmp/idp.xml - stats Running this example requires some preparation. Run the 'p11setup.sh' script in the examples directory. This results in an SoftHSM token begin setup with the PIN 'secret1' and SO_PIN 'secret2'. Now run pyff (assuming you are using a unix-like environment). .. code-block:: bash # env PYKCS11PIN=secret1 SOFTHSM_CONF=softhsm.conf pyff --loglevel=DEBUG p11.fd Example 5 - MDX --------------- Runing an MDX server is pretty easy using pyff. Lets start with the links.xrd file (cf example above) and add this simple pipeline. .. code-block:: yaml - when update: - load: - links.xrd - break - when request: - select - pipe: - when accept application/xml: - xslt: stylesheet: tidy.xsl - first - finalize: cacheDuration: PT5H validUntil: P10D - sign: key: sign.key cert: sign.crt - emit application/xml - break - when accept application/json: - xslt: stylesheet: discojson.xsl - emit application/json: - break The big difference here are the two when commands. They are used to select between the two main entrypoints for the pyff server: the update flow and the request flow. The update flow is run repeatedly and is usually used for updating the internal metadata repository. The request flow is called every time an MDX request is submitted. The internal when statements are used to provide basic content negotiation for the MDX request. Content negotiation is based both on the Accept header and on the extension (suffix) on the URL - ending a resource with '.json' selects application/json, etc and overrides the Accept header. The only new commands here are emit, break and first. The emit command transforms the result into the appropriate output format (UTF-8 encoded text), the break terminates the pipeline. The first command strips the outer EntitiesDescriptor if only a single EntityDescriptor is present in the active document which is consistent with expected behaviour for the MDX protocol. The behaviour of the select command in the request pipeline is a bit different: the select operates on a query fed to the request pipeline from the HTTP server that runs the command. This is called implicit select. Now start pyffd: .. code-block:: bash # pyffd -f --loglevel=DEBUG -p /var/run/pyffd.pid mdx.fd This should start pyffd in the foreground. If you remove the ``-f`` pyff should daemonize. For running pyff in production I suggest something like this: .. code-block:: bash # pyffd --loglevel=INFO --log=syslog:auth --frequency=300 -p /var/run/pyffd.pid --dir=`pwd` -H -P80 mdx.fd This starts pyff on the interface :80 and uses the current directory as the working directory. If you leave out --dir then pyffd will change directory to $HOME of the current user which is probably not what you want. In this case logging is done through syslog (the auth facility) and with log level INFO. The refres-rate is set to 300 seconds so at minimum your downstream feeds will be refreshed that often.