Creating documentation from our code by our code

I hate having to do documentation as much as the next developer, especially when it’s for something as simple as some codes or underlying dependencies you use in your pipeline! Just imagine it, you’re working in your Agile ways, you get given a task to incorporate a new feature which uses some of the businesses codes like ‘WEDPA’, ‘UUQL’ or some other strange hieroglyphic.

You get cracking with your development and your code speaks for itself, a work of art if you don’t say so yourself! You close the ticket and document up how to use the feature but realise you’ve not documented any of its dependencies and what they mean! Whats more the business might change those codes and you would have to update the documentation every time you added new codes or updated them...

The Solution

Create a table to hold all of your dependencies, describing in detail all that they do (even better would be to grab that data from a static data table where whoever entered the data would populate that for you).
Load this table into memory if you haven't created it yourself in the pipeline (eg from database -> pandas)
Push this table into a Confluence page where you store all of this information so that its easily readable and visible to the Business, not just a csv or left somewhere as a comment in the code

Demo Confluence environment

To get us up and running we can spin up two docker containers, one running confluence and the other running a jupyter notebook. Be sure to follow through the step by step instructions on getting your confluence server up and running. It’s blissfully easy!

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14


# Run our confluence container, NOTE: ensure you have a folder called confluence 
#   or change where the volume will be stored (by changing ~/Desktop/confluence to your required folder)
docker run -v ~/Desktop/confluence:/var/atlassian/application-data/confluence --name="confluence" -d -p 8090:8090 -p 8091:8091 atlassian/confluence-server

# Quick function to get the ip of a container by name
docker-ip() {
	docker inspect --format '{{ .NetworkSettings.IPAddress }}' "$@"
}

# The ip of the confluence container
export CONFLUENCE_IP=`docker-ip confluence`

# Running our jupyter notebook container with the IP as an environment variable
docker run -d -p 8888:8888 --name notebook -e CONFLUENCE_IP=$CONFLUENCE_IP -v ~:/home/jovyan/work jupyter/scipy-notebook start-notebook.sh --NotebookApp.token=''

Creating and using confluence API wrapper

I have created a quick and dirty confluence wrapper and it is freely available on GitHub if you have any issues please raise them and pull requests are most welcome.

After we have installed this by running pip install git+https://github.com/ghandic/confluenceapi.git we should be good to go with the jupyter notebooks.

First of all we will make our pages in confluence leaving some pages empty for the code to fill in and update on every production pipeline run.

Now we can add html content to that page by following the example notebook provided:

Updating pages in confluence

Neccessary imports

In [ ]:

import os
from confluenceapi import Confluence

Setting up our credentials

In [ ]:

conf_server = os.environ['CONFLUENCE_IP'] + ':8090'
credentials = ('admin', 'Password123')

Create a confluence object ready to submit requests

In [ ]:

lc = Confluence(conf_server, credentials)

Add a page

In [ ]:

lc.add_page('Page about DS', 'Data Science')

Update a page with raw HTML

In [ ]:

lc.update_page('Page about DS', 'Data Science', '<h1 style="color:red;">This is a new title</h1>')

Delete a page

In [ ]:

lc.delete_page('Page about DS', 'Data Science')

Another method we may want to document is by uploading files, maybe its a picture (.png), log file (.txt), etc we can do this by using the following methods:

Attachments pages in confluence

Neccessary imports

In [ ]:

import os
from confluenceapi import Confluence

Setting up our credentials

In [ ]:

conf_server = os.environ['CONFLUENCE_IP'] + ':8090'
credentials = ('admin', 'Password123')

Create a confluence object ready to submit requests

In [ ]:

lc = Confluence(conf_server, credentials)

Add an attachment to our page

In [ ]:

lc.upload_attachment('demo.txt', 'Page about DS', 'Data Science', 'First upload!')

Update our attachments on our page

In [ ]:

lc.update_attachment('demo.txt', 'Page about DS', 'Data Science', 'Second upload!')

Delete an attachment on our page

In [ ]:

lc.delete_attachment('demo.txt', 'Page about DS', 'Data Science')

To see more examples check out the full GitHub repo.

Andy Challis

Andy Challis

Apis are friends not food - Confluence

Creating documentation from our code by our code

Demo Confluence environment

Creating and using confluence API wrapper

Updating pages in confluence

Neccessary imports

Setting up our credentials

Create a confluence object ready to submit requests

Add a page

Update a page with raw HTML

Delete a page

Attachments pages in confluence

Neccessary imports

Setting up our credentials

Create a confluence object ready to submit requests

Add an attachment to our page

Update our attachments on our page

Delete an attachment on our page