How to make a Parliamentary chart with Tableau Desktop

A case study to go further

IV. Labels for the groups▲

The main use for a Parliamentary chart is representing the majority or finding alternate majorities. Our chart is not very convenient for this, as it does not display any figure. Let us try to add the Group and MEPs (Count) fields on Label.

Image non disponible — Hemicycle with labels, first try

We now have our labels, that has provoked a series of issues, from the most trivial to the most complex:

as the Sector pill is not the first one anymore, we have lost the sort order; of course, we just have to drag it in first position again
some labels may be missing; just click on the Label tool and check Allow labels to overlap other marks
the label with the number of MEPs in the white group does show, betraying the presence of fillers; a little IF would be enough to remplace it with a NULL (we will fix this later)
the central group, Renew, has two labels, the one with 100 MEPs and the other with 1; this is where the real difficulties begin

The grain level of our chart is Sector, so technically it is normal there is a label per sector rather than per group, but that’s quite unfortunate. Instead of two labels with 100 and 1 MEPs, we want one with 101 MEPs, and it has to be placed on the side with the most MEPs.

There are two policies for this, either LODs or Table Calculations. You can test whichever you prefer, or even try both. In any case, I invite you to Duplicate your current sheet, so you can test various versions and without breaking anything.

Another improvement would be coloring the label like its group; to do this, click the Label tool, then Font, and check Match Mark Color. As an extra perk, the label for the white group becomes white on white. I have not kept this option, as we will have to use formulas to fix the double label of the central group, but you can perfectly use it.

IV-A. With LODs▲

Our chart includes two dimensions, Group and Sector, with a hierarchy relationship between them (a sector belongs to one group, a group can include two sectors). Hence, Tableau can perform calculations on three different levels:

the detailed data level, i.e. MEP-level
the grain level of the chart, i.e. sector-level
the group level, which determines color

Here is a sketch of the data at these three levels for Renew and (for comparison), for both neighboring groups, Greens and EPP:

Our labels are at the chart grain level (Sector), we will use LODs to get data at the upper level (more aggregated) and at the lower level (more detailed).

IV-A-1. Exercise: compute the headcount at group-level**▲

Create a new calculated field, LOD Label, to get the count of MEPs at group-level, then use it in the labels instead of COUNT(MEPs).

IV-A-2. Answer▲

The easier way is to directly position the calculation at group-level with a FIXED:

LOD label

Sélectionnez

{FIXED [Group] : COUNT([MEPs])}

This figure should be displayed as it is and not aggregated, so you should make it a dimension.

As the labels are displayed in a context with two dimensions, Group and Sector, another way is to exclude the Sector dimension to position the calculation in a context with only Group dimension:

LOD label

Sélectionnez

{EXCLUDE [Sector] : COUNT([MEPs])}

Tableau refuses to make a dimension with this version, so you will have to settle for a discrete measure.

We now have got the right figure for the central group, we just have to hide the label of the minority sector. The syntax will quickly become intricate, so let us do it step by step.

IV-A-3. Exercise: get the headcount of the majority sector***▲

Now, update your formula to get the headcount of the bigger sector in the group. On my sketch above, your formula should do this:

IV-A-4. Answer▲

First, you have to climb up to the group level, and there get the MAX of the lower level headcounts, i.e. of sector headcounts. To tell it the other way round, we COUNT at sector-level and then we MAX these figures at group-level. Technically speaking, you have to nest one LOD into another:

LOD label

Sélectionnez

{EXCLUDE [Sector] : COUNT([MEPs])}

The maximum headcount should now display on both sectors:

IV-A-5. Exercise: compare headcounts***▲

Now, edit the formula to display the word ‘label’ if the sector headcount matches the headcount of the bigger sector in group, and else the word ‘nothing’.

IV-A-6. Answer▲

The sector headcount is just COUNT([MEPs]), the headcount of the bigger sector in group is the nested formula we just wrote, so the natural way of coding this would be:

LOD label (wrong syntax)

Sélectionnez

/* comparison test (mixed levels error) */
IF COUNT([MEPs]) = {FIXED [Group]: MAX({FIXED [Sector] : COUNT([MEPs])})}
THEN "label"
ELSE "nothing"
END

However, Tableau considers this formula incorrect, arguing that it ‘cannot mix aggregate and non-aggregate arguments with this function’.

Actually, the result of an LOD formula is syntactically considered as a detail-level value. Hence, our {FIXED…} has the correct value of 100, but this 100 is at MEP-level. To compare it with an aggregate value like the sector headcount, we have to aggregate it. You can use whichever aggregate function among AVG, MEDIAN, MIN or MAX, even SUM will do (we fixed on GROUP and there is only one group for the sector). My preference is for ATTR, because in unexpected situations it will kindly raise an error rather than send a wrong result.

As we discussed previously, Tableau aggregates measures in the context specified by dimensions. The attributes are a third qualification, that is not available on the Data Pane, but only on shelves. Attributes do not participate in context specifying, but are expected to bear a unique value in the dimensional context where they are displayed.

The ATTR function (conversion into attribute) is very similar to an aggregate function, but is not officially considered as such. Its mechanism is very simple: if all detailed rows bear the same value, it returns this value, and else it warns the developer by returning an asterisk character (*).

Here is the correct formula (ATTR version):

LOD label

Sélectionnez

/* comparison test (mixed levels error) */
IF COUNT([MEPs]) = ATTR({FIXED [Group]: MAX({FIXED [Sector] : COUNT([MEPs])})})
THEN "label"
ELSE "nothing"
END

Tableau considers this formula as too complex for a dimension, so if it was one Tableau automatically converts it to a discrete measure. Hence the pill may become red as this measure can no longer be used as a dimension on the viz.

If ever you encounter this issue, just drag again the LOD label field on the red pill, so Tableau now displays it as an AGGregate calculation.

With this fix, all the labels should now show, except the one for the right part of Renew:

IV-A-7. Exercise: compute the text for labels**▲

Now, remove any other field from Label, and finalize the LOD label formula that will display the name and the headcount once for each group, except the white group.

IV-A-8. Answer▲

You just have to assemble the pieces, and pay attention not to mix aggregates and details, nor texts and numbers:

LOD label

Sélectionnez

/* if white group, no label,
   else, if bigger sector (or lone sector), group name and headcount,
   else (if minority sector), no label */
IF ATTR([Group]) = "white"
  THEN NULL
ELSEIF COUNT([MEPs]) = ATTR({FIXED [Group]: MAX({FIXED [Sector] : COUNT([MEPs])})})
  THEN ATTR([Group]) + ": " + STR(ATTR({FIXED [Group] : COUNT([MEPs])}))
ELSE
  NULL
END

All this fuss just for hiding two labels? Yes. With Tableau, the finishing details are often the harder to get. But if you have followed it up to this, you now are an accomplished LODer!

IV-B. With Table Calculations▲

In a relational database, tables store data rows with no specific order; according to the customary analogy, records are in their table like marbles in a bag. With LODs, we stayed within the boundaries of this logic and we built our formulas by matching data on different levels of aggregation.

Tableau proposes an alternate mechanism, calculating from data already displayed on the chart. With this approach, the sort order of the viz can be used to implement notions like ‘previous data’, ‘next data’, ‘first row’, ‘last row’, and so on. This mechanism bears the weird name of Table Calculation – the said ‘table’ does not refer to tables on the data source, but to the data set underlying the chart. Probably you have already used or met this system? Tableau represents it by displaying a delta (Δ) on the pills using it.

Now, please have a backward look at the hemicycle as it was at the beginning of this section, i.e. with the two labels 100 and 1 for the Renew group. The sectors are ordered along the Clockwise Order field, from center-right to center-left via far right, fillers and far left.

By construction, the duplicate label is on the central group (the one split by the median). Hence, the two sectors concerned are necessarily the first and the last clockwise. Here is a new version of my sketch with the three levels of data (MEP, Sector and Group) for the Renew group and its neighbors, reorganized to follow the clockwise order.

When can now think in terms of positions:

we have to hide either the first or the last label (the one with the smaller headcount)
the non-hidden one should display the total headcount of both
the label for white group should also be masked
the other labels should display normally

This way of telling it is simpler and more intuitive than the one with LODs, because everything is expressed at the sector level (grain level of the chart). In practical terms however, implementing it is not totally obvious…

Everything relies upon our capacity to tell the first and last sectors apart from others, so let us start by trying to display their order index. Create a new calculated field, TC Label, with just INDEX() as a formula. This function returns the order of items on the chart. Drag this field to Label, you should get this result (a bit disappointing):

IV-B-1. Exercise: set the Table Calculation OK***▲

Now, click the Δ symbol on the TC Label pill, choose Edit Table Calculation, and torture it until the sectors are numbered 1 to 10 clockwise.

IV-B-2. Answer▲

The Table Calc should be performed along both dimensions, Group and Sector, at the deepest level (sector), it should not restart, and it has to follow the Clockwise Order.

A simpler way of getting the right order is to swap dimensions so that Sector gets priority:

Either of these settings should get you the expected result:

The Table Calculation is performed according to all dimensions used on the sheet, classified as addressing dimensions (checked) and partitioning dimensions (unchecked). The combination of addressing dimensions specifies the items to count, while the combination of partitioning dimensions specifies the sets to count within. In our case, the chart includes two dimensions, Group and Sector:

if none is checked, each Group/Sector combination forms a partition with only one item, so they are all #1
we want to count sectors, hence we have to check this dimension so it defines the items to count; however, if the Group dimension is unchecked, it is considered as partitioning and the counting is performed within each group, so they are all #1 except for Renew, which has #1 and #2
if we now check both dimensions, there is no partitioning anymore; Tableau numbers the sectors on the whole chart, from #1 to #10.

You have now determined the right options for the Table Calculation. Let us edit the calculated field in the Data Pane and set its Default Table Calculation accordingly.

IV-B-3. Exercise: test first and last positions*▲

The function to get values from another sector is LOOKUP, it relies on two other functions, FIRST and LAST. Both these latter functions (to use with no argument), return the gap between the current label and the first or last one… maybe that’s not very easy to figure out? Edit the calculated field to display the result of FIRST and LAST on each sector.

IV-B-4. Answer▲

You can use the formula below, or many other variants. If the result does not change, or if the pill becomes red, re-drag the calculated field to replace the erroneous pill.

TC label

Sélectionnez

"First: " + STR(FIRST()) + CHAR(10) + "Last: " + STR(LAST())

I have used STR and concatenations to get both values at once, but of course you could display them alternatively.

CHAR(10) represents the character #10 in ASCII code, i.e. line feed. I could also have used #13, carriage return.

IV-B-5. Exercise: get the last value*▲

We will now test the LOOKUP function. Edit the calculated field so that the headcount of the last sector (100) displays on all labels, like below:

IV-B-6. Answer▲

No subtlety here, just adapt the syntax example from the Tableau doc:

TC label

Sélectionnez

LOOKUP(COUNT([MEPs]), LAST())

IV-B-7. Exercise: compute label text**▲

You now have all the parts, you just have to devise the eventual formula for labels.

If you have read the ‘With LODs’ section above, you should be able to solve the ‘cannot mix aggregate and non-aggregate’ errors. If not, I invite you to view this bit.

IV-B-8. Answer▲

Here is an example solution, but of course there are many ways to write this formula.

TC label

Sélectionnez

// in all cases, group name and colon
ATTR([Group]) + ": " 
+ STR(
  // case 1, white group => NULL
  IF ATTR([Group]) = 'White' 
    THEN NULL
  // case 2, regular sector => headcount
  ELSEIF FIRST() != 0 AND LAST() != 0
    THEN COUNT([MEPs])
  // case 3, bigger sector among first and last => first headcount + last headcount
  ELSEIF FIRST() = 0 AND COUNT([MEPs]) >= LOOKUP(COUNT([MEPs]), LAST())
      OR LAST() = 0  AND COUNT([MEPs]) > LOOKUP(COUNT([MEPs]), FIRST())
    THEN LOOKUP(COUNT([MEPs]), FIRST()) + LOOKUP(COUNT([MEPs]), LAST())
  // case 4, smaller sector among first and last => NULL
  ELSE
    NULL
  END
)

The big IF… END computes the headcount, then it is converted to string and concatenated to group name and colon; the rule of NULLs propagation will provide us a NULL label for the white group and the minority sector. Last, if ever the central group gets split in two exactly equal parts, the label will arbitrarily show on the first sector (i.e. on the right side), as the conditions are ‘first >= last OR last > first’.

IV-B-9. Finishing▲

You can now hide the color legend, as it is redundant with the labels.

As you can see on this example, the difficult part is to get the right setting for the Table Calculation. Once you have got it, the syntax is simpler and more direct than the one for LODs.

V. Add percentages for arcs▲

We have set up a hemicycle and display the headcount for each group, in other words we now represent correctly the raw data. Now, we will enrich the chart with two features that are a matter of political analysis: the political balance between bigger political tendencies (e.g. left wing vs right wing vs far right), and the percentages for these big blocks.

V-A. A second semicircle for arcs▲

Beyond the political weight of each political group, we would like to display insights about the political balance in a parliament. In most national chambers we would display the government’s majority versus opposition(s), but the EU is a bit different, as there is no established majority and the Commission comes from national governments, not the EU Parliament. As there are no formal alliances between groups, I would rather avoid the terms ‘coalition’ or ‘block’, and I will use the geometric concept of ‘arc’, which does not assume any agreement between its items, but just implies they are adjacent on the left-right axis.

I will propose you the following breakdown:

a first arc for the left wing, from The Left to Greens
a second arc for the (center and) right wing, Renew and EPP
a third arc for the nationalist or far right groups, ECR and ID
a residual arc for ‘others’, i.e. the NI group
and a technical arc for the white group

If your political categories diverge from mine, please feel free to organize your arcs in another way, with no consequence for the rest of this case study.

We will display these arcs on a second pie, and we will superimpose this second pie with the first one. As with groups, Tableau forces us to begin at the top middle position, so we will have to split the central arc (actually, the second one) into two sub-arcs (the first one on the right side, with the sectors Renew1 and EPP1, and the second one on the left side, with just Renew4).

V-A-1. Exercise: create arcs and sub-arcs*▲

Implement both these fields, arcs and sub-arcs, with the technically simplest method.

V-A-2. Answer▲

No need for an ‘Arc’ formula, just right-click the Group field and choose Create > Group to get arcs (yes, arc are groups of Groups, sorry for the homonymy).

For sub-arcs, you can either group sectors or create a formula to concatenate the quadrant number to the arc.

Sub-arc

Sélectionnez

[Arc] + STR([Quadrant])

V-A-3. Exercise: duplicate the pie to prepare superimposing***▲

We now want two identical pies, next to each other on the same worksheet. If you already know the trick for this, your deserve your three stars and you can go directly to the answer. If you do not, I will try to make you guess, but I have to confess this is rather convoluted.

A regular Tableau pie chart does not use the Rows or Columns shelves. If you put a dimension on either shelf, you get a pie chart for each dimension value. For example, if we were working on the US Congress, with a ‘Chamber’ dimension to distinguish between Senate data and House of Representatives data, I could put this Chamber dimension on Columns to get a Senate pie next to a House of Representatives pie.

Our case is a bit different: first we do not have a relevant dimension, and moreover we will eventually superimpose both pies using Dual Axis, which can be used on measures only…

V-A-4. Answer▲

We will use Dual Axis, so we need a measure; both our pies must be identical and positioned at the same height, so the measure has to bear exactly the same value on each share of both pies. It has to be constant on detailed data, and it has to remain constant when it gets aggregated.

In the screenshots below, I am using a calculated field named Constant, whose formula is ATTR(1), which will result in 1 in all cases. You can use as well MIN(1), which is customary on Tableau forums, MAX(1), MEDIAN(1), MIN(123456), SUM(0) or even a simple zero (aggregated by sum, it will still be zero).

Create such a Constant, then drag it to the Columns shelf; that makes the pie smaller, but we will fix that later. Now catch the Constant pill you have just dropped on Columns, drag it a little farther to the right, and keep pressed the Control key on your keyboard.

You now have your two pies:

Please notice you now have three sections on the Marks shelf, so you can either edit both pies together, or just the first one, or just the second one.

V-A-5. Exercise: display political arcs**▲

Now, edit the second pie so it displays arcs instead of groups. Make all adaptations necessary, and use dimmed colors.

V-A-6. Answer▲

First, click on the third section of Marks shelf, the one for pie #2. Here is the to-do list:

on Color, drag Arc instead of Group
on Detail, drag Sub-arc instead of Sector
sort sub-arcs according to ascending average of Clockwise Order; if needed, make it the first pill
edit the colors to get something politically meaningful, and choose a low Opacity (40% on the screenshot below)
duplicate the LOD Label field (or the TC Label), then rename it to Arc Label
in the formula of Arc Label, replace all occurrences of [Group] by [Arc], 'White' by ' ', and all occurrences of [Sector] by [Sub-arc]
use this Arc Label on Label instead of LOD Label or TC Label
hide the horizontal axis (right-click it and uncheck Show Header)
if needed, hide the legend of AGG([Constant]) and the color legend

You should get something like this:

V-A-7. Exercise: superimpose the pies**▲

We are now ready to superimpose both pies. Switch to Dual Axis, adjust the sizes so that the pie with arcs is slightly bigger than the one with groups, then put the small one over the big one.

V-A-8. Answer▲

To get Dual Axis, just right-click the second Constant pill. Getting the sizes correct is a bit more complicated: as long as there is a COUNT([MEPs]) pill on the Size tool of both pies, they will always get the same size. Remove it from both pies so you can set sizes manually.

Then, you will probably notice that the colors of the groups pie are darkened: this is due to the arcs pie overlapping the groups pie. Just swap both Constant pills to reverse the overlapping order, and you get your original colors back.

V-B. Computing percentages▲

The underlying question in a Parliamentary Chart is always who can get an absolute majority, i.e. over 50% of seats. Experts in EU politics know that since the departure of UK MEPs, this is 353 votes, but of course displaying the headcounts of arcs as percentages would make things easier for everyone.

Probably you have already used the ‘Percent of Total’ table calculation to display pie shares in percents? Alas, we cannot use this because of the fillers, who make the total headcount twice as big as the real figure.

V-B-1. Exercise: setting up the ratio**▲

Create a new calculated field, % Arc.

if your arc labels use LODs, get the headcount from the label formula, normally that is {FIXED [Arc]: COUNT([MEPs])}, then copy-paste it to the new calculated field and made a correct percentage out of it.
if your arc labels use Table Calculations, do the same thing, but do not bother with white group nor Renew for the moment; your headcount should just be COUNT([MEPs])

whatever your case (LOD or TC), find two significantly different solutions for fixing the percentage calculation

V-B-2. Answer▲

A first obvious solution is to divide by the total headcount nonetheless, and then multiply by two:

% Arc

Sélectionnez

// LODs version (ATTR to avoid aggregation)
ATTR({FIXED [Arc]: COUNT([MEPs])} * 2 / {COUNT([MEPs])})

// Table Calc version
COUNT([MEPs]) * 2 / TOTAL(COUNT([MEPs]))

This is correct as long as the fillers are an exact half of the total headcount. However, on next chapter, we will exclude some fillers, so this formula will become wrong. Hence my demand for an alternative solution.

So the total headcount (denominator) should include only real MEPs and ignore fillers. For this, we just have to count a field where fillers are NULL, e.g. LR Group Position. As aggregate functions ignore NULLs, only real MEPs will be counted.

% Arc

Sélectionnez

// With LODs
ATTR({FIXED [Arc]: COUNT([MEPs])} / {COUNT([LR Group Position])})

// With Table Calc
COUNT([MEPs]) / TOTAL(COUNT([LR Group Position]))

Keep this second version rather than the multiplication by two, format it as percentage with one decimal position, and test your percentages on a new sheet. You should get these results:

Left wing: 35.7%
Right wing: 39.4%
Others: 6.7%
Far right: 18.2%

V-B-3. Exercise: format into percentages**▲

The Arc label formula has the mechanic to display only the right labels, and the % Arc formula has the right figure. It should be easy to combine them, however there is an unexpected issue: Tableau has no function to format numbers into text (imagine C without sprintf, or Visual Basic without Format). You have to make do nonetheless.

V-B-4. Answer▲

The easier is to keep two different calculated fields, one for the arc name and the other for the percentage, so you can set a default number format to the percentage.

Arc Label

Sélectionnez

// With LODs
IF ATTR([Arc]) = ' '
  THEN NULL
ELSEIF COUNT([MEPs]) = ATTR({FIXED [Group]: MAX({FIXED [Sub-arc] : COUNT([MEPs])})})
  THEN ATTR([Arc])
ELSE
  NULL
END

// With Table Calcs
IF ATTR([Arc]) = ' ' 
  THEN NULL
ELSEIF FIRST() != 0 AND LAST() != 0
    OR FIRST() = 0  AND COUNT([MEPs]) >= LOOKUP(COUNT([MEPs]), LAST())
    OR LAST()  = 0  AND COUNT([MEPs]) > LOOKUP(COUNT([MEPs]), FIRST())
  THEN ATTR([Arc])
ELSE
  NULL
END

% Arc

Sélectionnez

// With LODs
IF ATTR([Arc]) = ' '
  THEN NULL
ELSEIF COUNT([MEPs]) = ATTR({FIXED [Group]: MAX({FIXED [Sub-arc] : COUNT([MEPs])})})
  THEN ATTR({FIXED [Arc]: COUNT([MEPs])} / {COUNT([LR Group Position])})
ELSE
  NULL
END

// With Table Calcs
IF ATTR([Arc]) = ' ' 
  THEN NULL
ELSEIF FIRST() != 0 AND LAST() != 0
  THEN COUNT([MEPs]) / TOTAL(COUNT([LR Group Position]))
ELSEIF FIRST() = 0  AND COUNT([MEPs]) >= LOOKUP(COUNT([MEPs]), LAST())
    OR LAST()  = 0  AND COUNT([MEPs]) > LOOKUP(COUNT([MEPs]), FIRST())
  THEN (LOOKUP(COUNT([MEPs]), FIRST()) + LOOKUP(COUNT([MEPs]), LAST())) / TOTAL(COUNT([LR Group Position]))
ELSE
  NULL
END

If ever you want to stick to the one-formula solution, you will have to format the percentage yourself with a good deal of concatenations:

Arc Full Label

Sélectionnez

// With LODs
IF ATTR([Arc]) = " " 
  THEN NULL
ELSEIF COUNT([MEPs]) = ATTR({FIXED [Group]: MAX({FIXED [Sub-arc] : COUNT([MEPs])})})
  THEN ATTR([Arc]) + ": " + STR(
    ROUND(
      100 * ATTR({FIXED [Arc] : COUNT([MEPs])} / {COUNT([LR Group Position])}),
      1)
  ) + "%"
ELSE 
  NULL
END

// With Table Calcs
ATTR([Arc]) + ": " + STR(
  IF ATTR([Arc]) = ' ' 
    THEN NULL
  ELSEIF FIRST() != 0 AND LAST() != 0 
     THEN 
      ROUND(
        100 * COUNT([MEPs]) / TOTAL(COUNT([LR Group Position])),
        1)
  ELSEIF FIRST() = 0  AND COUNT([MEPs]) >= LOOKUP(COUNT([MEPs]), LAST())
      OR LAST()  = 0  AND COUNT([MEPs]) > LOOKUP(COUNT([MEPs]), FIRST())
    THEN ROUND(
        100 * (LOOKUP(COUNT([MEPs]), FIRST()) + LOOKUP(COUNT([MEPs]), LAST())) / TOTAL(COUNT([LR Group Position])),
        1)
  ELSE
    NULL
  END
) + "%"

OK, you now have all the data necessary to run for president of the European Parliament!

VI. Overflowing the semicircle▲

Finally, let us consider the visual display, at once to improve aesthetics and to discover certain mechanisms and limits of Tableau.

Up to now, we have based our works on a strict definition of the hemicycle as a semicircle, i.e. forming a circular arc with an angle of 180 degrees. But in real life, most assemblies (as well as parliamentary charts) go beyond this angle. As an example, the seating plan for Strasbourg displays an arc for MEPs of around 200°; with extra seats left and right for EU Council and EU Commission, the total arc looks close to 240°.

That should be simple enough in Tableau: we just have to create a parameter the user can use to choose the arc, and filter out the matching number of fillers. Hence, the point is how to transform a quantity into a selection. The first obvious solution would be to use Excel, Tableau Prep, or any other tool to add an incremental row number on the source file. Then, if you have to filter out 20 fillers, you will just have to filter on ‘Group different from white OR row# greater than 20’, and it will suppress the 20 first fillers. If you actually have to build up a Parliament chart overflowing beyond 180°, that would probably be the simplest solution.

As this article aims at exploring the possibilities of Tableau Desktop, I would like to discuss hereafter solutions with no other tool and without editing the data file. So, first, can we perform this numbering within Tableau Desktop?

The issue is that all functions for such a numbering (INDEX, RANK, RUNNING_COUNT, WINDOW_COUNT, etc.) rely on Table Calculation; as we want to number individual data rows, this Table Calculation should address the Id (or the Id / Table Name combination, or any other unique identifier, like the Full Name). As Table Calculations are performed on the data used in the chart, we would have to include the identifier in the chart (a priori on Detail), and the chart grain would go down to MEP-level. We would get a label per MEP instead of a label per sector, and we would have to review all our formulas to compute which ones should display or not among the potential 1,410 labels…

Although it is feasible (e.g. we could compute a median MEP in each group to position the label at mid-arc), it would be quite tedious, and we would not learn anything new about Tableau.

If you find a better idea, please propose it in comments!

I would like to propose you two different approaches; neither produces a perfect result, but they would probably be considered good enough in most projects, and they will lead us into new aspects of Tableau. Therefore, I invite you to duplicate your latter worksheet, so we can compare both results.

VI-A. With an overflowing percent▲

Our basic hemicycle has an angle of 180°, this is when there are as many fillers as MEPs. Maximum overflow would be an angle of 360°, i.e. a full pie, without any filler.

VI-A-1. Exercise: create the percentage of overflow*▲

Do what is necessary for making the user able to choose a percentage of overflow from 0% to 100%.

VI-A-2. Answer▲

You of course have to create a Parameter, e.g. ‘Overflow %’, with a Range from 0 to 1 (i.e. 0% to 100%), then Show it.

As we have set a validity range and a step, the parameter is displayed as a slider:

If you do not know Tableau parameters, this feature aims to enable the end user to input himself or herself a value; this value can then be used as a filter, in calculations, in a top or a set and/or directly displayed on the viz. To get the Create Parameter window above, right-click any dimension in the Data Pane, and choose Create > Parameter. Once you have set up the parameter, right-click it and choose Show Parameter.

VI-A-3. Exercise: create the flag▲

Create now a calculated field, ‘Overflow Flag’, returning a boolean result, that must be False for the right proportion of fillers; in other words, if the user chooses to overflow by 20%, your flag should remove 20% of the fillers.

VI-A-4. Answer▲

I propose you to use the PERCENTILE function: to remove 20% of the fillers, you have to sort them on the identifier, then compute the identifier value under which there are 20% of the headcount… this is the very definition of the 20^th percentile.

If you are not familiar with this notion of percentile, here are some more details. The k^th percentile is the value below which there are k% of the population. In our example, there are 705 MEPs sorted according to their Id. The 20^th percentile is the Id that separates the 141 first MEPs from the following ones. The 141^th MEP is Evelyn Regner (Id# 96,998), and the next one is Guy Verhofstadt (ID# 97,058). Tableau computes the 20^th percentile as 97,046, and this value is actually between Regner and Verhofstadt.

Overflow Flag

Sélectionnez

[Group] <> 'White' OR [Id] >= {PERCENTILE([Id], [Overflow %])}

The percentile has to be computed on the whole dataset, like the median before, hence the shorthand LOD {}. Now, you just have to put this field on Filter, keep only True data, and you can test the slider:

In terms of visual display, overflowing beyond 180° moves the labels apart from each other, improving readability.

Everything works, however there is no proportion between the Overflow % and the angle you get. With an overflow of 50%, the user would probably expect a right angle, i.e. an arc of 270°. As you can see on the screenshot above, the arc is rather two thirds of a circle, i.e. 240°.

Is there a bug? With 705 MEPs and an overflow of 50%, you get 353 fillers (my flag above uses the >= operator). Hence, MEPs are 705 / (705 + 353), i.e. 66.6% of a circle. Our 240° stands the test of math! However, our system is counter-intuitive, and that’s never a good news in dataviz. In a perfect system, the user would choose his/her angle, then we would compute the matching percentage of fillers to remove; however, the PERCENTILE function does not support a calculated percentile number, but only a constant or a parameter. As long as we use the percentile method, we are stuck with our Overflow %. It does the job, but is hardly understandable for users.

For a better user experience, we will perform a cognitive trick: keep the same system, but hide the percentage and display the angle instead.

VI-A-5. Exercise: compute the angle from the percentage of overflow**▲

Define the formula to get the total angle according to Overflow %, then create a calculated field with it, Angle Obtained.

VI-A-6. Answer▲

First, let us formulate an equation, just on paper. Here are the variables:

x: our target angle, in degrees
n: the count of real MEPs
p: the percent of overflow chosen by user

With an overflow of 10%, we should display 100% of real MEPs and 90% of the fillers; so, in math, we have n(1 – p) fillers. The pie share of real MEPs will be n / (n + n(1 – p)). This ratio is also equal to the pie share occupied by our x angle, hence the equation:

      x / 360 = n / (n + n(1 – p))
?   x / 360 = 1 / (2 – p)
?   x = 360 / (2 – p)

Let us translate this into a calculated field:

Angle Obtained

Sélectionnez

360 / (2 - [Overflow %])

This result should not get aggregated, you can make it a dimension. It is expressed in degrees, without any decimal position, so a good number format would be 0°.

VI-A-7. Exercise: display the calculated angle▲

On the parameter card, keep the slider but hide the percentage, then find a way to display the Angle Obtained just below the parameter card.

VI-A-8. Answer▲

Hiding the percentage is easy, just Customize the parameter card to uncheck Show readout. Then, Edit the card title so there is no trace of any %.

Let us display the angle obtained: just right-click it and choose Show Filter, then Customize the new card and uncheck Show "All" Value. Last, the checkbox is a risk: if a user unchecks it, the whole chart disappears. You cannot remove it, but you can turn it into a dropdown list or a single-value list (radio button) that are much safer.

Eventually, move the filter card below the parameter card, so users understand the angle results from the slider.

Dataviz is a matter of communication: you have to both deliver the relevant information, the angle, and prevent it from getting polluted by a less relevant information like the percentage. Users can now pilot the angle with the slider, without bothering with the intermediate calculation.

VI-B. With Monte Carlo method▲

Now, let us do it the other way round: we want the user to choose his/her angle directly, and compute the proportion of fillers to filter out. As we cannot do this with a percentile, we will use a random selection!

You may find it surprising, but calculation algorithms based on random processes form a recognized method in statistics, under the funny nickname of Monte Carlo method. They are notably used in fluid mechanics or particle physics. In the archetypal example, you assess the area of a pond by randomly firing cannon balls within a land square including the pond: if you drown one ball out of three, then the pond area should be approximately one third of the land square!

VI-B-1. Exercise: create the new parameter*▲

Create a new parameter, Angle Desired, and set the necessary options.

VI-B-2. Answer▲

No specific difficulty here:

VI-B-3. Exercise: compute the proportion of fillers to remove*▲

On paper, define the formula to determine the percentage of fillers to filter out according to the angle chosen by the user. Then, in Tableau, implement it as a calculated field, Exclude %.

VI-B-4. Answer▲

Just re-use the same equation than before, and reverse it to express the percentage (p) as a function of the angle (x):

x = 360 / (2 – p) 2 – p = 360 / x p = 2 – 360 / x

Let us test this: if the user wants an arc of 240°, we will have to remove 2 – 360 / 240 = 0.5, i.e. 50% of fillers. This does match our previous experiments.

Here is the Tableau formula:

Exclude %

Sélectionnez

2 - 360 / [Angle Desired]

As usually, the result is a constant to convert into a dimension so we avoid aggregation.

VI-B-5. Exercise: filter randomly*▲

We can now compute how many fillers should be filtered out. As we cannot select them with a percentile, we will choose them randomly. This exercise could have been ‘find a way to draw lots in Tableau’, but that would be quite unfair, as the answer is secret, or more precisely undocumented: the RANDOM function, to be used without argument. It returns a random decimal number between zero and one.

I can now fairly ask you: create a calculated field, Monte Carlo Filter, returning False for the proportion of fillers to remove, True for the proportion of fillers to keep, and of course True also for reals MEPs.

VI-B-6. Answer▲

To remove e.g. 20% of fillers, you just have to filter on RANDOM value greater than 0.2. As the generated random number is between 0 and 1, it will be true in approximately 80% of the cases and false in approximately 20%.

Monte Carlo Filter

Sélectionnez

[Group] != 'White' OR RANDOM() > [Exclude %]

As you have probably noticed, first RANDOM does not show in the list of Tableau functions, and second you have to type it in entirely, without the help of word completion. Well, at least it is recognized as correct by the syntax validation. As usually, this calculated field should not get aggregated, so you can make it a dimension.

Why the hell RANDOM is still unofficial in Tableau? This mystery teases imagination. Has it been introduced surreptitiously by a stubborn developer overriding the veto from its outraged hierarchy? Would it be a shaky compromise between Monte Carlist and anti-Monte Carlist among the company management? Whatever the cause for its ambiguous status, the RANDOM function has proven very useful when you have to distribute data homogeneously, e.g. to stagger data points with the same coordinates so they do not overlap (as in this example jitterplot).

VI-B-7. Exercise: assemble the pieces*▲

Now you have all the elements. Go back to the worksheet you had duplicated before using percentiles, and make sure users can choose their angle and get the wished viz.

VI-B-8. Answer▲

You just have to Show the Angle Desired parameter, then drag Monte Carlo Filter on the Filter shelf, and choose True.

And that’s all, folks! Here is an example of the final result:

Drawing randomly implies some approximation. You can notice, on your own workbook or the screenshot above, that shares from both pies do not match exactly. As our median enforces symmetry, maybe it can pass for a stylistic effect?

VII. Conclusion▲

We used but one chart type, notoriously the simplest one, yet technical challenges were there. I myself can hardly believe I had to spend 68,000 characters to explain how to build up a half-pie! As a conclusion, here are the features we have used during this practical case:

Union query
row counter
calculated field
conditional syntaxes IF and CASE
sort order formula
dimension versus measures
discrete versus continuous
default properties (default aggregation, default number format)
NULLs propagation
median, percentile
LOD expression
sorting priority
alias
attribute, ATTR function
Table Calculation
addressing versus partitioning
INDEX, FIRST, LAST and LOOKUP functions
group
Dual Axis
double pie
parameter
filter formula
RANDOM function

I hope you have found this practical case interesting and entertaining. If you have a competitive mind, this case include 34 exercises for a total of 58 stars. Count your points, then you are welcome flaunting your score in the comments.

Les sources présentées sur cette page sont libres de droits et vous pouvez les utiliser à votre convenance. Par contre, la page de présentation constitue une œuvre intellectuelle protégée par les droits d'auteur. Copyright © 2023 Antoine Dinimant. Aucune reproduction, même partielle, ne peut être faite de ce site ni de l'ensemble de son contenu : textes, documents, images, etc. sans l'autorisation expresse de l'auteur. Sinon vous encourez selon la loi jusqu'à trois ans de prison et jusqu'à 300 000 € de dommages et intérêts.