My colleague James Cheshire’s research deals with understanding and classifying spatial patterns in surnames. He has been able to show, through various techniques, that there exists in the UK a regional geography of surnames. This in mind, I thought I’d interogate my database of NHS patient registrations for Southwark and see what was going on in surname terms there. This first table shows the top 20 most popular surnames in Southwark, ranked by occurance.
Unsurprisingly perhaps, the top places are dominated by surnames native to the UK, classically Smith, Williams, Jones etc. However, in line with Southwark’s reputation as a diverse borough and in light of it’s high inmigration figures, it is also clear that of these top 20 surnames some of them would be connected to inmigrant names: Kamara, Ahmed, Ali, Patel and Khan are all surnames that are increasingly associated with a previous period of migration to the UK. Interestingly the Vietnamese population is very small, less than 1% of the population of Southwark, but around 23% of these have the surname ‘Nguyen’. The ethnicity of the surnames is derived from Onomap.
The frequency distribution of Southwark surnames looks like this:
Note the characteristic long tail, there are a huge number of unique, or almost unique surnames, and considerably fewer surnames which are possessed by a large number of people. Such a distribution seems to obey a power law of some sort.
We can dig deeper into this phenomenon by looking at the number of surnames that comprise a given percentage of the population:
As we can see from the above figure, only 56 names account for 10% of the Southwark Population, but that in total there are 88,124 distinct surnames in Southwark. Again there is a characteristic decay to the curve.
Finally, let us consider just the charactersitics of the long-tail of the distribution:
From figure 4 it is clear that almost 25% of the Southwark population have a surname that is share by fewer that 11 people, indeed just over 16% of the Southwark population have a surname unique to the Southwark patient register. The shape of the curve in figure 4 demonstrate the effect of the long tail seen in figure 2.
For more information on surnames research check out James Cheshire’s blog, working paper or Pablo Mateos’ working paper.




~ End Article and Begin Conversation ~
There are no comments yet...
~ Now It's Your Turn ~