Friday, 20 December 2013

Astrophysics Corner, Part 5 – Asteroids, Stars, and Konrath’s Book Sales – The Power Law


If you have read much astrophysics, you have probably come across the term “power law”.  Similarly if you have followed blogs on books and publishing, you have probably come across a phrase that goes something like “book sales follow a power law”.  What exactly does this mean, and what is this mysterious mathematical entity known as a power law?

For the purposes of this blog, I will try to avoid being very theoretical and keep equations to a minimum.  Most of the important understanding of this subject can be explained with graphs and words and that’s usually what a general audience prefers anyway.  Astrophysical (hi Scott!) and mathematical types are permitted a smile at the oversimplification of some of the explanations below.

So, what is a power law?  Simply put, it is a mathematical function which is used to describe a situation in which one variable varies as a power of another.  A common example of a power law, that most of us can recall from high school science, is the equation that describes the force of gravity, where the gravitation force between two masses varies as the inverse square of the distance between those masses.   This is an example of a power law that is underlain by a straightforward scientific theory, the theory of gravity.

Many phenomena in the physical world, especially the astrophysical world, appear to be modelled well by power laws, though the theoretical laws underpinning this are not always so obvious.  One example is the size of asteroids in the Asteroid Belt, as outlined below, showing the number of asteroids in each size range (the data is from Wiki):

Description (Radius)
Size Categ
Num Aster
0 - 0.49 km
1
29,000,000
0.5 - 1.99 km
2
2,750,000
2 - 7.99 km
3
290,000
8 - 31.99 km
4
11,100
32 - 127.99 km
5
800
128 km - Up
6
39
 
 
 

 

The asteroid data has also been graphed below, to show the typical shape of a power law, which will tend to be dominated by a few of the categories, with most of the cases falling into these ranges.  The second graph shows a nice feature of a power law, which is that if we graph it using a logarithmic scale, we will get something pretty close to a straight line.  Note that the vertical scale has been transformed into a logarithmic scale, with equal intervals for each time the graph axis goes up by a factor of ten.

In fact, we can use the slope of this straight line to estimate the actual value of the “power” in the power law.  We won’t bother with that here, as it gets a bit tricky and we just want to understand the main features of the power law at a graphical and qualitative level.
 

Let’s look at another set of astrophysical data, because it’s fun.  Dodecahedron Books does mostly science fiction, so we like stars and astrophysics.  This table shows the number of stars visible to the naked eye under good conditions (a dark sky, good seeing, and good eyesight), by brightness category (stellar magnitude, lower for brighter stars, and higher for dimmers stars, which happens to be the reverse of the asteroid size scale we used earlier).  This data is from wiki, though I am sure similar data can be found  in all kinds of astronomy textbooks.

 

Description
Stellar Magnitude
Num  of Visible Stars
Dimmest
5
1,602
Fifth brightest
4
513
Fourth brightest
3
171
Third brightest
2
48
Second brightest
1
15
Brightest
0
5

 

Again, here are the two graphs: raw data and logarithmically scaled.  Note the nice straight line on the bars of the latter graph, indicating a linear fit to the logarithmically transformed data, which (after a bit of algebra transforming back to the original data) indicates a power law (please note: blogger seems to be acting wonky, so I hope the graph shows up):
 
Power laws are frequently a good fit for phenomena in the domain of human culture and economics.  We often want to answer questions like “what is the likelihood of a book selling between 1000 and 5000 copies, given usual book sales statistics?”  Unfortunately, it is not always easy to get good data, but luckily for us, best-selling indie  writer Joe Konrath recently shared his sale statistics on his blog, so let’s see how that particular dataset conforms to a power law.   I binned his sales data into six ranges, so that we could compare that data with our asteroid and star data later.  Each bin size was 2.4 times the previous bin - that’s pretty nearly the same as how the magnitude scale for stars works.
Description
Category
Unique Book Titles in Each Sales Category
Fewest Sales (0-5000)
1
77
5001-12000
2
37
12001-28800
3
23
28801-69120
4
13
69121-165888
5
6
Highest (165889 and up)
6
2
 
 
 

So, Konrath’s sales data conform pretty well to a power law, though not quite as nicely as our asteroid  and star brightness data.  It looks like he writes more books than would be expected at the higher sales categories and fewer books at the lower sales categories - lucky him.  That’s not a surprise, given his reputation.  If we had a set of book sales from a random collection of writers, especially self-published writers, we would undoubtedly get a result that conforms much more closely to a power law than Joe’s does.
 I computed the R-square of the log transformed data for each dataset, to make a quantitative comparison.  The closer to 1 that the value is, the better the data fits a power law function.  You can also see this qualitatively if you inspect the various graphs, noting how closely the lines sit on the tops of the bars :
·         Asteroid  data R-square = .99
·         Star visibility data R-square = .99
·         Konrath book sales R-square = .92
Alternatively, we can compare the three datasets graphically, as below.  This shows the percentage of events falling into each of the six categories for the differing datasets.
It is clear that the asteroid line is much more weighted to the first category than the stars dataset or the Konrath book sales dataset.  Simply put, the smallest asteroids make up a much larger proportion of all asteroids than the dimmest stars make up a proportion of all visible stars, or that Konrath’s lowest selling titles make up a proportion of his overall titles.  So, as far as his power function is concerned,  Konrath is closer to the stars than he is to the asteroids.
 That’s also obvious in the second graph below, which plots the logarithm of the respective independent variables against their respective categories. The slope of the best-fit straight line is lower for the book sales, indicating a flatter power law.  You can also see that the book sales data deviate more from a straight line than do the data for the stars and asteroids.



So, what’s behind these power law functions?  That’s an active area of research in the various disciplines.
The asteroid data are undoubtedly a result of the complicated mechanisms that underlie the formation of the solar system from the primordial nebula and its subsequent evolution.  This will obviously include the inverse square gravitational law, but the mixing of particles, the resulting collisions and accretions, the interaction with magnetic fields and other factors must also come into play.  There are a lot of papers on the subject in the scientific literature.
The stars data is probably a combination of how brightness falls off as an inverse square law and the particular astrophysics and astrophysical history of the sun’s part of the galaxy.  The fact that the stellar magnitude scale is related to the human eye’s logarithmic response to light also plays a role.
The book data (along with much other human-related power law phenomena) is thought to be a manifestation of something called preferential attachment theory.  Basically, that’s a variation of the principles known as “the rich get richer”, “nothing succeeds like success” or “first mover advantage”.  But that’s another blog for another day.



 
 
 

No comments:

Post a Comment