public:on_uea_gig_ticket_price_and_ftse_100
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
public:on_uea_gig_ticket_price_and_ftse_100 [2019/11/18 02:22] – [Analysis] fangfufu | public:on_uea_gig_ticket_price_and_ftse_100 [2019/11/24 00:26] (current) – removed fangfufu | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====== On UEA Gig Ticket Price and FTSE 100 ====== | ||
- | <note tip> Please don't take any of the findings too seriously! </ | ||
- | |||
- | ===== Abstract ===== | ||
- | By analysing The Gig List [(gig_list> | ||
- | ===== Background ===== | ||
- | On Friday evening, I played Scrabble with AV. I threatened to write a program to solve Scrabble, because I was so bad at playing that game. Writing a Scrabble solver is a two part problem - first you need an anagram solver, second you need to search through the board to find the highest scoring solution. But then I discovered Scrabble solver on the Internet, which I gave me a massive boost, and I managed to defeat AV in one game. However, cheating makes the game so boring. Since the problem solving part of my brain was active, I was actively looking for problems to solve. | ||
- | |||
- | After AV has left, I arrived at UEA SU shop to get some snacks. I discovered a book named The Gig List, which contains the list of gigs played in UEA venue between 1966-02-22 to 2018-07-20. It cost £6. I immediate realised the historical significance of this book. It is a good tool for economics research I came up with the idea of investigating how the price of gigs changed over time. | ||
- | |||
- | ===== Objective ===== | ||
- | - Convert The Gig List from the paper format to Excel spreadsheet | ||
- | - Analyse the spreadsheet. | ||
- | - Disseminate findings from the analysis. | ||
- | |||
- | ===== Converting The Gig List from paper format to Excel spreadsheet ===== | ||
- | ==== From paper to image ==== | ||
- | Initially I planned to sacrifice my copy of the Gig List. I started by cutting off the spine of the book using a pair of scissors, and scan the pages using the departmental scanner, which has a document autofeeder. However, after 20% in, I realised that this task was too painful for my delicate hands. I then discovered that there is a digital version of the book. However the downloading of that document had been disabled. I had to use a third party tool. The downloaded PDF file does not include any of the textual information. Only images were downloaded. This meant that I still had to perform OCR on the resulting image. I suppose the good thing is that the quality of the input image for OCR is guaranteed, the bad thing is that I wasted my £6. | ||
- | |||
- | ==== From image to text ==== | ||
- | To convert the downloaded images to text, I ran the following command: | ||
- | <code bash> | ||
- | pdfimages Gig_list.pdf -all ./Gig_list | ||
- | for i in {000..088}; do tesseract Gig_list-$i.jpg Gig_list-$i -l eng -c preserve_interword_spaces=1 ; done | ||
- | cat jpg/*.txt | grep ^[[: | ||
- | cat Gig_list_semi_valid.txt|sed -r -e ' | ||
- | </ | ||
- | |||
- | === OCR === | ||
- | I tried various OCR engines, including '' | ||
- | Note that '' | ||
- | |||
- | === grep and sed === | ||
- | I agree that the way I used grep was particularly ugly. One regular expression search for strings started with number, then alphabets, the next regular expression searches for the letter '' | ||
- | |||
- | And, no I do not understand how to use sed at all. I basically copied and pasted the code from Stackoverflow. I basically decided to use '' | ||
- | |||
- | ==== From text format to Excel spreadsheet ==== | ||
- | Now a bit of manual processing is required - invalid lines needed to be removed. There weren' | ||
- | |||
- | Excel is pretty much GUI, it is pretty easy to use. So it needs no further explanation. However it should be noted that when plotting the chart, exponential trendline is required, as inflation is an exponential growth. | ||
- | |||
- | ==== Calculation in Excel ==== | ||
- | Initially I wasn't sure which model should be fit onto the ticket price. It certainly showed a curve. I then realised that exponential growth curve should be fitted, because inflation is an exponential growth. | ||
- | |||
- | After obtaining the equation for the best fit line, I thought about converting that into annual inflation rate by changing the base of log. However I realised that would be too complicated. So I opted to just substitute timestamp back into the equation of the best fit line. | ||
- | |||
- | The equation for the best fit line happened to be: | ||
- | $$ y= 0.0048e^{0.0002x}, | ||
- | where $x$ is the number of day has passed since 1900-01-01. (This is the way Excel' | ||
- | |||
- | It turns out that the average ticket price ticket to a UEA SU's gig has been increasing on average 7.57% every year. | ||
- | |||
- | ===== Analysis ===== | ||
- | It is clear that the inflation on UEA SU's average gig price is higher than the CPI inflation rate for most years [(inflation> | ||
- | | ||
- | |||
- | - UEA's social standing has increased over the years. Higher profile bands are willing to travel to Norwich to perform. | ||
- | - UEA SU has decided to picked more expensive bands to play at UEA. | ||
- | - The cost of making music has increased over the years. | ||
- | |||
- | Personally I believe the reason behind the price increase is 1 and 2. To ascertain the reason, we can either look at the financial health of the music industry as a whole, or look into UEA SU's financial account over the years. | ||
- | |||
- | However, it should be noted that since 1985, the average ticket price has gone up by 936%, while FTSE 100 has only gone up by 309% [(FTSE100> | ||
- | |||
- | However, we all know that profit does not necessarily link to revenue. It would be interesting to see if the profit created during those gigs have grown in at the same rate. | ||
- | |||
- | ===== Resources ===== | ||
- | The Excel spreadsheet in question can be found at {{: | ||
public/on_uea_gig_ticket_price_and_ftse_100.1574043772.txt.gz · Last modified: 2019/11/18 02:22 by fangfufu