Researchers Turn Data into Dynamic Demographics
Aside from showing off their travel, culinary and nightlife habits, users of the geolocated “check-in” service Foursquare could shed light on the character of a particular city and its neighborhoods.
Researchers at Carnegie Mellon University’s School of Computer Science say that instead of relying on stagnant, unyielding census and neighborhood zoning data to take the temperature of a given community, Foursquare check-in data can provide a much-needed layer of dynamic city life.
The researchers have developed an algorithm that takes the check-ins generated when Foursquare members visit participating businesses or venues, and clusters them based on a combination of the location of the venues and the groups of people who most often visit them. This information is then mapped to reveal a city’s Livehoods, a term coined by the SCS researchers.
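The article does not spell out the researchers' actual algorithm, so what follows is only a simplified sketch of the general idea: venues are grouped when they are both close together and visited by similar crowds. The sample check-ins, the `max_km` and `min_similarity` thresholds, and the choice of cosine similarity with a union-find merge are all assumptions made for illustration, not details from the study.

```python
import math
from collections import defaultdict

# Hypothetical check-ins: (user_id, venue_name, latitude, longitude).
CHECKINS = [
    ("u1", "Cafe A", 40.4500, -79.9350),
    ("u2", "Cafe A", 40.4500, -79.9350),
    ("u1", "Bar B", 40.4510, -79.9340),
    ("u2", "Bar B", 40.4510, -79.9340),
    ("u3", "Gallery C", 40.4505, -79.9345),
    ("u4", "Gallery C", 40.4505, -79.9345),
    ("u3", "Diner D", 40.4900, -79.9000),
    ("u4", "Diner D", 40.4900, -79.9000),
]


def cosine(a, b):
    """Cosine similarity between two {user_id: visit_count} dictionaries."""
    dot = sum(count * b.get(user, 0) for user, count in a.items())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def distance_km(p, q):
    """Approximate distance between two (lat, lon) points (equirectangular)."""
    lat1, lon1 = map(math.radians, p)
    lat2, lon2 = map(math.radians, q)
    x = (lon2 - lon1) * math.cos((lat1 + lat2) / 2)
    y = lat2 - lat1
    return 6371.0 * math.hypot(x, y)


def cluster_venues(checkins, max_km=1.0, min_similarity=0.5):
    """Group venues that are nearby AND share a similar set of visitors."""
    visitors = defaultdict(lambda: defaultdict(int))
    location = {}
    for user, venue, lat, lon in checkins:
        visitors[venue][user] += 1
        location[venue] = (lat, lon)

    venues = sorted(location)
    parent = {v: v for v in venues}

    def find(v):
        # Union-find root lookup with path halving.
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    # Merge every pair of venues that passes both thresholds.
    for i, a in enumerate(venues):
        for b in venues[i + 1:]:
            close = distance_km(location[a], location[b]) <= max_km
            similar = cosine(visitors[a], visitors[b]) >= min_similarity
            if close and similar:
                parent[find(a)] = find(b)

    groups = defaultdict(list)
    for v in venues:
        groups[find(v)].append(v)
    return sorted(groups.values())


print(cluster_venues(CHECKINS))
```

In this toy data, Gallery C sits between Cafe A and Bar B but draws a different crowd, so it lands in its own group, while Diner D shares Gallery C's visitors but is too far away to join it. Location alone or visitors alone would have grouped the venues differently.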
All of the Livehoods analysis is based on Foursquare check-ins that users have shared publicly via social networks such as Twitter. This dataset of 18 million check-ins includes user ID, time, latitude and longitude, and the name and category of the venue for each check-in.
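As a rough picture of what one record in such a dataset might look like, here is a minimal sketch. The field names follow the list above, but the `CheckIn` class, the `summarize` helper, and the sample values are hypothetical and are not taken from the researchers' data or from any Foursquare API.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class CheckIn:
    """One publicly shared check-in (field names are assumed)."""
    user_id: str
    time: datetime
    latitude: float
    longitude: float
    venue_name: str
    venue_category: str


def summarize(checkins):
    """Count check-ins, distinct users and distinct venues."""
    return {
        "check_ins": len(checkins),
        "users": len({c.user_id for c in checkins}),
        # A venue is identified here by its name plus coordinates (an assumption).
        "venues": len({(c.venue_name, c.latitude, c.longitude) for c in checkins}),
    }


# Made-up sample records for illustration only.
SAMPLE = [
    CheckIn("u1", datetime(2012, 1, 5, 9, 30), 40.4500, -79.9350, "Cafe A", "Coffee Shop"),
    CheckIn("u1", datetime(2012, 1, 6, 21, 0), 40.4510, -79.9340, "Bar B", "Bar"),
    CheckIn("u2", datetime(2012, 1, 6, 21, 15), 40.4510, -79.9340, "Bar B", "Bar"),
]

print(summarize(SAMPLE))
```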
“Our goal is to understand how cities work through the lens of social media,” said Justin Cranshaw, a Ph.D. student in SCS’s Institute for Software Research.
The researchers analyzed data from Foursquare, but the same computational techniques could be applied to several other databases of location information. The researchers are exploring applications to city planning, transportation and real estate development. Livehoods also could be useful for businesses developing marketing campaigns or for public health officials tracking the spread of disease.
For now, however, it’s being used to get a grip on the cultural and even class distinctions present in a community. For instance, in their study of Carnegie Mellon’s home in Pittsburgh, the researchers found that the Livehoods they identified sometimes spilled over existing neighborhood boundaries, or identified several communities within a neighborhood. The Pittsburgh analysis was based on 42,787 check-ins by 3,840 users at 5,349 venues.
For instance, “they found that the upscale neighborhood of Shadyside actually had two demographically distinct Livehoods: an older, staid community to the west and a younger, ‘indie’ community to the east. Moreover, the younger Livehood spilled over into East Liberty, a neighborhood that long suffered from decay but recently has seen some upscale development.”
And how does this match up to the class and cultural viewpoints of a human observer? Right on… “That makes sense to me,” observed a 24-year-old resident of eastern Shadyside, one of 27 Pittsburgh residents who were interviewed by researchers to validate the findings. “I think at one point it was more walled off and this was poor (East Liberty) and this was wealthy (Shadyside) and now there are nice places in East Liberty and there’s some more diversity in this area (eastern Shadyside).”
Speaking of class divides, the limitations of the research are themselves a worthwhile point of study. Foursquare users tend to be young, urban professionals with smartphones. Consequently, areas of cities with older, poorer populations are nearly blank in the Livehoods maps. That blankness is itself an indication of class makeup, and potentially valuable to someone seeking a new home or pricing real estate, for instance.
Maps for New York (first map above), San Francisco (just above) and Pittsburgh are available on the project website, http://livehoods.org/. The team has added voting for the next city to be “checked.”