Researchers Turn Data into Dynamic Demographics
Aside from showing off how their travel, culinary and nightlife habits, users of the geolocated “check-in” service Foursquare could shed light on the character of a particular city and its neighborhoods.
Researchers at Carnegie Mellon University’s School of Computer Science say that instead of relying on stagnant, unyielding census and neighborhood zoning data to take the temperature of a given community, Foursquare checkin data can provide the much –needed layer of dynamic city life.
The researchers have developed developed an algorithm that takes the check-ins generated when foursquare members visit participating businesses or venues, and clusters them based on a combination of the location of the venues and the groups of people who most often visit them. This information is then mapped to reveal a city’s Livehoods, a term coined by the SCS researchers.
All of the Livehoods analysis is based on foursquare check-ins that users have shared publicly via social networks such as Twitter. This dataset of 18 million check-ins includes user ID, time, latitude and longitude, and the name and category of the venue for each check-in.
“Our goal is to understand how cities work through the lens of social media,” said Justin Cranshaw, a Ph.D. student in SCS’s Institute for Software Research.
The researchers analyzed data from foursquare, but the same computational techniques could be applied to several other databases of location information. The researchers are exploring applications to city planning, transportation and real estate development. Livehoods also could be useful for businesses developing marketing campaigns or for public health officials tracking the spread of disease.
For now, however, it’s being used to get a grip in the cultural and even class distinctions present in a community. For instance, in their study of Carnegie Mellon’s home in Pittsburgh, the researchers found that the Livehoods they identified sometimes spilled over existing neighborhood boundaries, or identified several communities within a neighborhood. The Pittsburgh analysis was based on 42,787 check-ins by 3,840 users at 5,349 venues.
For instance, “they found that the upscale neighborhood of Shadyside actually had two demographically distinct Livehoods — an older, staid community to the west and a younger, “indie” community to the east. Moreover, the younger Livehood spilled over into East Liberty, a neighborhood that long suffered from decay but recently has seen some upscale development.”
And how does this match up to the class and cultural viewpoints of a human observer? Right on… “That makes sense to me,” observed a 24-year-old resident of eastern Shadyside, one of 27 Pittsburgh residents who were interviewed by researchers to validate the findings. “I think at one point it was more walled off and this was poor (East Liberty) and this was wealthy (Shadyside) and now there are nice places in East Liberty and there’s some more diversity in this area (eastern Shadyside).”
Speaking of class divides, the limitations of the research shine through as a viable point of study themselves. Foursquare users tend to be young, urban professionals with smartphones. Consequently, areas of cities with older, poorer populations are nearly blank in the Livehoods maps—an indication of the class makeup—something potentially valuable when seeking new dwellings or pricing real estate, for instance.
Maps for New York (first map above), San Francisco (just above) and Pittsburgh are available on the project website, http://livehoods.org/. The team has added voting for the next city to be “checked.”
March 27, 2015
March 26, 2015
March 25, 2015
- TCS Big Data Solutions Certified on Cloudera Enterprise
- Neo Technology Unveils Neo4j 2.2
- MongoDB and Expert System Partner
- Sysorex Announces Partnership with Cloudian
- Teradata University Network Builds Big Data Talent Pipeline
- Atos Launches Self-Service Hadoop Platform
March 24, 2015
- DDN Storage Assists TRIUMF
- Talend Launches Integration Cloud
- Exar’s AltraHD is Now Certified on Cloudera Enterprise 5
- Datawatch to Speak at Gartner Business Intelligence & Analytics Summit
March 23, 2015
- MicroStrategy Certifies Analytics Platform with Apache Drill
- EMC Unveils the Federation Business Data Lake
- Kenshoo Deploys MapR Distribution Including Hadoop
- AquaFold Releases Aqua Data Studio 16
March 20, 2015
March 19, 2015
- Qubole Extends Big Data-as-a-Service Platform With Redshift Connector
- Couchbase Continues European Growth
Most Read Features
- 9 Must-Have Skills to Land Top Big Data Jobs in 2015
- The 3 Key Steps to Building a Predictive App with Machine Learning
- FPGA System Smokes Spark on Streaming Analytics
- Apache Spark: 3 Real-World Use Cases
- Beyond the 3 Vs: Where Is Big Data Now?
- Apache Spark and Java 8: The Big Data Team for 2015
- A Virtual Reality Lens for Big Data Visualization
- Solr or Elasticsearch–That Is the Question
- How Startups Are Using Big Data Tech to Disrupt Markets
- Taming Apache Storm for Real-Time Analytics
- More Features…
Most Read News In Brief
- Six Big Name Schools with Big Data Programs
- Big Data Job Market Unexpectedly Cooled in 2014
- Fujitsu Adding Column-Oriented Processing Engine to PostgreSQL
- Pinterest Shoots ‘Pinball’ Into Open Source
- Apache Spark Ecosystem Continues To Build
- Survey: Mega-Vendors Still Dominate Database Management Market
- Where Does InfiniDB Go From Here?
- Tachyon Nexus Gets $7.5M to Productize Big Data File System
- Hadoop on a Raspberry Pi
- Splunk Goes ‘Light’ with Big Machine Data Analytics
- More News In Brief…
Most Read This Just In
- IBM Announces New Partnership with Carnegie Mellon University
- Attunity Acquires Appfluent
- SAP Releases New Version of Predictive Analytics Software
- Pitt, CMU, and UPMC Form Alliance to Transform Healthcare Through Big Data
- New Release of ParStream Analytics Platform Announced
- ClearStory Data Partners with Hortonworks
- Splunk Announces Fiscal Fourth Quarter and Full Year 2015 Financial Results
- Accenture Completes Acquisition of Gapso
- Analytics Hub to “Predict” General Election Outcome
- EMC Unveils the Federation Business Data Lake
- More This Just In…
Datanami Media Kit
March 30 - April 1Las Vegas NV United States
April 13 - April 16Santa Clara CA United States
April 26 - April 28Boston CA United States
April 28 - April 29San Jose CA United States
May 28 - May 29New York NY United States