Researchers Turn Data into Dynamic Demographics
Aside from showing off how their travel, culinary and nightlife habits, users of the geolocated “check-in” service Foursquare could shed light on the character of a particular city and its neighborhoods.
Researchers at Carnegie Mellon University’s School of Computer Science say that instead of relying on stagnant, unyielding census and neighborhood zoning data to take the temperature of a given community, Foursquare checkin data can provide the much –needed layer of dynamic city life.
The researchers have developed developed an algorithm that takes the check-ins generated when foursquare members visit participating businesses or venues, and clusters them based on a combination of the location of the venues and the groups of people who most often visit them. This information is then mapped to reveal a city’s Livehoods, a term coined by the SCS researchers.
All of the Livehoods analysis is based on foursquare check-ins that users have shared publicly via social networks such as Twitter. This dataset of 18 million check-ins includes user ID, time, latitude and longitude, and the name and category of the venue for each check-in.
“Our goal is to understand how cities work through the lens of social media,” said Justin Cranshaw, a Ph.D. student in SCS’s Institute for Software Research.
The researchers analyzed data from foursquare, but the same computational techniques could be applied to several other databases of location information. The researchers are exploring applications to city planning, transportation and real estate development. Livehoods also could be useful for businesses developing marketing campaigns or for public health officials tracking the spread of disease.
For now, however, it’s being used to get a grip in the cultural and even class distinctions present in a community. For instance, in their study of Carnegie Mellon’s home in Pittsburgh, the researchers found that the Livehoods they identified sometimes spilled over existing neighborhood boundaries, or identified several communities within a neighborhood. The Pittsburgh analysis was based on 42,787 check-ins by 3,840 users at 5,349 venues.
For instance, “they found that the upscale neighborhood of Shadyside actually had two demographically distinct Livehoods — an older, staid community to the west and a younger, “indie” community to the east. Moreover, the younger Livehood spilled over into East Liberty, a neighborhood that long suffered from decay but recently has seen some upscale development.”
And how does this match up to the class and cultural viewpoints of a human observer? Right on… “That makes sense to me,” observed a 24-year-old resident of eastern Shadyside, one of 27 Pittsburgh residents who were interviewed by researchers to validate the findings. “I think at one point it was more walled off and this was poor (East Liberty) and this was wealthy (Shadyside) and now there are nice places in East Liberty and there’s some more diversity in this area (eastern Shadyside).”
Speaking of class divides, the limitations of the research shine through as a viable point of study themselves. Foursquare users tend to be young, urban professionals with smartphones. Consequently, areas of cities with older, poorer populations are nearly blank in the Livehoods maps—an indication of the class makeup—something potentially valuable when seeking new dwellings or pricing real estate, for instance.
Maps for New York (first map above), San Francisco (just above) and Pittsburgh are available on the project website, http://livehoods.org/. The team has added voting for the next city to be “checked.”
March 1, 2017
- Tableau 10.2 Introduced
- Informatica Secure@Source Unveiled
- ClickFox Chooses Zoomdata for Big Data Visualization
February 28, 2017
- Arcadia Data Integrates With MapR Platform
- MapR Names Tom Fisher as Chief Technology Officer
- Narrative Science and Sisense Partner to Unleash BI Insights
- Bright Computing Announces Alliance With Curtiss-Wright
February 27, 2017
- Attunity Wins $1.8 Million Deal Integrating SAP Data With Hadoop Data Lake
- TrendMiner 2.0 Released
- Cloudera Announces Increased Support for Navistar
- 1010data and PeerIQ Announce New Data Partnership
February 24, 2017
- Dell EMC to Sponsor Leverage Big Data + EnterpriseHPC 2017 Summit
- Splunk Announces Fourth Quarter and Full Year 2017 Financial Results
- UCLA Anderson Launches New Master of Science in Business Analytics
February 23, 2017
- MapR Joins Susan G. Komen in the Mission to End Breast Cancer
- SlamData Secures $6.7M in Series A Funding
- Argyle Data Extends Predictive Analytics Offerings to Enterprise Datacenters
- Gartner Positions Yellowfin in Magic Quadrant for BI and Analytics Platforms
- Solix Partners With Taadin Data Technology for Big Data Transformation
- Zaloni Unveils Ingestion Factory
Most Read Features
- 9 Must-Have Skills to Land Top Big Data Jobs in 2015
- Exactly Once: Why It’s Such a Big Deal for Apache Kafka
- Why Anaconda’s Data Science Tent Is So Big–And Getting Bigger
- How Intelligent Machines Learn to Make Sense of the World
- Which Type of SSD is Best: SATA, SAS, or PCIe?
- How Spark Illuminates Deep Learning
- Spark Streaming: What Is It and Who’s Using It?
- Why Deep Learning, and Why Now
- Four Ways to Visualize Big Streaming Data Now
- Data Science Meets Behavioral Science
- More Features…
Most Read News In Brief
- Microsoft Surges in Gartner Quadrant with Power BI
- Wanted (Badly): Big Data Engineers
- Why Gartner Dropped Big Data Off the Hype Curve
- Demand, Salaries Grow for Data Scientists
- ‘Climbing’ Big Data With Virtual Reality
- Six Big Name Schools with Big Data Programs
- Google Launches Video Analytics Challenge
- Intel Unveils Deep Learning Framework for Spark
- Big Iron Warms Up to Big Data
- VoltDB Combines Streaming, Global Data
- More News In Brief…
Most Read This Just In
- Cloudera and Intel Speed Up Machine Learning Workloads With Apache Spark and Intel MKL Integration
- Hortonworks Introduces Enterprise Data Warehouse Optimization Solution
- Hortonworks Announces Financial Results for Fourth Quarter and Full Year 2016
- Unravel Data Now Certified on MapR Converged Data Platform
- MapR Introduces Converged Data Platform for Docker
- Syncsort Announces New Advancements to Big Data Integration Solution
- Intel Unveils Deep Learning Library for Apache Spark
- Wipro Builds Big Data Analytics-as-a-Service Platform on IBM Bluemix
- IBM and Hortonworks Announce HDP for Elastic Storage Server and Spectrum Scale
- Informatica Axon Introduced
- More This Just In…
March 1 - March 2Jakarta Indonesia
March 13 - March 16San Jose CA United States
March 14 - March 15New York NY United States
March 19 - March 21Ponte Vedra FL United States
April 11 - April 12Meydan Dubai United Arab Emirates
May 16 - May 17