Researchers Turn Data into Dynamic Demographics
Aside from showing off how their travel, culinary and nightlife habits, users of the geolocated “check-in” service Foursquare could shed light on the character of a particular city and its neighborhoods.
Researchers at Carnegie Mellon University’s School of Computer Science say that instead of relying on stagnant, unyielding census and neighborhood zoning data to take the temperature of a given community, Foursquare checkin data can provide the much –needed layer of dynamic city life.
The researchers have developed developed an algorithm that takes the check-ins generated when foursquare members visit participating businesses or venues, and clusters them based on a combination of the location of the venues and the groups of people who most often visit them. This information is then mapped to reveal a city’s Livehoods, a term coined by the SCS researchers.
All of the Livehoods analysis is based on foursquare check-ins that users have shared publicly via social networks such as Twitter. This dataset of 18 million check-ins includes user ID, time, latitude and longitude, and the name and category of the venue for each check-in.
“Our goal is to understand how cities work through the lens of social media,” said Justin Cranshaw, a Ph.D. student in SCS’s Institute for Software Research.
The researchers analyzed data from foursquare, but the same computational techniques could be applied to several other databases of location information. The researchers are exploring applications to city planning, transportation and real estate development. Livehoods also could be useful for businesses developing marketing campaigns or for public health officials tracking the spread of disease.
For now, however, it’s being used to get a grip in the cultural and even class distinctions present in a community. For instance, in their study of Carnegie Mellon’s home in Pittsburgh, the researchers found that the Livehoods they identified sometimes spilled over existing neighborhood boundaries, or identified several communities within a neighborhood. The Pittsburgh analysis was based on 42,787 check-ins by 3,840 users at 5,349 venues.
For instance, “they found that the upscale neighborhood of Shadyside actually had two demographically distinct Livehoods — an older, staid community to the west and a younger, “indie” community to the east. Moreover, the younger Livehood spilled over into East Liberty, a neighborhood that long suffered from decay but recently has seen some upscale development.”
And how does this match up to the class and cultural viewpoints of a human observer? Right on… “That makes sense to me,” observed a 24-year-old resident of eastern Shadyside, one of 27 Pittsburgh residents who were interviewed by researchers to validate the findings. “I think at one point it was more walled off and this was poor (East Liberty) and this was wealthy (Shadyside) and now there are nice places in East Liberty and there’s some more diversity in this area (eastern Shadyside).”
Speaking of class divides, the limitations of the research shine through as a viable point of study themselves. Foursquare users tend to be young, urban professionals with smartphones. Consequently, areas of cities with older, poorer populations are nearly blank in the Livehoods maps—an indication of the class makeup—something potentially valuable when seeking new dwellings or pricing real estate, for instance.
Maps for New York (first map above), San Francisco (just above) and Pittsburgh are available on the project website, http://livehoods.org/. The team has added voting for the next city to be “checked.”
July 29, 2014
- Dialogue Moves to Cloud-Based Business Intelligence with Birst
- MapR to Receive Big Data Competency Status by AWS
- Actian Introduces Big Data 2.0 Clear Path Program
- SAS Analytics Uncovers New Heart Attack Treatments That Extend Survival Rates
July 28, 2014
July 24, 2014
- Zettaset Simplifies Big Data Security and Management for the Enterprise
- EnterpriseDB Extends Postgres’ Big Data Capacity With New MongoDB, Hadoop Extensions
- Datawatch Announces Third Quarter Financial Results
- Enlightiks Develops Analytics Platform with Tableau at its Core
July 23, 2014
- Trifacta Appoints New CEO
- InfiniDB Offering New Accelerator Program
- Australian Genome Research Facility Selects Brocade to Handle Big Data Growth
July 22, 2014
- MapR Partners with TCS
- InfiniDB Expands Global Partner Program
- Apache Tez Becomes Top-Level Project
- DDN Joins Global Alliance for Genomics and Health
- Registration Open for Rock Stars of Big Data Analytics Conference
- Teradata Announces Two New Acquisitions
- Corvil Introduces New Streaming Analytics Platform
- Logi Analytics and Actian Partner
Most Read Features
- Google Re-Imagines MapReduce, Launches DataFlow
- Inside Sibyl, Google’s Massively Parallel Machine Learning Platform
- Apache Spark: 3 Real-World Use Cases
- Where Does Spark Go From Here?
- Can You Trust Your Algorithms?
- How Hadoop is Remaking Travel and Expense Reporting at Concur
- Streaming Analytics Ready for Prime Time, Forrester Says
- When to Hadoop, and When Not To
- How T-Mobile Got More from Hadoop
- Databricks Takes Apache Spark to the Cloud, Nabs $33M
- More Features…
Most Read News In Brief
- Six Big Name Schools with Big Data Programs
- Hadoop on a Raspberry Pi
- GraphLabs Wises Up Machine Learning Platform
- Big Data Forcing Update of SQL Standard
- See Spark Run on NoSQL, DataStax Says
- MapR Announces $110M Investment Led by Google
- Hadoop and NoSQL Now Data Warehouse-Worthy: Gartner
- Oracle Aims to Break Big Data Silos with SQL
- Navy Launches Big Data ‘Ecosystem’ Effort
- Big Data Helps Drive Transportation Planning
- More News In Brief…
Most Read This Just In
- SAP Introduces New Big Data Initiatives
- Accenture and Hortonworks Join Forces to Help Businesses Manage Big Data
- Guavus Unveils Reflex 2.0
- IBM Announces $3 Billion Research Initiative
- Cloudera, Databricks, IBM, Intel, and MapR Collaborate
- Guavus Announces Platform Update
- Teradata Introduces Aster R
- Databricks to Deliver Spark Distribution Offering for SAP HANA Platform
- Alteryx and Databricks to Lead Development of SparkR for Hadoop
- T-Systems to Hold First Big Data Challenge
- More This Just In…
October 1 - 2Heidelberg Germany
October 8 - 9Royal Victoria Dock London United Kingdom
October 15 - 17New York United States
October 27 - 29Burlingame
November 4 - 6Santa Clara CA United States