Researchers Turn Data into Dynamic Demographics
Aside from showing off their travel, culinary and nightlife habits, users of the geolocated “check-in” service Foursquare could shed light on the character of a particular city and its neighborhoods.
Researchers at Carnegie Mellon University’s School of Computer Science (SCS) say that instead of relying on static census and neighborhood zoning data to take the temperature of a given community, Foursquare check-in data can provide a much-needed layer of dynamic city life.
The researchers have developed an algorithm that takes the check-ins generated when Foursquare members visit participating businesses or venues, and clusters them based on a combination of the location of the venues and the groups of people who most often visit them. This information is then mapped to reveal a city’s Livehoods, a term coined by the SCS researchers.
All of the Livehoods analysis is based on Foursquare check-ins that users have shared publicly via social networks such as Twitter. This dataset of 18 million check-ins includes the user ID, time, latitude and longitude, and the name and category of the venue for each check-in.
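To make the idea of clustering venues by both location and visitors concrete, here is a minimal Python sketch. It is not the researchers' algorithm, whose details the article does not give. It swaps in a deliberately simple stand-in: two venues are linked when they are geographically close and share a similar crowd, and linked venues are merged into groups. The `CheckIn` fields mirror some of the dataset fields described above, while the function names, the thresholds (`max_km`, `min_similarity`) and the sample venues are all hypothetical.

```python
import math
from collections import Counter, defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class CheckIn:
    # Subset of the fields the article lists; names are illustrative.
    user_id: str
    venue: str
    lat: float
    lon: float


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two points, in kilometres."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * 6371.0 * math.asin(math.sqrt(a))


def cosine(a, b):
    """Cosine similarity between two visitor-count vectors (Counters)."""
    dot = sum(a[u] * b[u] for u in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster_venues(checkins, max_km=1.0, min_similarity=0.3):
    """Group venues that are both nearby and visited by similar people.

    Assumed stand-in technique: threshold the two signals, then take
    connected components with a small union-find.
    """
    visitors = defaultdict(Counter)  # venue -> how often each user checked in
    location = {}                    # venue -> (lat, lon)
    for c in checkins:
        visitors[c.venue][c.user_id] += 1
        location[c.venue] = (c.lat, c.lon)

    venues = sorted(visitors)
    parent = {v: v for v in venues}

    def find(v):
        while parent[v] != v:
            parent[v] = parent[parent[v]]  # path halving
            v = parent[v]
        return v

    for i, a in enumerate(venues):
        for b in venues[i + 1:]:
            near = haversine_km(*location[a], *location[b]) <= max_km
            if near and cosine(visitors[a], visitors[b]) >= min_similarity:
                parent[find(a)] = find(b)

    groups = defaultdict(list)
    for v in venues:
        groups[find(v)].append(v)
    return sorted(groups.values())


# Made-up sample data: two nearby venues with the same crowd, one nearby
# venue with a different crowd, and one distant venue.
sample = [
    CheckIn("u1", "cafe_a", 40.4520, -79.9330),
    CheckIn("u2", "cafe_a", 40.4520, -79.9330),
    CheckIn("u1", "bar_b", 40.4525, -79.9320),
    CheckIn("u2", "bar_b", 40.4525, -79.9320),
    CheckIn("u3", "diner_c", 40.4522, -79.9325),
    CheckIn("u1", "shop_d", 40.6000, -79.5000),
]
print(cluster_venues(sample))
```

In this toy data, the cafe and the bar end up in one group because the same two users visit both and the venues sit about a block apart. The diner is just as close but draws a different visitor, and the shop shares a visitor but is far away, so each stays on its own. That is the intuition behind a group of venues that need not follow official neighborhood lines.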
“Our goal is to understand how cities work through the lens of social media,” said Justin Cranshaw, a Ph.D. student in SCS’s Institute for Software Research.
The researchers analyzed data from Foursquare, but the same computational techniques could be applied to several other databases of location information. The researchers are exploring applications to city planning, transportation and real estate development. Livehoods also could be useful for businesses developing marketing campaigns or for public health officials tracking the spread of disease.
For now, however, it’s being used to get a grip on the cultural and even class distinctions present in a community. For instance, in their study of Carnegie Mellon’s home in Pittsburgh, the researchers found that the Livehoods they identified sometimes spilled over existing neighborhood boundaries, or identified several communities within a neighborhood. The Pittsburgh analysis was based on 42,787 check-ins by 3,840 users at 5,349 venues.
In one case, “they found that the upscale neighborhood of Shadyside actually had two demographically distinct Livehoods: an older, staid community to the west and a younger, ‘indie’ community to the east. Moreover, the younger Livehood spilled over into East Liberty, a neighborhood that long suffered from decay but recently has seen some upscale development.”
And how does this match up to the class and cultural viewpoints of a human observer? Right on… “That makes sense to me,” observed a 24-year-old resident of eastern Shadyside, one of 27 Pittsburgh residents who were interviewed by researchers to validate the findings. “I think at one point it was more walled off and this was poor (East Liberty) and this was wealthy (Shadyside) and now there are nice places in East Liberty and there’s some more diversity in this area (eastern Shadyside).”
Speaking of class divides, the limitations of the research are themselves a viable point of study. Foursquare users tend to be young, urban professionals with smartphones. Consequently, areas of cities with older, poorer populations are nearly blank in the Livehoods maps. That blankness is itself an indication of class makeup, something potentially valuable when seeking new dwellings or pricing real estate, for instance.
Maps for New York (first map above), San Francisco (just above) and Pittsburgh are available on the project website, http://livehoods.org/. The team has added voting for the next city to be “checked.”