Researchers Turn Data into Dynamic Demographics
Aside from showing off their travel, culinary and nightlife habits, users of the geolocated “check-in” service Foursquare could shed light on the character of a particular city and its neighborhoods.
Researchers at Carnegie Mellon University’s School of Computer Science say that instead of relying on stagnant, unyielding census and neighborhood zoning data to take the temperature of a given community, Foursquare check-in data can provide a much-needed layer of dynamic city life.
The researchers have developed an algorithm that takes the check-ins generated when Foursquare members visit participating businesses or venues and clusters them based on a combination of the location of the venues and the groups of people who most often visit them. This information is then mapped to reveal a city’s Livehoods, a term coined by the SCS researchers.
All of the Livehoods analysis is based on Foursquare check-ins that users have shared publicly via social networks such as Twitter. This dataset of 18 million check-ins includes the user ID, time, latitude and longitude, and the name and category of the venue for each check-in.
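The article does not spell out the clustering method, so the following is only a rough sketch of the idea described above, not the researchers' algorithm. It assumes a simplified stand-in: two venues are linked when they are geographically close and share visitors, and linked venues are merged into groups. The `CheckIn` field names, the `max_km` and `min_sim` thresholds, and the sample records are all hypothetical and chosen for illustration.

```python
import math
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class CheckIn:
    # One record with the fields the article lists (field names are assumed).
    user_id: str
    time: str
    lat: float
    lon: float
    venue: str
    category: str


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two points, in kilometres."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    return 2 * 6371.0 * math.asin(math.sqrt(a))


def cosine(u, v):
    """Cosine similarity between two sparse user-count vectors (dicts)."""
    dot = sum(n * v.get(user, 0) for user, n in u.items())
    norm = math.sqrt(sum(n * n for n in u.values())) * math.sqrt(sum(n * n for n in v.values()))
    return dot / norm if norm else 0.0


def cluster_venues(checkins, max_km=1.0, min_sim=0.3):
    """Group venues that are both close together and visited by similar people.

    max_km and min_sim are illustrative thresholds, not values from the study.
    """
    visitors = defaultdict(lambda: defaultdict(int))  # venue -> user -> visit count
    coords = {}
    for c in checkins:
        visitors[c.venue][c.user_id] += 1
        coords[c.venue] = (c.lat, c.lon)

    venues = sorted(coords)
    parent = {v: v for v in venues}  # union-find forest over venues

    def find(v):
        while parent[v] != v:
            parent[v] = parent[parent[v]]  # path halving
            v = parent[v]
        return v

    # Link every pair of venues that passes both the distance and visitor tests.
    for i, a in enumerate(venues):
        for b in venues[i + 1:]:
            near = haversine_km(*coords[a], *coords[b]) <= max_km
            if near and cosine(visitors[a], visitors[b]) >= min_sim:
                parent[find(a)] = find(b)

    groups = defaultdict(list)
    for v in venues:
        groups[find(v)].append(v)
    return sorted(groups.values())


# Made-up sample: three venues on the same block, two distinct crowds.
sample = [
    CheckIn("u1", "2012-04-01T09:00", 40.4520, -79.9330, "Cafe A", "Coffee Shop"),
    CheckIn("u1", "2012-04-01T10:00", 40.4525, -79.9335, "Bookstore B", "Bookstore"),
    CheckIn("u2", "2012-04-02T09:30", 40.4520, -79.9330, "Cafe A", "Coffee Shop"),
    CheckIn("u2", "2012-04-02T11:00", 40.4525, -79.9335, "Bookstore B", "Bookstore"),
    CheckIn("u3", "2012-04-03T19:00", 40.4522, -79.9332, "Steakhouse C", "Restaurant"),
    CheckIn("u4", "2012-04-04T20:00", 40.4522, -79.9332, "Steakhouse C", "Restaurant"),
]

livehoods = cluster_venues(sample)
print(livehoods)  # [['Bookstore B', 'Cafe A'], ['Steakhouse C']]
```

In this toy example all three venues sit within a few dozen meters of one another, yet the steakhouse lands in its own group because none of its visitors go to the other two. That is the kind of split a purely geographic boundary would miss.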
“Our goal is to understand how cities work through the lens of social media,” said Justin Cranshaw, a Ph.D. student in SCS’s Institute for Software Research.
The researchers analyzed data from Foursquare, but the same computational techniques could be applied to several other databases of location information. The researchers are exploring applications to city planning, transportation and real estate development. Livehoods also could be useful for businesses developing marketing campaigns or for public health officials tracking the spread of disease.
For now, however, it’s being used to get a grip on the cultural and even class distinctions present in a community. In their study of Pittsburgh, Carnegie Mellon’s home, the researchers found that the Livehoods they identified sometimes spilled over existing neighborhood boundaries, or identified several communities within a neighborhood. The Pittsburgh analysis was based on 42,787 check-ins by 3,840 users at 5,349 venues.
For instance, “they found that the upscale neighborhood of Shadyside actually had two demographically distinct Livehoods: an older, staid community to the west and a younger, ‘indie’ community to the east. Moreover, the younger Livehood spilled over into East Liberty, a neighborhood that long suffered from decay but recently has seen some upscale development.”
And how does this match up to the class and cultural viewpoints of a human observer? Right on… “That makes sense to me,” observed a 24-year-old resident of eastern Shadyside, one of 27 Pittsburgh residents who were interviewed by researchers to validate the findings. “I think at one point it was more walled off and this was poor (East Liberty) and this was wealthy (Shadyside) and now there are nice places in East Liberty and there’s some more diversity in this area (eastern Shadyside).”
Speaking of class divides, the limitations of the research are themselves a viable point of study. Foursquare users tend to be young, urban professionals with smartphones. Consequently, areas of cities with older, poorer populations are nearly blank in the Livehoods maps. That blankness is itself an indication of class makeup, something potentially valuable when seeking new dwellings or pricing real estate, for instance.
Maps for New York (first map above), San Francisco (just above) and Pittsburgh are available on the project website, http://livehoods.org/. The team has added voting for the next city to be “checked.”