Conference Program: SIGMOD Sessions
This page describes the complete SIGMOD Conference program. Please use the following links to skip to the sessions of interest:
- Keynote Talks
- Panel
- Research Sessions
- Tutorials
- Industry Sessions
- Demonstrations
- Career in Industry
- New Researcher Symposium
- SIGMOD/PODS Poster Sessions
SIGMOD KEYNOTE TALKS
Keynote Talks
Keynote 1 - From Data to Insights @ Bare Metal Speed
Jignesh Patel
Tuesday, 8:20-10:00
Location: Plenary 1
Session Chair: Susan Davidson
Keynote 2 - The Power Behind the Throne: Information Integration in the Age of Data-Driven Discovery
Laura Haas
Wednesday, 8:20-10:00
Location: Plenary 1
Session Chair: Zack Ives
ACM-W Athena Lecturer Award - Three Favorite Results
Jennifer Widom
Tuesday, 17:30-18:15
Location: Plenary 1
Session Chair: Magda Balazinska
SIGMOD PANEL
Machine Learning and Databases: The Sound of Things to Come or a Cacophony of Hype?
Tuesday, 13:30 - 15:10
Location: MR106
Panel Chair: Christopher Re (Stanford University)
Presenters: Divyakant Agrawal, Magdalena Balazinska, Michael Cafarella, Michael Jordan, Tim Kraska, and Raghu Ramakrishnan
SIGMOD RESEARCH SESSIONS
Research Session 1 - Cloud: Parallel Execution
Tuesday, 10:30-12:10
Location: MR106
Session Chair: Yanlei Diao
- Distributed Outlier Detection using Compressive Sensing
- Locality-aware Partitioning Design in Parallel Database Systems
- ByteSlice: Pushing the Envelop of Main Memory Data Processing with a New Storage Layout
- Implicit Parallelism through Deep Language Embedding
- From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System
Research Session 2 - Matrix and Array Computations
Tuesday, 10:30-12:10
Location: MR101&102
Session Chair: Florin Rusu
- sPCA: Scalable Principal Component Analysis for Big Data
- Exploiting matrix dependency for efficient distributed matrix computation
- LEMP: Fast Retrieval of Large Entries in a Matrix Product
- Skew-Aware Join Optimization for Array Databases
- Resource Elasticity for Large-Scale Machine Learning
Research Session 3 - Security and Access Control
Tuesday, 10:30-12:10
Location: MR104
Session Chair: Lei Chen
- SEMROD: Secure and Efficient MapReduce over HybriD Clouds
- Authenticated Online Data Integration Services
- ENKI: Access Control for Encrypted Query Processing
- Collaborative Access Control in WebdamLog
- Automatic Enforcement of Data Use Policies with DataLawyer
Research Session 4 - Cloud: Fault Tolerance, Reconfiguration
Tuesday, 13:30-15:10
Location: MR101&102
Session Chair: Alexandros Labrinidis
- Cost-based Fault-tolerance for Parallel Data Processing
- Squall: Fine-Grained Live Reconfiguration for Partitioned Main Memory Databases
- Madeus: A Database Live Migration Middleware under Heavy Workloads for Cloud Environment
- Lineage-driven Fault Injection
Research Session 5 - Keyword Search and Text
Tuesday, 13:30-15:10
Location: MR103
Session Chair: K. Selcuk Candan
- Diversity-Aware Top-K Publish/Subscribe for Text Stream
- Diverse and Proportional Size-l Object Summaries for Keyword Search
- Local Filtering: Improving Performance of Approximate Queries on String Collections
- Exact Top-k Nearest Keyword Search on Large Networks
- Efficient Algorithms for Answering the m-Closest Keywords Query
Research Session 6 - Graph Primitives
Tuesday, 15:40-17:20
Location: MR101&102
Session Chair: Ke Yi
- Minimum Spanning Trees in Temporal Graphs
- Efficient Enumeration of Maximal k-Plexes
- Divide & Conquer: I/O Efficient Depth-First Search
- Index-based Optimal Algorithms for Computing Steiner Components with Maximum Connectivity
Research Session 7 - Data Mining
Tuesday, 15:40-17:20
Location: MR105
Session Chair: Julia Stoyanovich
- COMMIT: A Scalable Approach to Mining Communication Motifs from Dynamic Networks
- LASH: Large-Scale Sequence Mining with Hierarchies
- Twister Tries: Approximate Hierarchical Agglomerative Clustering for Average Distance in Linear Time
- DBSCAN Revisited: Mis-Claim, Un-Fixability, and Approximation
- The TagAdvisor: Luring the Lurkers to Review Web Items
Research Session 8 - Uncertainty and Linking
Tuesday, 15:40-17:20
Location: MR104
Session Chair: Pierre Senellart
- Supporting Data Uncertainty in Array Databases
- Identifying the Extent of Completeness of Query Answers over Partially Complete Databases
- k-Hit Query: Top-k Query with Probabilistic Utility Function
- Linking Temporal Records for Profiling Entities
Research Session 9 - Transactional Architectures
Wednesday, 10:30-12:10
Location: MR103
Session Chair: Feifei Li
- On the Design and Scalability of Distributed Shared-Data Databases
- Fast Serializable Multi-Version Concurrency Control for Main-Memory Database Systems
- FOEDUS: OLTP Engine for a Thousand Cores and NVRAM
- Let’s Talk About Storage & Recovery Methods for Non-Volatile Memory Database Systems
Research Session 10 - Privacy
Wednesday, 10:30-12:10
Location: MR104
Session Chair: Wang-Chiew Tan
- Private Release of Graph Statistics using Ladder Functions
- Bayesian Differential Privacy on Correlated Data
- Modular Order Preserving Encryption, Revisited
- Chiaroscuro: Transparency and Privacy for Massive Personal Time Series Clustering
Research Session 11 - Streams
Wednesday, 10:30-12:10
Location: MR101&102
Session Chair: Jorge Quiané-Ruiz
- Persistent Data Sketching
- Scalable Distributed Stream Join Processing
- SCREEN: Stream Data Cleaning under Speed Constraints
- Location-Aware Pub/Sub System: When Continuous Moving Queries Meet Dynamic Event Streams
Research Session 12 - Spatial data
Wednesday, 15:20-17:00
Location: MR103
Session Chair: Cyrus Shahabi
- Optimal Spatial Dominance: An effective search of Nearest Neighbor Candidates
- THERMAL-JOIN: Scalable Spatial Join for Dynamic Workloads
- Indexing Metric Uncertain Data for Range Queries
- Efficient Route Planning on Public Transportation Networks: A Labelling Approach
Research Session 13 - Crowdsourcing
Wednesday, 15:20-17:00
Location: MR104
Session Chair: Dimitrios Tsoumakos
- The Importance of Being Expert: Efficient Max-Finding in Crowdsourcing
- Minimizing Efforts in Validating Crowd Answers
- iCrowd: An Adaptive Crowdsourcing Framework
- QASCA: A Quality-Aware Task Assignment System for Crowdsourcing Applications
- tDP: An Optimal-Latency Budget Allocation strategy for Crowdsourced MAXIMUM operations
Research Session 14 - Indexing & Performance
Wednesday, 17:10-18:50
Location: MR101&102
Session Chair: Yufei Tao
- Cache-Efficient Aggregation: Hashing is Sorting
- Efficient Similarity Join and Search on Multi-Attribute Data
- Holistic Indexing in Main-memory Column-stores
- CliffGuard: A Principled Framework for Finding Robust Database Designs
- Exploiting Correlations for Queries with Expensive Predicates
Research Session 15 - Data Cleaning
Wednesday, 17:10-18:50
Location: MR103
Session Chair: Mohamad Sharaf
- Query-Oriented Data Cleaning with Oracles
- BigDansing: A System for Big Data Cleansing
- Data X-Ray: A diagnostic tool for data errors
- KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing
- Crowd-based Deduplication: an Adaptive Approach
Research Session 16 - Transactions
Wednesday, 17:10-18:50
Location: MR104
Session Chair: Ashraf Aboulnaga
- Minimizing Commit Latency of Transactions in Geo-Replicated Data Stores
- Optimizing Optimistic Concurrency Control for Tree-Structured, Log-Structured Databases
- The Homeostasis Protocol: Avoiding Transaction Coordination Through Program Analysis
- Feral Concurrency Control: An Empirical Investigation of Modern Application Integrity
Research Session 17 - Hardware-Aware Query Processing
Thursday, 10:30-12:10
Location: MR103
Session Chair: Spyros Blanas
- Rack-Scale Join Processing using RDMA
- Self-Tuning, GPU-Accelerated Kernel Density Models for Multidimensional Selectivity Estimation
- Rethinking SIMD Vectorization for In-Memory Databases
- A Padded Encoding Scheme to Accelerate Scans by Leveraging Skew
Research Session 18 - Graph Propagation, Influence, Mining
Thursday, 10:30-12:10
Location: MR101&102
Session Chair: Panagiotis Karras
- GetReal: Towards Realistic Selection of Influence Maximization Strategies in Competitive Networks
- Influence Maximization in Near-Linear Time: A Martingale Approach
- Community Level Diffusion Extraction
- BEAR: Block Elimination Approach for Random Walk with Restart on Large Graphs
- The Minimum Wiener Connector Problem
Research Session 19 - Social Networks
Thursday, 10:30-12:10
Location: MR104
Session Chair: Chengfei Liu
- From Group Recommendations to Group Formation
- Real-Time Multi-Criteria Social Graph Partitioning: A Game Theoretic Approach
- Utility-Aware Social Event-Participant Planning
- Online Video Recommendation in Sharing Community
Research Session 20 - Information Extraction and Record Linking
Thursday, 13:30-15:10
Location: MR103
Session Chair: Guoliang Li
- TEGRA: Table Extraction by Global Record Alignment
- Mining Quality Phrases from Massive Text Corpora
- Mining Subjective Properties on the Web
- Microblog Entity Linking with Social Temporal Context
Research Session 21 - RDF and SPARQL
Thursday, 13:30-15:10
Location: MR101&102
Session Chair: Nikos Mamoulis
- Graph-Aware, Workload-Adaptive SPARQL Query Caching
- Left Bit Right: For SPARQL Join Queries with OPTIONAL Patterns (Left-outer-joins)
- How to Build Templates for RDF Question/Answering --An Uncertain Graph Similarity Join Approach
- RBench: Application-Specific RDF Benchmarking
- ALEX: Automatic Link Exploration in Linked Data
Research Session 22 - Time Series & Graph Processing
Thursday, 13:30-15:10
Location: MR104
Session Chair: Alan Fekete
- k-Shape: Efficient and Accurate Clustering of Time Series
- SMiLer: A Semi-Lazy Time Series Prediction System for Sensors
- SQLGraph: An Efficient Relational-Based Property Graph Store
- Updating Graph Indices with a One-Pass Algorithm
Research Session 23 - Advanced Query Processing
Thursday, 15:40-17:20
Location: MR103
Session Chair: Sharad Mehrotra
- An Incremental Anytime Algorithm for Multi-Objective Query Optimization
- Output-sensitive Evaluation of Prioritized Skyline Queries
- Learning Generalized Linear Models Over Normalized Data
- Utilizing IDs to Accelerate Incremental View Maintenance
Research Session 24 - New Models
Thursday, 15:40-17:20
Location: MR104
Session Chair: Jiaheng Lu
- Top-k Spreadsheet-Style Search for Query Discovery
- Proactive Annotation Management in Relational Databases
- Weighted Coverage based Reviewer Assignment
- Distributed Online Tracking
SIGMOD TUTORIAL SESSIONS
- Tutorial 1 - Overview of Data Exploration Techniques
Tuesday, 10:30-12:10
Tuesday, 13:30-15:10
Location: MR105
- Tutorial 2 - Mining and Forecasting of Big Time-series Data
Wednesday, 10:30-12:10
Wednesday, 15:20-17:00
Location: MR105
- Tutorial 3 - Data management in non-volatile memory
Thursday, 10:30-12:10
Thursday, 13:30-15:10
Location: Plenary 1
- Tutorial 4 - Knowledge Curation and Knowledge Fusion: Challenges, Models, and Applications
Thursday, 15:40-17:20
Location: MR105
SIGMOD INDUSTRY SESSIONS
Industry Session 1 - Streaming/Real-Time/Active
Tuesday, 10:30-12:10
Location: MR103
Session Chair: Tyson Condie
- TencentRec: Real-time Stream Recommendation in Practice
- Twitter Heron: Stream Processing at Scale
- Analytics in Motion - High Performance Event-Processing AND Real-Time Analytics in the Same Database
- Why Big Data Industrial Systems Need Rules and What We Can Do About It
Industry Session 2 - Applications
Tuesday, 15:40-17:20
Location: MR103
Session Chair: Jignesh Patel
- Telco Churn Prediction with Big Data
- The LDBC Social Network Benchmark: Interactive Workload
- Rethinking Data-Intensive Science Using Scalable Analytics Systems
- QMapper for Smart Grid: Migrating SQL-based Application to Hive
Industry Session 3 - Novel Systems
Wednesday, 17:10-18:50
Location: MR105
Session Chair: Theo Vassilakis
- REEF: Retainable Evaluator Execution Framework
- Apache Tez: A Unifying Framework for Modeling and Building Data Processing Applications
- Design and Implementation of the LogicBlox System
- Spark SQL: Relational Data Processing in Spark
Industry Session 4 - Performance
Thursday, 10:30-12:10
Location: MR105
Session Chair: Yanlei Diao
- Large-scale Predictive Analytics in Vertica: Fast Data Transfer, Distributed Model Creation, and In-database Prediction
- Oracle Workload Intelligence
- Purity: Building Fast, Highly-Available Enterprise Flash Storage from Commodity Components
- On Improving User Response Times in Tableau
Industry Session 5 - Usability
Thursday, 13:30-15:10
Location: MR105
Session Chair: Magda Balazinska
- Amazon Redshift and the Case for Simpler Data Warehouses
- ShareInsights - An Unified Approach to Full-stack Data Processing
SIGMOD DEMONSTRATION SESSIONS
Demo A
Wednesday, 10:30-12:10
Thursday, 15:40-17:20
Location: MR106
- CE-Storm: Confidential Elastic Processing of Data Streams
- IReS: Intelligent, Multi-Engine Resource Scheduler for Big Data Analytics Workflows
- Exploratory Keyword Search with Interactive Input
- QE3D: Interactive Visualization and Exploration of Complex, Distributed Query Plans
- DataXFormer: An Interactive Data Transformation Tool
- Quality-Driven Continuous Query Execution over Out-of-Order Data Streams
- MoDisSENSE: A Distributed Spatio-Temporal and Textual Processing Platform for Social Networking Services
- DocRicher: An Automatic Annotation System for Text Documents Using Social Media
- A Demonstration of Rubato DB: A Highly Scalable NewSQL Database System for OLTP and Big Data Applications
- G-OLA: Generalized Online Aggregation for Interactive Analysis on Big Data
Demo B
Wednesday, 15:20-17:00
Thursday, 10:30-12:10
Location: MR106
- Thrifty: Offering Parallel Database as a Service using the Shared-Process Approach
- BenchPress: Dynamic Workload Control in the OLTP-Bench Tesbed
- Demonstrating Data Near Here - Scientific Data Search
- Slider: an Efficient Incremental Reasoner
- WANalytics: Geo-Distributed Analytics for a Data Intensive World
- FTT: a System for Finding and Tracking Tourists in Public Transport Services
- SharkDB: An In-Memory Storage System for Massive Trajectory Data
- Ringo: Interactive Graph Analytics on Big-Memory Machines
- STORM: Spatio-Temporal Online Reasoning and Management of Large Spatio-Temporal Data
- PAXQuery: Parallel Analytical XML Processing
Demo C
Wednesday, 17:10-18:50
Thursday, 13:30-15:10
Location: MR106
- Graft: A Debugging Tool For Apache Giraph
- Even Metadata is Getting Big: Annotation Summarization using InsightNotes
- StoryPivot: Comparing and Contrasting Story Evolution
- The Flatter, the Better — Query Compilation Based on the Flattening Transformation
- D2WORM: A Management Infrastructure for Distributed Data-centric Workflows
- NL2CM: A Natural Language Interface to Crowd Mining
- Optimistic Recovery for Iterative Dataflows in Action
- A Secure Search Engine for the Personal Cloud
- Just can't get enough - Synthesizing Big Data
- A SQL Debugger Built from Spare Parts - Turning a SQL:1999 Database System into Its Own Debugger
CAREERS IN INDUSTRY
Careers in Industry
Tuesday 18:30 - 19:30
Location: MR106
Hashtag: #sigmod2015 #industry_career
Session Chair: Feifei Li (University of Utah)
Please click here to check out the Panelist.
Every year, the SIGMOD Conference also includes industry sessions and
industry sponsors from industry leaders on big data management,
database systems, and data management in general. Many students after
graduation will join these companies and start an exciting career in
industry. This year, the SIGMOD Conference organizes a career in
industry panel event to address various aspects of starting a career
in the data management industry. Each platinum or gold sponsor of the
conference will send a representative to participate this panel. This
year’s event consists of representatives from Facebook, Google, IBM, Oracle, SAP,
Tableau, and Twitter. This event is geared towards undergraduate
and graduate students and junior researchers, but we expect it to
attract a very broad audience because it will be both informative and
entertaining. Each panelist gives a short introduction, based on their
personal experiences and perspectives gained over the years. After the
introduction, an informal discussion ensues, where questions are
welcomed from the audience.
The panel will discuss various aspects of preparing, planning out, and
executing a successful industry career. This topic is of particular
interest to undergraduate and graduate students, but is relevant to
the entire data management community.
SIGMOD NEW RESEARCHER SYMPOSIUM
How to fail
Wednesday 13:20 - 14:50
Location: Plenary 1
Hashtag: #sigmod15 #nrs
Session Chairs: Alexandra Meliou (University of Massachusetts Amherst); Julia Stoyanovich (Drexel University)
Every year, the SIGMOD Conference includes an exciting symposium
addressing various aspects of starting a career in the data management
community. These symposia are geared towards graduate students and
junior researchers, but they attract a very broad audience because
they are both informative and entertaining. Each panelist gives a
10-minute presentation, based on their personal experiences and
perspectives gained over the years. After the presentations, an
informal discussion ensues, where questions are welcomed from the
audience.
This year's panel will address the topic: “How to fail.” The panel will discuss various aspects of overcoming and learning from failure. This topic is of particular interest to graduate students and junior researchers, but is relevant to the entire data management community. The issues discussed in the presentation may include, but are not limited to:
— Bad research ideas: How to recognize them, avoid them, and abandon them. Can you transform a bad research direction into a good one?
— Whom should you ask for advice and should you always follow it?
— How should you balance your time across different responsibilities? Which responsibilities should you say "no" to? Is it OK to drop tasks that you committed to?
— What is the proper level of multitasking? How many research directions should you pursue at a time?
— When should you give up on a student, an advisor, or a topic?
SIGMOD/PODS POSTER SESSIONS
- PODS Research Poster Session
Monday 12:10 - 13:30
Location: Level 1 Foyer Papers from PODS Sessions 1 to 4. - Undergraduate Research Poster Session
Monday 19:15 - 21:15
Location: Polly Woodside Tall Ship reception area Papers from Undergraduate Research Posters. - SIGMOD/PODS Research Poster Session 1
Tuesday 12:10 - 13:30
Location: Level 1 Foyer Papers from PODS Sessions 5 to 7 and SIGMOD Sessions 1 to 8. - SIGMOD/PODS Research Poster Session 2
Wednesday 12:10 - 13:30
Location: Level 1 Foyer Papers from PODS Sessions 8 to 10 and SIGMOD Sessions 9 to 16. - SIGMOD/PODS Research Poster Session 3
Thursday 12:10 - 13:30
Location: Level 1 Foyer Papers from SIGMOD Sessions 17 to 24.