Calls For Submissions

PODS Program

SIGMOD Program

Conference Program: SIGMOD Sessions

This page describes the complete SIGMOD Conference program. Please use the following links to skip to the sessions of interest:

Keynote Talks
Panel
Research Sessions
Tutorials
Industry Sessions
Demonstrations
Career in Industry
New Researcher Symposium
SIGMOD/PODS Poster Sessions

SIGMOD KEYNOTE TALKS

Keynote Talks

Keynote 1 - From Data to Insights @ Bare Metal Speed

Jignesh Patel
Tuesday, 8:20-10:00
Location: Plenary 1
Session Chair: Susan Davidson

Keynote 2 - The Power Behind the Throne: Information Integration in the Age of Data-Driven Discovery

Laura Haas
Wednesday, 8:20-10:00
Location: Plenary 1
Session Chair: Zack Ives

ACM-W Athena Lecturer Award - Three Favorite Results

Jennifer Widom
Tuesday, 17:30-18:15
Location: Plenary 1
Session Chair: Magda Balazinska

SIGMOD PANEL

Machine Learning and Databases: The Sound of Things to Come or a Cacophony of Hype?

Tuesday, 13:30 - 15:10
Location: MR106
Panel Chair: Christopher Re (Stanford University)
Presenters: Divyakant Agrawal, Magdalena Balazinska, Michael Cafarella, Michael Jordan, Tim Kraska, and Raghu Ramakrishnan

SIGMOD RESEARCH SESSIONS

Research Session 1 - Cloud: Parallel Execution

Tuesday, 10:30-12:10
Location: MR106
Session Chair: Yanlei Diao

Distributed Outlier Detection using Compressive Sensing
Locality-aware Partitioning Design in Parallel Database Systems
ByteSlice: Pushing the Envelop of Main Memory Data Processing with a New Storage Layout
Implicit Parallelism through Deep Language Embedding
From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System

Research Session 2 - Matrix and Array Computations

Tuesday, 10:30-12:10
Location: MR101&102
Session Chair: Florin Rusu

sPCA: Scalable Principal Component Analysis for Big Data
Exploiting matrix dependency for efficient distributed matrix computation
LEMP: Fast Retrieval of Large Entries in a Matrix Product
Skew-Aware Join Optimization for Array Databases
Resource Elasticity for Large-Scale Machine Learning

Research Session 3 - Security and Access Control

Tuesday, 10:30-12:10
Location: MR104
Session Chair: Lei Chen

SEMROD: Secure and Efficient MapReduce over HybriD Clouds
Authenticated Online Data Integration Services
ENKI: Access Control for Encrypted Query Processing
Collaborative Access Control in WebdamLog
Automatic Enforcement of Data Use Policies with DataLawyer

Research Session 4 - Cloud: Fault Tolerance, Reconfiguration

Tuesday, 13:30-15:10
Location: MR101&102
Session Chair: Alexandros Labrinidis

Cost-based Fault-tolerance for Parallel Data Processing
Squall: Fine-Grained Live Reconfiguration for Partitioned Main Memory Databases
Madeus: A Database Live Migration Middleware under Heavy Workloads for Cloud Environment
Lineage-driven Fault Injection

Research Session 5 - Keyword Search and Text

Tuesday, 13:30-15:10
Location: MR103
Session Chair: K. Selcuk Candan

Diversity-Aware Top-K Publish/Subscribe for Text Stream
Diverse and Proportional Size-l Object Summaries for Keyword Search
Local Filtering: Improving Performance of Approximate Queries on String Collections
Exact Top-k Nearest Keyword Search on Large Networks
Efficient Algorithms for Answering the m-Closest Keywords Query

Research Session 6 - Graph Primitives

Tuesday, 15:40-17:20
Location: MR101&102
Session Chair: Ke Yi

Minimum Spanning Trees in Temporal Graphs
Efficient Enumeration of Maximal k-Plexes
Divide & Conquer: I/O Efficient Depth-First Search
Index-based Optimal Algorithms for Computing Steiner Components with Maximum Connectivity

Research Session 7 - Data Mining

Tuesday, 15:40-17:20
Location: MR105
Session Chair: Julia Stoyanovich

COMMIT: A Scalable Approach to Mining Communication Motifs from Dynamic Networks
LASH: Large-Scale Sequence Mining with Hierarchies
Twister Tries: Approximate Hierarchical Agglomerative Clustering for Average Distance in Linear Time
DBSCAN Revisited: Mis-Claim, Un-Fixability, and Approximation
The TagAdvisor: Luring the Lurkers to Review Web Items

Research Session 8 - Uncertainty and Linking

Tuesday, 15:40-17:20
Location: MR104
Session Chair: Pierre Senellart

Supporting Data Uncertainty in Array Databases
Identifying the Extent of Completeness of Query Answers over Partially Complete Databases
k-Hit Query: Top-k Query with Probabilistic Utility Function
Linking Temporal Records for Profiling Entities

Research Session 9 - Transactional Architectures

Wednesday, 10:30-12:10
Location: MR103
Session Chair: Feifei Li

On the Design and Scalability of Distributed Shared-Data Databases
Fast Serializable Multi-Version Concurrency Control for Main-Memory Database Systems
FOEDUS: OLTP Engine for a Thousand Cores and NVRAM
Let’s Talk About Storage & Recovery Methods for Non-Volatile Memory Database Systems

Research Session 10 - Privacy

Wednesday, 10:30-12:10
Location: MR104
Session Chair: Wang-Chiew Tan

Private Release of Graph Statistics using Ladder Functions
Bayesian Differential Privacy on Correlated Data
Modular Order Preserving Encryption, Revisited
Chiaroscuro: Transparency and Privacy for Massive Personal Time Series Clustering

Research Session 11 - Streams

Wednesday, 10:30-12:10
Location: MR101&102
Session Chair: Jorge Quiané-Ruiz

Persistent Data Sketching
Scalable Distributed Stream Join Processing
SCREEN: Stream Data Cleaning under Speed Constraints
Location-Aware Pub/Sub System: When Continuous Moving Queries Meet Dynamic Event Streams

Research Session 12 - Spatial data

Wednesday, 15:20-17:00
Location: MR103
Session Chair: Cyrus Shahabi

Optimal Spatial Dominance: An effective search of Nearest Neighbor Candidates
THERMAL-JOIN: Scalable Spatial Join for Dynamic Workloads
Indexing Metric Uncertain Data for Range Queries
Efficient Route Planning on Public Transportation Networks: A Labelling Approach

Research Session 13 - Crowdsourcing

Wednesday, 15:20-17:00
Location: MR104
Session Chair: Dimitrios Tsoumakos

The Importance of Being Expert: Efficient Max-Finding in Crowdsourcing
Minimizing Efforts in Validating Crowd Answers
iCrowd: An Adaptive Crowdsourcing Framework
QASCA: A Quality-Aware Task Assignment System for Crowdsourcing Applications
tDP: An Optimal-Latency Budget Allocation strategy for Crowdsourced MAXIMUM operations

Research Session 14 - Indexing & Performance

Wednesday, 17:10-18:50
Location: MR101&102
Session Chair: Yufei Tao

Cache-Efficient Aggregation: Hashing is Sorting
Efficient Similarity Join and Search on Multi-Attribute Data
Holistic Indexing in Main-memory Column-stores
CliffGuard: A Principled Framework for Finding Robust Database Designs
Exploiting Correlations for Queries with Expensive Predicates

Research Session 15 - Data Cleaning

Wednesday, 17:10-18:50
Location: MR103
Session Chair: Mohamad Sharaf

Query-Oriented Data Cleaning with Oracles
BigDansing: A System for Big Data Cleansing
Data X-Ray: A diagnostic tool for data errors
KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing
Crowd-based Deduplication: an Adaptive Approach

Research Session 16 - Transactions

Wednesday, 17:10-18:50
Location: MR104
Session Chair: Ashraf Aboulnaga

Minimizing Commit Latency of Transactions in Geo-Replicated Data Stores
Optimizing Optimistic Concurrency Control for Tree-Structured, Log-Structured Databases
The Homeostasis Protocol: Avoiding Transaction Coordination Through Program Analysis
Feral Concurrency Control: An Empirical Investigation of Modern Application Integrity

Research Session 17 - Hardware-Aware Query Processing

Thursday, 10:30-12:10
Location: MR103
Session Chair: Spyros Blanas

Rack-Scale Join Processing using RDMA
Self-Tuning, GPU-Accelerated Kernel Density Models for Multidimensional Selectivity Estimation
Rethinking SIMD Vectorization for In-Memory Databases
A Padded Encoding Scheme to Accelerate Scans by Leveraging Skew

Research Session 18 - Graph Propagation, Influence, Mining

Thursday, 10:30-12:10
Location: MR101&102
Session Chair: Panagiotis Karras

GetReal: Towards Realistic Selection of Influence Maximization Strategies in Competitive Networks
Influence Maximization in Near-Linear Time: A Martingale Approach
Community Level Diffusion Extraction
BEAR: Block Elimination Approach for Random Walk with Restart on Large Graphs
The Minimum Wiener Connector Problem

Research Session 19 - Social Networks

Thursday, 10:30-12:10
Location: MR104
Session Chair: Chengfei Liu

From Group Recommendations to Group Formation
Real-Time Multi-Criteria Social Graph Partitioning: A Game Theoretic Approach
Utility-Aware Social Event-Participant Planning
Online Video Recommendation in Sharing Community

Research Session 20 - Information Extraction and Record Linking

Thursday, 13:30-15:10
Location: MR103
Session Chair: Guoliang Li

TEGRA: Table Extraction by Global Record Alignment
Mining Quality Phrases from Massive Text Corpora
Mining Subjective Properties on the Web
Microblog Entity Linking with Social Temporal Context

Research Session 21 - RDF and SPARQL

Thursday, 13:30-15:10
Location: MR101&102
Session Chair: Nikos Mamoulis

Graph-Aware, Workload-Adaptive SPARQL Query Caching
Left Bit Right: For SPARQL Join Queries with OPTIONAL Patterns (Left-outer-joins)
How to Build Templates for RDF Question/Answering --An Uncertain Graph Similarity Join Approach
RBench: Application-Specific RDF Benchmarking
ALEX: Automatic Link Exploration in Linked Data

Research Session 22 - Time Series & Graph Processing

Thursday, 13:30-15:10
Location: MR104
Session Chair: Alan Fekete

k-Shape: Efficient and Accurate Clustering of Time Series
SMiLer: A Semi-Lazy Time Series Prediction System for Sensors
SQLGraph: An Efficient Relational-Based Property Graph Store
Updating Graph Indices with a One-Pass Algorithm

Research Session 23 - Advanced Query Processing

Thursday, 15:40-17:20
Location: MR103
Session Chair: Sharad Mehrotra

An Incremental Anytime Algorithm for Multi-Objective Query Optimization
Output-sensitive Evaluation of Prioritized Skyline Queries
Learning Generalized Linear Models Over Normalized Data
Utilizing IDs to Accelerate Incremental View Maintenance

Research Session 24 - New Models

Thursday, 15:40-17:20
Location: MR104
Session Chair: Jiaheng Lu

Top-k Spreadsheet-Style Search for Query Discovery
Proactive Annotation Management in Relational Databases
Weighted Coverage based Reviewer Assignment
Distributed Online Tracking

SIGMOD TUTORIAL SESSIONS

Tutorial 1 - Overview of Data Exploration Techniques

Tuesday, 10:30-12:10
Tuesday, 13:30-15:10
Location: MR105
Tutorial 2 - Mining and Forecasting of Big Time-series Data

Wednesday, 10:30-12:10
Wednesday, 15:20-17:00
Location: MR105
Tutorial 3 - Data management in non-volatile memory

Thursday, 10:30-12:10
Thursday, 13:30-15:10
Location: Plenary 1
Tutorial 4 - Knowledge Curation and Knowledge Fusion: Challenges, Models, and Applications

Thursday, 15:40-17:20
Location: MR105

SIGMOD INDUSTRY SESSIONS

Industry Session 1 - Streaming/Real-Time/Active

Tuesday, 10:30-12:10
Location: MR103
Session Chair: Tyson Condie

TencentRec: Real-time Stream Recommendation in Practice
Twitter Heron: Stream Processing at Scale
Analytics in Motion - High Performance Event-Processing AND Real-Time Analytics in the Same Database
Why Big Data Industrial Systems Need Rules and What We Can Do About It

Industry Session 2 - Applications

Tuesday, 15:40-17:20
Location: MR103
Session Chair: Jignesh Patel

Telco Churn Prediction with Big Data
The LDBC Social Network Benchmark: Interactive Workload
Rethinking Data-Intensive Science Using Scalable Analytics Systems
QMapper for Smart Grid: Migrating SQL-based Application to Hive

Industry Session 3 - Novel Systems

Wednesday, 17:10-18:50
Location: MR105
Session Chair: Theo Vassilakis

REEF: Retainable Evaluator Execution Framework
Apache Tez: A Unifying Framework for Modeling and Building Data Processing Applications
Design and Implementation of the LogicBlox System
Spark SQL: Relational Data Processing in Spark

Industry Session 4 - Performance

Thursday, 10:30-12:10
Location: MR105
Session Chair: Yanlei Diao

Large-scale Predictive Analytics in Vertica: Fast Data Transfer, Distributed Model Creation, and In-database Prediction
Oracle Workload Intelligence
Purity: Building Fast, Highly-Available Enterprise Flash Storage from Commodity Components
On Improving User Response Times in Tableau

Industry Session 5 - Usability

Thursday, 13:30-15:10
Location: MR105
Session Chair: Magda Balazinska

Amazon Redshift and the Case for Simpler Data Warehouses
ShareInsights - An Unified Approach to Full-stack Data Processing

SIGMOD DEMONSTRATION SESSIONS

Demo A

Wednesday, 10:30-12:10
Thursday, 15:40-17:20
Location: MR106

CE-Storm: Confidential Elastic Processing of Data Streams
IReS: Intelligent, Multi-Engine Resource Scheduler for Big Data Analytics Workflows
Exploratory Keyword Search with Interactive Input
QE3D: Interactive Visualization and Exploration of Complex, Distributed Query Plans
DataXFormer: An Interactive Data Transformation Tool
Quality-Driven Continuous Query Execution over Out-of-Order Data Streams
MoDisSENSE: A Distributed Spatio-Temporal and Textual Processing Platform for Social Networking Services
DocRicher: An Automatic Annotation System for Text Documents Using Social Media
A Demonstration of Rubato DB: A Highly Scalable NewSQL Database System for OLTP and Big Data Applications
G-OLA: Generalized Online Aggregation for Interactive Analysis on Big Data

Demo B

Wednesday, 15:20-17:00
Thursday, 10:30-12:10
Location: MR106

Thrifty: Offering Parallel Database as a Service using the Shared-Process Approach
BenchPress: Dynamic Workload Control in the OLTP-Bench Tesbed
Demonstrating Data Near Here - Scientific Data Search
Slider: an Efficient Incremental Reasoner
WANalytics: Geo-Distributed Analytics for a Data Intensive World
FTT: a System for Finding and Tracking Tourists in Public Transport Services
SharkDB: An In-Memory Storage System for Massive Trajectory Data
Ringo: Interactive Graph Analytics on Big-Memory Machines
STORM: Spatio-Temporal Online Reasoning and Management of Large Spatio-Temporal Data
PAXQuery: Parallel Analytical XML Processing

Demo C

Wednesday, 17:10-18:50
Thursday, 13:30-15:10
Location: MR106

Graft: A Debugging Tool For Apache Giraph
Even Metadata is Getting Big: Annotation Summarization using InsightNotes
StoryPivot: Comparing and Contrasting Story Evolution
The Flatter, the Better — Query Compilation Based on the Flattening Transformation
D2WORM: A Management Infrastructure for Distributed Data-centric Workflows
NL2CM: A Natural Language Interface to Crowd Mining
Optimistic Recovery for Iterative Dataflows in Action
A Secure Search Engine for the Personal Cloud
Just can't get enough - Synthesizing Big Data
A SQL Debugger Built from Spare Parts - Turning a SQL:1999 Database System into Its Own Debugger

CAREERS IN INDUSTRY

Careers in Industry
Tuesday 18:30 - 19:30
Location: MR106
Hashtag: #sigmod2015 #industry_career
Session Chair: Feifei Li (University of Utah)
Please click here to check out the Panelist.

Every year, the SIGMOD Conference also includes industry sessions and industry sponsors from industry leaders on big data management, database systems, and data management in general. Many students after graduation will join these companies and start an exciting career in industry. This year, the SIGMOD Conference organizes a career in industry panel event to address various aspects of starting a career in the data management industry. Each platinum or gold sponsor of the conference will send a representative to participate this panel. This year’s event consists of representatives from Facebook, Google, IBM, Oracle, SAP, Tableau, and Twitter. This event is geared towards undergraduate and graduate students and junior researchers, but we expect it to attract a very broad audience because it will be both informative and entertaining. Each panelist gives a short introduction, based on their personal experiences and perspectives gained over the years. After the introduction, an informal discussion ensues, where questions are welcomed from the audience.
The panel will discuss various aspects of preparing, planning out, and executing a successful industry career. This topic is of particular interest to undergraduate and graduate students, but is relevant to the entire data management community.

SIGMOD NEW RESEARCHER SYMPOSIUM

How to fail
Wednesday 13:20 - 14:50
Location: Plenary 1
Hashtag: #sigmod15 #nrs
Session Chairs: Alexandra Meliou (University of Massachusetts Amherst); Julia Stoyanovich (Drexel University)

Every year, the SIGMOD Conference includes an exciting symposium addressing various aspects of starting a career in the data management community. These symposia are geared towards graduate students and junior researchers, but they attract a very broad audience because they are both informative and entertaining. Each panelist gives a 10-minute presentation, based on their personal experiences and perspectives gained over the years. After the presentations, an informal discussion ensues, where questions are welcomed from the audience.
This year's panel will address the topic: “How to fail.” The panel will discuss various aspects of overcoming and learning from failure. This topic is of particular interest to graduate students and junior researchers, but is relevant to the entire data management community. The issues discussed in the presentation may include, but are not limited to:
— Bad research ideas: How to recognize them, avoid them, and abandon them. Can you transform a bad research direction into a good one?
— Whom should you ask for advice and should you always follow it?
— How should you balance your time across different responsibilities? Which responsibilities should you say "no" to? Is it OK to drop tasks that you committed to?
— What is the proper level of multitasking? How many research directions should you pursue at a time?
— When should you give up on a student, an advisor, or a topic?

SIGMOD/PODS POSTER SESSIONS

PODS Research Poster Session
Monday 12:10 - 13:30
Location: Level 1 Foyer
Papers from PODS Sessions 1 to 4.
Undergraduate Research Poster Session
Monday 19:15 - 21:15
Location: Polly Woodside Tall Ship reception area
Papers from Undergraduate Research Posters.
SIGMOD/PODS Research Poster Session 1
Tuesday 12:10 - 13:30
Location: Level 1 Foyer
Papers from PODS Sessions 5 to 7 and SIGMOD Sessions 1 to 8.
SIGMOD/PODS Research Poster Session 2
Wednesday 12:10 - 13:30
Location: Level 1 Foyer
Papers from PODS Sessions 8 to 10 and SIGMOD Sessions 9 to 16.
SIGMOD/PODS Research Poster Session 3
Thursday 12:10 - 13:30
Location: Level 1 Foyer
Papers from SIGMOD Sessions 17 to 24.

Welcome

Organization

Links