DSAA 2017 Program


October 18
19:00 – 21:00 Reception
October 19
8:30 – 9:00 Opening
(Room A [Topaz15])
9:00 – 10:00 Keynote: Michael I. Jordan
On Computational Thinking, Inferential Thinking and Data Science
Chair: Tomoyuki Higuchi
(Room A [Topaz15])
10:00 – 10:30 Coffee Break
(Foyers of Rooms A [Topaz15] & B [Silver12])
Exhibition (Room B [Silver12])
10:40 – 12:20
Room A: (Research Track) Classification and Regression
Room B: (Application Track) Image and Behavior Modeling
Room C: (Special Session) Evolving Networks (EvoNets)
Room D: Tutorial 1, Collecting Data with Serverless Applications
12:20 – 14:00 Lunch
Exhibition (Room B [Silver12])
14:00 – 14:40 Exhibition
(Room B [Silver12])
14:40 – 16:20
Room A: Invited Industry Talk 1
Room B: (Application Track) Network Analysis and Topic Modeling
Room C: (Special Session) Big Data and Disaster Management (BDDM) / Advanced Informatic Measurement using Statistics, Machine Learning and Pattern Recognition (AimSMLPR)
Room D: Tutorial 1, Collecting Data with Serverless Applications
16:20 – 16:50 Coffee Break
(Foyers of Rooms A [Topaz15] & B [Silver12])
Exhibition (Room B [Silver12])
17:00 – 18:50
Room A: (Research Track) Time Series Modeling and Forecast
Room B: (Research Track) Search and Sequence Modeling
Room C: (Special Session) Environmental and Geo-spatial Data Analytics (EnGeoData 2017) (1)
Room D: (Research Track) Statistical Approaches
October 20
8:30 – 9:30 Keynote: Hiroaki Kitano
Nobel Turing Challenge: Grand Challenge of AI, Robotics, and Systems Biology
Chair: Hiroshi Motoda
(Room A [Topaz15])
9:30 – 10:00 Coffee Break
(Foyers of Rooms A [Topaz15] & B [Silver12])
Exhibition (Room B [Silver12])
10:10 – 11:50
Room A: (Research Track) Network Service (1)
Room B: (Application Track) New Applications (1)
Room C: (Special Session) Environmental and Geo-spatial Data Analytics (EnGeoData 2017) (2)
Room D: Tutorial 2, Mining Attributed Networks
11:50 – 13:20 Lunch
Exhibition (Room B [Silver12])
13:20 – 13:50 Exhibition
(Room B [Silver12])
14:00 – 15:40
Room A: (Research Track) Network Service (2)
Room B: (Application Track) New Applications (2)
Room C: (Special Session) Beyond IID: Non-IID Learning (NonIIDLearning)
Room D: Tutorial 2, Mining Attributed Networks
15:40 – 16:20 Coffee Break
(Foyers of Rooms A [Topaz15] & B [Silver12])
Exhibition (Room B [Silver12])
16:20 – 18:00
Room A: Invited Industry Talk 2
Room B: (Research Track) Graph and Network
19:00 – 21:30 Banquet
(Prince Room at Grand Prince Hotel Takanawa)
October 21
8:30 – 9:30 Keynote: Katharina J. Morik
Data Analytics for Data Science
Chair: Fosca Giannotti
(Room A [Topaz15])
9:30 – 10:00 Coffee Break
(Foyers of Rooms A [Topaz15] & B [Silver12])
Exhibition (Room B [Silver12])
10:10 – 11:50
Room A: (Research Track) Acoustic and Video Recognition
Room B: (Application Track) Outliers and Compression
Room C: (Special Session) Data Science in Societal Debates (DSSD) / Game Data Science (GDS 2017)
Room D: Tutorial 3, Visually Do Statistical Shape Analysis!
11:50 – 13:20 Lunch
Exhibition (Room B [Silver12])
13:20 – 15:00
Room A: (Research Track) Feature Exploration and Classification
Room B: (Research Track) Estimating Dependency and Dimensions
Room C: (Special Session) Data and Information Quality (DIQ)
Room D: Tutorial 3, Visually Do Statistical Shape Analysis!
15:00 – 16:30 Trends & Controversy: ‘Trust’, position statements
(Room A [Topaz15])
16:30 – 17:00 Coffee Break
(Foyers of Room A [Topaz15])
17:00 – 18:00 Panel: ‘Trust’, panel discussion
(Room A [Topaz15])
18:00 – 18:30 Closing
(Room A [Topaz15])


October 19     10:40 – 12:20

Classification and Regression   (Room A [Topaz15])

Chair: Min-Ling Zhang
The k-Nearest Representatives Classifier: A Distance-Based Classifier with Strong Generalization Bounds (long)
Cyrus Cousins and Eli Upfal
Cyclic Classifier Chain for Cost-Sensitive Multilabel Classification (regular)
Yi-An Lin and Hsuan-Tien Lin
Learning Low-Rank Document Embeddings with Weighted Nuclear Norm Regularization (regular)
Lukas Pfahler, Katharina Morik, Frederik Elwert, Samira Tabti, and Volkhard Krech
Learning Through Utility Optimization in Regression Tasks (regular)
Paula Branco, Luis Torgo, Rita P. Ribeiro, Eibe Frank, Bernhard Pfahringer, and Markus Michael Rau

Image and Behavior Modeling   (Room B [Silver12])

Chair: Chandrika Kamath
Animal Recognition and Identification with Deep Convolutional Neural Networks for Automated Wildlife Monitoring
Hung Nguyen, Sarah J. Maclagan, Tu Dinh Nguyen, Thin Nguyen, Paul Flemons, Kylie Andrews, Euan G. Ritchie, and Dinh Phung
Nazr-CNN: Fine-Grained Classification of UAV Imagery for Damage Assessment
Nazia Attari, Ferda Ofli, Mohammad Awad, Ji Lucas, and Sanjay Chawla
Website Navigation Behavior Analysis for Bot Detection
Rabih Haidar and Shady Elbassuoni
Inform Product Change Through Experimentation with Data-Driven Behavioral Segmentation
Zhenyu Zhao, Yan He, and Miao Chen

Evolving Networks (EvoNets)   (Room C [Momiji/Sumire/Shoubu])

Chair: Joao Gama
Scalable RFM-Enriched Representation Learning for Churn Prediction
Sandra Mitrovic, Gaurav Singh, Bart Baesens, Wilfried Lemahieu, and Jochen De Weerdt
A Comparative Study of Different Approaches for Tracking Communities in Evolving Social Networks
Ziwei He, Etienne Gael Tajeuna, Shengrui Wang, and Mohamed Bouguessa
The Initialization and Parameter Setting Problem in Tensor Decomposition-Based Link Prediction
Sofia Da Silva Fernandes, Hadi Fanaee Tork, and João Manuel Portela Da Gama

October 19     14:40 – 16:20

Invited Industry Talk 1   (Room A [Topaz15])

Chair: Yoji Kiyota
Building content quality models for News Feed – Baidu’s practice
Yanjun Ma
Four Waves of AI Business – NEC the WISE and NEXT –
Satoshi Morinaga
Data as gravity – Yahoo! JAPAN’s collaborative challenges
Akira Tajima

Network Analysis and Topic Modeling   (Room B [Silver12])

Chair: Longbing Cao
Materials Science Literature-Patent Relevance Search: A Heterogeneous Network Analysis Approach
Pingjie Tang, Jed Pitera, Dmitry Zubarev, and Nitesh V. Chawla
NDlib: Studying Network Diffusion Dynamics
Giulio Rossetti, Letizia Milli, Salvatore Rinzivillo, Alina Sirbu, Dino Pedreschi, and Fosca Giannotti
Full-Text or Abstract? Examining Topic Coherence Scores Using Latent Dirichlet Allocation
Shaheen Syed and Marco Spruit
Incremental Author Name Disambiguation for Scientific Citation Data
Zhengqiao Zhao, Jason Rollins, Linge Bai, and Gail Rosen

Big Data and Disaster Management (BDDM) / Advanced Informatic Measurement using Statistics, Machine Learning and Pattern Recognition (AimSMLPR)   (Room C [Momiji/Sumire/Shoubu])

Chair: Yusheng Ji
Supercharging Crowd Dynamics Estimation in Disasters Via Spatio-Temporal Deep Neural Network
Fang-Zhou Jiang, Lei Zhong, Kanchana Thilakarathna, Aruna Seneviratne, Kiyoshi Takano, Shigeki Yamada, and Yusheng Ji
Geo-Spatial Multimedia Sentiment Analysis in Disasters
Abdullah Alfarrarjeh, Sumeet Agrawal, Seon Ho Kim, and Cyrus Shahabi
Situational Awareness from Social Media Photographs Using Automated Image Captioning
João Monteiro, Asanobu Kitamoto, and Bruno Martins
Machine Learning Independent of Population Distributions for Measurement
Takashi Washio, Gaku Imamura, and Genki Yoshikawa

October 19     17:00 – 18:50

Time Series Modeling and Forecast   (Room A [Topaz15])

Chair: Thanh Phuong Nguyen
A Dynamic Factor Machine Learning Method for Multi-Variate and Multi-Step-Ahead Forecasting (long)
Gianluca Bontempi, Yann-Aël Le Borgne, and Jacopo De Stefani
CSAR: The Cross-Sectional Autoregression Model (long)
Claudio Hartmann, Martin Hahmann, Dirk Habich, and Wolfgang Lehner
Dynamic and Heterogeneous Ensembles for Time Series Forecasting (long)
Vitor Cerqueira, Luis Torgo, Mariana Oliveira, and Bernhard Pfahringer
Forward-Backward Smoothing for Hidden Markov Models of Point Pattern Data (regular)
Nhan Dam, Dinh Phung, Ba-Ngu Vo, and Viet Huynh

Search and Sequence Modeling   (Room B [Silver12])

Chair: Saso Dzeroski
RadiusSketch: Massively Distributed Indexing of Time Series (long)
Djamel Edine Yagoubi, Reza Akbarinia, Florent Masseglia, and Dennis Shasha
BJR-Tree: Fast Skyline Computation Algorithm for Serendipitous Searching Problems (long)
Kenichi Koizumi, Peter Eades, Kei Hiraki, and Mary Inaba
A Directional Change Based Trading Strategy with Dynamic Thresholds (long)
Nora Alkhamees and Maria Fasli
Subsequence Search Considering Duration and Relations of Events in Time Interval-Based Events Sequences (regular)
Cheng-Wei Yang, Bijay Prasad Jaysawal, and Jen-Wei Huang

Environmental and Geo-spatial Data Analytics (EnGeoData 2017) (1)   (Room C [Momiji/Sumire/Shoubu])

Chair: Maguelonne Teisseire
There’s a Path for Everyone: A Data-Driven Personal Model Reproducing Mobility Agendas
Riccardo Guidotti, Roberto Trasarti, Mirco Nanni, Fosca Giannotti, and Dino Pedreschi
Heterogeneous Information Integration for Mountain Augmented Reality Mobile Apps
Darian Frajberg, Piero Fraternali, and Rocio Nahime Torres
Predictive Classification of Water Consumption Time Series Using Non-homogeneous Markov Models
Milad Leyli Abadi, Allou Samé, Latifa Oukhellou, Nicolas Cheifetz, Pierre Mandel, Cédric Féliers, and Olivier Chesneau
DP-POIRS: A Diversified and Personalized Point-of-Interest Recommendation System
Xiangfu Meng, Yanhuan Tang, and Xiaoyan Zhang

Statistical Approaches   (Room D [Rindou/Shakunage/Lavender])

Chair: Chandrika Kamath
On the Jeffreys-Lindley Paradox and the Looming Reproducibility Crisis in Machine Learning (long)
Daniel Berrar and Werner Dubitzky
M3A: Model, MetaModel, and Anomaly Detection for Inter-Arrivals of Web Searches and Postings (long)
Da-Cheng Juan, Neil Shah, Mingyu Tang, Zhiliang Qian, Diana Marculescu, and Christos Faloutsos
A Consistency-Based Multimodal Graph Embedding Method for Dimensionality Reduction (long)
Ilias Kalamaras, Anastasios Drosou, Eleftheria Polychronidou, and Dimitrios Tzovaras
Sample, Estimate, Tune: Scaling Bayesian Auto-tuning of Data Science Pipelines (regular)
Alec Anderson, Sebastien Dubois, Alfredo Cuesta-Infante, and Kalyan Veeramachaneni

October 20     10:10 – 11:50

Network Service (1)   (Room A [Topaz15])

Chair: Xintao Wu
Multiple Social Role Embedding (long)
Linchuan Xu, Xiaokai Wei, Jiannong Cao, and Philip S. Yu
FeatureHub: Towards Collaborative Data Science (long)
Micah J. Smith, Roy Wedge, and Kalyan Veeramachaneni
Identifying Anomalous Nodes in Multidimensional Networks (regular)
Amani Chouchane and Mohamed Bouguessa
Discovering Community Structure in Multilayer Networks (regular)
Soumajit Pramanik, Raphael Tackx, Anchit Navelkar, Jean-Loup Guillaume, and Bivas Mitra

New Applications (1)   (Room B [Silver12])

Chair: Shin’ya Nakano
The Data and Science Behind GrabShare Carpooling
Muchen Tang, Serene Ow, Wenqing Chen, Yang Cao, Kong-Wei Lye, and YaoZhang Pan
Regression Based Model for Autosteering of a Car with Delayed Steering Response
Vsevolod Nikulin, Albert Podusenko, Ivan Tanev, and Katsunori Shimohara
Ensemble-Based Location Tracking Using Passive RFID
Hao-Ying Liang, Yun-Tung Shieh, Addicam Sanjay, Shao-Wen Yang, and Shou-De Lin
Leveraging on Predictive Analytics to Manage Clinic No Show and Improve Accessibility of Care
Guanhua Lee, Sijia Wang, Fransiscus Dipuro, Jue Hou, Priyanka Grover, Lian Leng Low, Nan Liu, and Chui Yee Loke

Environmental and Geo-spatial Data Analytics (EnGeoData 2017) (2)   (Room C [Momiji/Sumire/Shoubu])

Chair: Mathieu Roche
A Shape-Based Approach to Spatio-Temporal Data Analysis Using Satellite Imagery
Darpan Baheti and K. S. Rajan
Mobility Genome™ – A Framework for Mobility Intelligence from Large-Scale Spatio-Temporal Data
The Anh Dang, Jayakumaran Deepak, Jingxuan Wang, Shixin Luo, Yunye Jin, Yibin Ng, Aloysius Lim, and Ying Li
A Peak Detection Method to Uncover Events from Social Media
Carmela Comito, Deborah Falcone, and Domenico Talia
Semantic Trajectory Modeling for Dynamic Built Environments
Christophe Cruz

October 20     14:00 – 15:40

Network Service (2)   (Room A [Topaz15])

Chair: Mohamed Bouguessa
Disentangled Link Prediction for Signed Social Networks via Disentangled Representation Learning (long)
Linchuan Xu, Xiaokai Wei, Jiannong Cao, and Philip S. Yu
Exploiting Digital DNA for the Analysis of Similarities in Twitter Behaviours (long)
Stefano Cresci, Roberto Di Pietro, Marinella Petrocchi, Angelo Spognardi, and Maurizio Tesconi
Where are You Going? Next Place Prediction from Twitter (regular)
Carmela Comito
A Study of Stochastic Mixed Membership Models for Link Prediction in Social Networks (regular)
Adrien Dulac, Eric Gaussier, and Christine Largeron

New Applications (2)   (Room B [Silver12])

Chair: Shady Elbassuoni
HiSPEED: A System for Mining Performance Appraisal Data and Text
Girish Keshav Palshikar, Manoj Apte, Sachin Pawar, and Nitin Ramrakhiyani
Identification of Signal and Noise Components in Spacecraft Neutral Particle Data Using a Bi-Level Mixture Model
Shin’ya Nakano and Yoshifumi Futaana
A Collaborative Filtering-Based Two Stage Model with Item Dependency for Course Recommendation
Eric L. Lee, Tsung-Ting Kuo, and Shou-De Lin
Enriching Course-Specific Regression Models with Content Features for Grade Prediction
Qian Hu, Agoritsa Polyzou, George Karypis, and Huzefa Rangwala

Beyond IID: Non-IID Learning (NonIIDLearning)   (Room C [Momiji/Sumire/Shoubu])

Chair: Longbing Cao
Steganalysis Feature Subspace Selection Based on Fisher Criterion
Chunfang Yang, Yi Zhang, Ping Wang, Xiangyang Luo, Fenlin Liu, and Jicang Lu
Coupled Bayesian Matrix Factorization in Recommender Systems
Xueci Zhao, Chengzhang Zhu, and Lizhi Cheng
A Comparative Study of Performance Estimation Methods for Time Series Forecasting
Vitor Cerqueira, Luis Torgo, Jasmina Smailović, and Igor Mozetic

October 20     16:20 – 18:00

Invited Industry Talk 2   (Room A [Topaz15])

Chair: Kiyoshi Izumi
The main problems and their solutions in industrial machine learning applications
Wenyuan Dai
Textual Information based automatic Item Classification -a case of Challenging Company in Japan-
Kei Harada

Graph and Network   (Room B [Silver12])

Chair: Gianluca Bontempi
On Spectral Analysis of Directed Signed Graphs (long)
Yuemeng Li, Xintao Wu, and Aidong Lu
AnonML: Locally Private Machine Learning over a Network of Peers (long)
Bennett Cyphers and Kalyan Veeramachaneni
Maximizing Network Performance Based on Group Centrality by Creating Most Effective k-Links (regular)
Kouzou Ohara, Kazumi Saito, Masahiro Kimura, and Hiroshi Motoda
Multi-task Network Embedding (regular)
Linchuan Xu, Xiaokai Wei, Jiannong Cao, and Philip S. Yu

October 21     10:10 – 11:50

Acoustic and Video Recognition   (Room A [Topaz15])

Chair: Grant J. Scott
What Makes a Video Memorable? (long)
Akankshya Kar, Prashasthi Mavin, Yogesh Ghaturle, and Vani M.
Convolutional Neural Networks Based Multi-Task Deep Learning for Movie Review Classification (regular)
Xuanyi Li, Weimin Wu, and Hongye Su
Masked Conditional Neural Networks for Automatic Sound Events Recognition (regular)
Fady Medhat, David Chesmore, and John Robinson
A Spatial-Cue-Based Probabilistic Model for Bird Song Scene Analysis (regular)
Ryosuke Kojima, Osamu Sugiyama, Kotaro Hoshiba, Reiji Suzuki, and Kazuhiro Nakadai

Outliers and Compression   (Room B [Silver12])

Chair: Hirotaka Kaji
Learning to Compress Unstructured Mesh Data from Simulations
Chandrika Kamath
An Assessment of Streaming Active Learning Strategies for Real-Life Credit Card Fraud Detection
Fabrizio Carcillo, Yann-Aël Le Borgne, Olivier Caelen, and Gianluca Bontempi
A Probabilistic, Mechanism-Independent Outlier Detection Method for Online Experimentation
Yan He and Miao Chen

Data Science in Societal Debates (DSSD) / Game Data Science (GDS 2017)   (Room C [Momiji/Sumire/Shoubu])

Chair: Stefano Cresci / África Periáñez
News Consumption during the Italian Referendum: A Cross-Platform Analysis on Facebook and Twitter
Michela Del Vicario, Sabrina Gaito, Walter Quattrociocchi, Matteo Zignani, and Fabiana Zollo
Feature Analysis for Fake Review Detection through Supervised Classification
Julien Fontanarava, Gabriella Pasi, and Marco Viviani
Online k-Maxoids Clustering
Rafet Sifa and Christian Bauckhage

October 21     13:20 – 15:00

Feature Exploration and Classification   (Room A [Topaz15])

Chair: Shady Elbassuoni
Combining Instance and Feature Neighbors for Efficient Multi-label Classification (long)
Len Feremans, Boris Cule, Celine Vens, and Bart Goethals
Expert Estimates for Feature Relevance are Imperfect (regular)
Patrick M. De Boer, Marcel C. Bühler, and Abraham Bernstein
Multi-label Learning with Label-Specific Features Via Clustering Ensemble (regular)
Wang Zhan and Min-Ling Zhang
Customizing Travel Packages with Interactive Composite Items (regular)
Manish Singh, Ria Mae Borromeo, Anas Hosami, Sihem Amer-Yahia, and Shady Elbassuoni

Estimating Dependency and Dimensions   (Room B [Silver12])

Chair: Ying Li
Latent Dimensionality Estimation for Probabilistic Canonical Correlation Analysis Using Normalized Maximum Likelihood Code-Length (long)
Tomohiko Nakamura, Tomoharu Iwata, and Kenji Yamanishi
A Novel Approach for Estimating Multiple Sparse Precision Matrices Using l0,0 Regularization (long)
Duy Nhat Phan and Hoai An Le Thi
Copula-Based High Dimensional Cross-Market Dependence Modeling (regular)
Jia Xu, Wei Wei, and Longbing Cao
Causal Patterns: Extraction of Multiple Causal Relationships by Mixture of Probabilistic Partial Canonical Correlation Analysis (regular)
Hiroki Mori, Keisuke Kawano, and Hiroki Yokoyama

Data and Information Quality (DIQ)   (Room C [Momiji/Sumire/Shoubu])

Chair: Rong Duan
SECODA: Segmentation- and Combination-Based Detection of Anomalies
Ralph Foorthuis
Extended Methods to Handle Classification Biases
Emma Beauxis-Aussalet and Lynda Hardman
Toward Optimal Streaming Feature Selection
Noura Al Nuaimi and Mohammad M. Masud