Abstract is missing.
- Cognitive computing: From breakthroughs in the lab to applications on the fieldGuruduth S. Banavar. 1 [doi]
- Harnessing the data revolution: A perspective from the national science foundationChaitanya K. Baru. 2 [doi]
- Big data security and privacyElisa Bertino. 3 [doi]
- On the power of big data: Mining structures from massive, unstructured text dataJiawei Han. 4 [doi]
- Leveraging high performance computing to drive advanced manufacturing R&D at the US department of energyMark Johnson. 5-6 [doi]
- Database decay and how to avoid itMichael Stonebraker, Dong Deng, Michael L. Brodie. 7-16 [doi]
- Cache-oblivious loops based on a novel space-filling curveChristian Böhm, Martin Perdacher, Claudia Plant. 17-26 [doi]
- DD-Rtree: A dynamic distributed data structure for efficient data distribution among cluster nodes for spatial data mining algorithmsJagat Sesh Challa, Poonam Goyal, S. Nikhil, Aditya Mangla, Sundar Balasubramaniam, Navneet Goyal. 27-36 [doi]
- A meta-graph approach to analyze subgraph-centric distributed programming modelsRavikant Dindokar, Neel Choudhury, Yogesh Simmhan. 37-47 [doi]
- Exact structure learning of Bayesian networks by optimal path extensionSubhadeep Karan, Jaroslaw Zola. 48-55 [doi]
- Datalography: Scaling datalog graph analytics on graph processing systemsWalaa Eldin Moustafa, Vicky Papavasileiou, Ken Yocum, Alin Deutsch. 56-65 [doi]
- Predicting statistics of asynchronous SGD parameters for a large-scale distributed deep learning system on GPU supercomputersYosuke Oyama, Akihiro Nomura, Ikuro Sato, Hiroki Nishimura, Yukimasa Tamatsu, Satoshi Matsuoka. 66-75 [doi]
- Consensus optimization with delayed and stochastic gradients on decentralized networksBenjamin Sirb, Xiaojing Ye. 76-85 [doi]
- Pairwise topic model and its application to topic transition and evolutionXiaoli Song, Yan Rui, Xiaohua Hu. 86-95 [doi]
- Interpretable and effective opinion spam detection via temporal patterns mining across websitesYuan Yuan, Sihong Xie, Chun-Ta Lu, Jie Tang, Philip S. Yu. 96-105 [doi]
- A fast structured regression for large networksFang Zhou, Mohamed F. Ghalwash, Zoran Obradovic. 106-115 [doi]
- Antecedents of big data quality: An empirical examination in financial service organizationsAdiska Fardani Haryadi, Joris Hulstijn, Agung Wahyudi, Haiko Van Der Voort, Marijn Janssen. 116-121 [doi]
- PSH: A probabilistic signature hash method with hash neighborhood candidate generation for fast edit-distance string comparison on big dataJoseph Jupin, Justin Y. Shi, Eduard C. Dragut. 122-127 [doi]
- Efficient multiple scale kernel classifiersRocco Langone, Johan A. K. Suykens. 128-133 [doi]
- A theoretical model for n-gram distribution in big data corporaJoaquim F. Silva, Carlos Gonçalves, José C. Cunha. 134-141 [doi]
- The self-avoiding walk-jump (SAWJ) algorithm for finding maximum degree nodes in large graphsJonathan Stokes, Steven Weber. 142-149 [doi]
- Semantic pattern mining for text miningXiaoli Song, Xiaotong Wang, Xiaohua Hu. 150-155 [doi]
- Detecting gradual changes from data stream using MDL-change statisticsKenji Yamanishi, Kohei Miyaguchi. 156-163 [doi]
- Exploiting temporal divergence of topic distributions for event detectionRongda Zhu, Aston Zhang, Jian Peng 0001, ChengXiang Zhai. 164-171 [doi]
- Thrill: High-performance algorithmic distributed batch data processing with C++Timo Bingmann, Michael Axtmann, Emanuel Jöbstl, Sebastian Lamm, Huyen Chau Nguyen, Alexander Noe, Sebastian Schlag, Matthias Stumpp, Tobias Sturm, Peter Sanders 0001. 172-183 [doi]
- Towards resource-efficient cloud systems: Avoiding over-provisioning in demand-prediction based resource provisioningLiuhua Chen, Haiying Shen. 184-193 [doi]
- Mix 'n' match multi-engine analyticsKaterina Doka, Nikolaos Papailiou, Victor Giannakouris, Dimitrios Tsoumakos, Nectarios Koziris. 194-203 [doi]
- Matrix factorizations at scale: A comparison of scientific data analytics in spark and C+MPI using three case studiesAlex Gittens, Aditya Devarakonda, Evan Racah, Michael T. Ringenburg, Lisa Gerhardt, Jey Kottalam, Jialin Liu, Kristyn J. Maschhoff, Shane Canon, Jatin Chhugani, Pramod Sharma, Jiyan Yang, James Demmel, Jim Harrell, Venkat Krishnamurthy, Michael W. Mahoney, Prabhat. 204-213 [doi]
- YinMem: A distributed parallel indexed in-memory computation system for large scale data analyticsYin Huang, Yelena Yesha, Milton Halem, Yaacov Yesha, Shujia Zhou. 214-222 [doi]
- Efficient data access strategies for Hadoop and Spark on HPC cluster with heterogeneous storageNusrat Sharmin Islam, Md. Wasi-ur-Rahman, Xiaoyi Lu, Dhabaleswar K. Panda. 223-232 [doi]
- Comparing application performance on HPC-based Hadoop platforms with local storage and dedicated storageZhuozhao Li, Haiying Shen, Jeffrey Denton, Walter Ligon. 233-242 [doi]
- CCRP: Customized cooperative resource provisioning for high resource utilization in cloudsJinwei Liu, Haiying Shen, Husnu S. Narman. 243-252 [doi]
- High-performance design of apache spark with RDMA and its benefits on various workloadsXiaoyi Lu, Dipti Shankar, Shashank Gugnani, Dhabaleswar K. Panda. 253-262 [doi]
- A low-load stream processing scheme for IoT environmentsTomoki Yoshihisa, Takahiro Hara. 263-272 [doi]
- Spark-GPU: An accelerated in-memory data processing engine on clustersYuan Yuan, Meisam Fathi Salmi, Yin Huai, Kaibo Wang, Rubao Lee, Xiaodong Zhang. 273-283 [doi]
- Argo: Architecture-aware graph partitioningAngen Zheng, Alexandros Labrinidis, Panos K. Chrysanthis, Jack Lange. 284-293 [doi]
- Adapting to data sparsity for efficient parallel PARAFAC tensor decomposition in HadoopKareem S. Aggour, Bülent Yener. 294-301 [doi]
- Cloud Kotta: Enabling secure and scalable data analytics in the cloudYadu N. Babuji, Kyle Chard, Aaron Gerow, Eamon Duede. 302-310 [doi]
- Entity resolution acceleration using the automata processorChunkun Bo, Ke Wang, Jeffrey J. Fox, Kevin Skadron. 311-318 [doi]
- I'll take that to go: Big data bags and minimal identifiers for exchange of large, complex datasetsKyle Chard, Mike D'Arcy, Benjamin D. Heavner, Ian T. Foster, Carl Kesselman, Ravi K. Madduri, Alexis Rodriguez, Stian Soiland-Reyes, Carole A. Goble, Kristi Clark, Eric W. Deutsch, Ivo D. Dinov, Nathan D. Price, Arthur W. Toga. 319-328 [doi]
- Massive parallelism for non-linear and non-stationary data analysis with GPGPUChun-Chieh Chen, Chih-Ya Shen, Ming-Syan Chen. 329-334 [doi]
- Big data framework interference in restricted private cloud settingsStratos Dimopoulos, Chandra Krintz, Rich Wolski. 335-340 [doi]
- Evaluating the impact of data placement to spark and SciDB with an Earth Science use caseKhoa Doan, Amidu O. Oloso, Kwo-Sen Kuo, Thomas L. Clune, Hongfeng Yu, Brian Nelson, Jian Zhang. 341-346 [doi]
- Java thread and process performance for parallel machine learning on multicore HPC clustersSaliya Ekanayake, Supun Kamburugamuve, Pulasthi Wickramasinghe, Geoffrey C. Fox. 347-354 [doi]
- Power efficient big data analytics algorithms through low-level operationsGheorghi Guzun, Josiah C. McClurg, Guadalupe Canahuate, Raghuraman Mudumbai. 355-361 [doi]
- Evaluating the impacts of code-level performance tunings on power efficiencySatoshi Imamura, Keitaro Oka, Yuichiro Yasui, Yuichi Inadomi, Katsuki Fujisawa, Toshio Endo, Koji Ueno, Keiichiro Fukazawa, Nozomi Hata, Yuta Kakibuka, Koji Inoue, Takatsugu Ono. 362-369 [doi]
- RADU: Bridging the divide between data and infrastructure management to support data-driven collaborationsFan Jiang, Claris Castillo, Charles Schmitt. 370-377 [doi]
- A comparison of general-purpose distributed systems for data processingJinfeng Li, James Cheng, Yunjian Zhao, Fan Yang, Yuzhen Huang, Haipeng Chen, Ruihao Zhao. 378-383 [doi]
- A popularity-aware cost-effective replication scheme for high data durability in cloud storageJinwei Liu, Haiying Shen. 384-389 [doi]
- Managing hot metadata for scientific workflows on multisite cloudsLuis Pineda-Morales, Ji Liu 0003, Alexandru Costan, Esther Pacitti, Gabriel Antoniu, Patrick Valduriez, Marta Mattoso. 390-397 [doi]
- I/O chunking and latency hiding approach for out-of-core sorting acceleration using GPU and flash NVMHitoshi Sato, Ryo Mizote, Satoshi Matsuoka, Hirotaka Ogawa. 398-403 [doi]
- Boldio: A hybrid and resilient burst-buffer over lustre for accelerating big data I/ODipti Shankar, Xiaoyi Lu, Dhabaleswar K. Panda. 404-409 [doi]
- Real time processing of streaming and static informationChristoforos Svingos, Theofilos Mailis, Herald Kllapi, Lefteris Stamatogiannakis, Yannis Kotidis, Yannis E. Ioannidis. 410-415 [doi]
- HPTA: High-performance text analyticsHans Vandierendonck, Karen L. Murphy, Mahwish Arif, Dimitrios S. Nikolopoulos. 416-423 [doi]
- Performance evaluation of big data frameworks for large-scale data analyticsJorge Veiga, Roberto R. Expósito, Xoan C. Pardo, Guillermo L. Taboada, Juan Tourifio. 424-431 [doi]
- SLA-based profit optimization for resource management of big data analytics-as-a-service platforms in cloud computing environmentsYali Zhao, Rodrigo N. Calheiros, James Bailey, Richard O. Sinnott. 432-441 [doi]
- Materialized view selection in feed following systemsKaiji Chen, Yongluan Zhou. 442-451 [doi]
- MuSQLE: Distributed SQL query execution over multiple engine environmentsVictor Giannakouris, Nikolaos Papailiou, Dimitrios Tsoumakos, Nectarios Koziris. 452-461 [doi]
- Sampling-based distributed Kernel mean matching using sparkAhsanul Haque, Zhuoyi Wang, Swarup Chandra, Yupeng Gao, Latifur Khan, Charu Aggarwal. 462-471 [doi]
- Clockwise compression for trajectory data under road network constraintsYudian Ji, Yuda Zang, Wuman Luo, Xibo Zhou, Ye Ding, Lionel M. Ni. 472-481 [doi]
- Semantic approach to automating management of big data privacy policiesKaruna P. Joshi, Aditi Gupta 0003, Sudip Mittal, Claudia Pearce, Anupam Joshi, Tim Finin. 482-491 [doi]
- Handling uncertainty in trajectories of moving objects in unconstrained outdoor spacesEleazar Leal, Le Gruenwald, Jianting Zhang. 492-501 [doi]
- Accelerating range queries for large-scale unstructured meshesCuong Nguyen, Philip J. Rhodes. 502-511 [doi]
- In pursuit of outliers in multi-dimensional data streamsMd. Shiblee Sadik, Le Gruenwald, Eleazar Leal. 512-521 [doi]
- WISDOM: Weighted incremental spatio-temporal multi-task learning via tensor decompositionJianpeng Xu, Jiayu Zhou, Pang-Ning Tan, Xi Liu, Lifeng Luo. 522-531 [doi]
- Advantage of integration in big data: Feature generation in multi-relational databases for imbalanced learningFarrukh Ahmed, Michele Samorani, Colin Bellinger, Osmar R. Zaïane. 532-539 [doi]
- Sampling labelled profile data for identity resolutionMatthew John Edwards, Stephen Wattam, Paul Rayson, Awais Rashid. 540-547 [doi]
- Pick your choice in HBase: Security or performanceFrank Pallas, Johannes Gunther, David Bermbach. 548-554 [doi]
- BDTUne: Hierarchical correlation-based performance analysis and rule-based diagnosis for big data systemsRui Ren, Zhen Jia, Lei Wang, Jianfeng Zhan, Tianxu Yi. 555-562 [doi]
- Transfer learning algorithms for autonomous reconfiguration of wearable systemsRamyar Saeedi, Hassan Ghasemzadeh, Assefaw H. Gebremedhin. 563-569 [doi]
- Efficient processing of top-k joins in MapReduceMei Saouk, Christos Doulkeridis, Akrivi Vlachou, Kjetil Nørvåg. 570-577 [doi]
- Object identification with Pay-As-You-Go crowdsourcingTing Wu, Chen Jason Zhang, Lei Chen 0002, Pan Hui, Siyuan Liu. 578-585 [doi]
- Estimation of local subgraph countsNesreen K. Ahmed, Theodore L. Willke, Ryan A. Rossi. 586-595 [doi]
- Multi-step threshold algorithm for efficient feature-based query processing in large-scale multimedia databasesChristian Beecks, Alexander Gras. 596-605 [doi]
- PRIIME: A generic framework for interactive personalized interesting pattern discoveryMansurul Alam Bhuiyan, Mohammad Al Hasan. 606-615 [doi]
- Labeling actors in multi-view social networks by integrating information from within and across multiple viewsNgot Bui, Thanh Le, Vasant Honavar. 616-625 [doi]
- Online social network evolution: Revisiting the Twitter graphHariton Efstathiades, Demetris Antoniades, George Pallis, Marios D. Dikaiakos, Zoltán Szlávik, Robert-Jan Sips. 626-635 [doi]
- Parallel top-k subgraph query in massive graphs: Computing from the perspective of single vertexJianliang Gao, Bo Song, Ping Liu, Weimao Ke, Jianxin Wang, Xiaohua Hu. 636-645 [doi]
- REQUEST: A scalable framework for interactive construction of exploratory queriesXiaoyu Ge, Yanbing Xue, Zhipeng Luo, Mohamed A. Sharaf, Panos K. Chrysanthis. 646-655 [doi]
- Dynamic feature generation and selection on heterogeneous graph for music recommendationChun Guo 0001, Xiaozhong Liu. 656-665 [doi]
- An adaptive information-theoretic approach for identifying temporal correlations in big data setsNguyen Ho, Huy Vo, Mai Vu. 666-675 [doi]
- Towards unsupervised home location inference from online social mediaChao Huang, Dong Wang, Shenglong Zhu, Daniel Yue Zhang. 676-685 [doi]
- Improved methods for static index pruningWei Jiang, Juan Rodriguez, Torsten Suel. 686-695 [doi]
- Parallel computation of k-nearest neighbor joins using MapReduceWooyeol Kim, Younghoon Kim, Kyuseok Shim. 696-705 [doi]
- Harnessing relationships for domain-specific subgraph extraction: A recommendation use caseSarasi Lalithsena, Pavan Kapanipathi, Amit P. Sheth. 706-715 [doi]
- Scalable link community detection: A local dispersion-aware approachAlex Delis, Alexandros Ntoulas, Panagiotis Liakos. 716-725 [doi]
- Outlier detection via sampling ensembleHongfu Liu, Yuchao Zhang, Bo Deng, Yun Fu. 726-735 [doi]
- Random surfing on multipartite graphsAthanasios N. Nikolakopoulos, Antonia Korba, John D. Garofalakis. 736-745 [doi]
- An active learning method for data streams with concept driftCheong Hee Park, Youngsoon Kang. 746-752 [doi]
- Adaptive neuron apoptosis for accelerating deep learning on large scale systemsCharles Siegel, Jeff Daily, Abhinav Vishnu. 753-762 [doi]
- DeltaSherlock: Identifying changes in the cloudAta Turk, Hao Chen, Anthony Byrne, John Knollmeyer, Sastry S. Duri, Canturk Isci, Ayse Kivilcim Coskun. 763-772 [doi]
- Community detection with partially observable links and node attributesXiaokai Wei, Bokai Cao, Weixiang Shao, Chun-Ta Lu, Philip S. Yu. 773-782 [doi]
- Parallel gathering discovery over big trajectory dataYongyi Xian, Yan Liu, Chuanfei Xu. 783-792 [doi]
- CER: Complementary entity recognition via knowledge expansion on large unlabeled product reviewsHu Xu, Sihong Xie, Lei Shu, Philip S. Yu. 793-802 [doi]
- HEER: Heterogeneous graph embedding for emerging relation detection from newsJingyuan Zhang, Chun-Ta Lu, Mianwei Zhou, Sihong Xie, Yi Chang, Philip S. Yu. 803-812 [doi]
- Efficient triangle listing for billion-scale graphsHao Zhang, Yuanyuan Zhu, Lu Qin, Hong Cheng, Jeffrey Xu Yu. 813-822 [doi]
- Towards understanding word embeddings: Automatically explaining similarity of termsYating Zhang, Adam Jatowt, Katsumi Tanaka. 823-832 [doi]
- Predicting taxi demand at high spatial resolution: Approaching the limit of predictabilityKai Zhao, Denis Khryashchev, Juliana Freire, Cláudio T. Silva, Huy T. Vo. 833-842 [doi]
- TelcoFlow: Visual exploration of collective behaviors based on telco dataYixian Zheng, Wenchao Wu, Haipeng Zeng, Nan Cao, Huamin Qu, Mingxuan Yuan, Jia Zeng, Lionel M. Ni. 843-852 [doi]
- Distributed and parallel high utility sequential pattern miningMorteza ZiHayat, Zane Zhenhua Hu, Aijun An, Yonggang Hut. 853-862 [doi]
- Improving efficiency of maximizing spread in the flow authority model for large sparse networksPhilip K. Chan, Ebad Ahmadzadeh. 863-868 [doi]
- Semi-supervised Dirichlet-Hawkes process with applications of topic detection and tracking in TwitterWanying Ding, Yue Zhang, Chaomei Chen, Xiaohua Hu. 869-874 [doi]
- Effective and efficient graph augmentation in large graphsIoanna Filippidou, Yannis Kotidis. 875-880 [doi]
- Fast nearest neighbor search through sparse random projections and votingVille Hyvönen, Teemu Pitkänen, Sotiris K. Tasoulis, Elias Jaasaari, Risto Tuomainen, Liang Wang, Jukka Corander, Teemu Roos. 881-888 [doi]
- Summarizing big graphs by means of pseudo-boolean constraintsSaïd Jabbour, Nizar Mhadhbi, Abdesattar Mhadhbi, Badran Raddaoui, Lakhdar Sais. 889-894 [doi]
- Big data on a few pixelsUwe Jugel, Zbigniew Jerzak, Volker Markl. 895-900 [doi]
- Shape matching using skeleton context for automated bow echo detectionMohammad Mahdi Kamani, Farshid Farhat, Stephen Wistar, James Z. Wang. 901-908 [doi]
- Scalability analysis of distributed search in large peer-to-peer networksWeimao Ke, Javed Mostafa. 909-914 [doi]
- VHT: Vertical hoeffding treeNicolas Kourtellis, Gianmarco De Francisci Morales, Albert Bifet, Arinto Murdopo. 915-922 [doi]
- Compressed learning for time series classificationYuh-Jye Lee, Hsing-Kuo Pao, Shueh-Han Shih, Jing-Yao Lin, Xin-Rong Chen. 923-930 [doi]
- Connection discovery using shared images by Gaussian relational topic modelXiaopeng Li, Ming Cheung, James She. 931-936 [doi]
- Inferring restaurant styles by mining crowd sourced photos from user-review websitesHaofu Liao, Yucheng Li, Tianran Hu, Jiebo Luo. 937-944 [doi]
- Multiple submodels parallel support vector machine on sparkChang Liu, Bin Wu, Yi Yang, Zhihong Guo. 945-950 [doi]
- What makes a group fail: Modeling social group behavior in event-based social networksXiang Liu, Torsten Suel. 951-956 [doi]
- Efficient large scale near-duplicate video detection base on sparkJinna Lv, Bin Wu, Shuai Yang, Bingjing Jia, Peigang Qiu. 957-962 [doi]
- Context-aware point of interest recommendation using tensor factorizationStathis Maroulis, Ioannis Boutsis, Vana Kalogeraki. 963-968 [doi]
- Persistent cascades: Measuring fundamental communication structure in social networksSteven Morse, Marta C. Gonzalez, Natasha Markuzon. 969-975 [doi]
- TruthCore: Non-parametric estimation of truth from a collection of authoritative sourcesTathagata Mukherjee, Biswas Parajuli, Piyush Kumar, Eduardo Pasiliao. 976-983 [doi]
- Efficient index updates for mixed update and query loadsSergey Nepomnyachiy, Torsten Suel. 984-991 [doi]
- Compartmentalized adaptive topic mining on social media streamsGopi Chand Nutakki, Olfa Nasraoui. 992-997 [doi]
- Computing triangle and open-wedge heavy-hitters in large networksAduri Pavan, Paul Quint, Stephen D. Scott, N. V. Vinodchandran, J. Smith. 998-1005 [doi]
- Addressing the big-earth-data variety challenge with the hierarchical triangular meshMichael L. Rilee, Kwo-Sen Kuo, Thomas L. Clune, Amidu Oloso, Paul G. Brown, Hongfeng Yu. 1006-1011 [doi]
- Online multi-view clustering with incomplete viewsWeixiang Shao, LiFang He, Chun-Ta Lu, Philip S. Yu. 1012-1017 [doi]
- Expenditure aware rating prediction for recommendationChuan Shi, Bowei He, Menghao Zhang, Fuzhen Zhuang, Philip S. Yu, Naiwang Guo. 1018-1025 [doi]
- Kernels for scalable data analysis in science: Towards an architecture-portable futureSreenivas R. Sukumar, Ramakrishnan Kannan, Seung-Hwan Lim, Michael A. Matheson. 1026-1031 [doi]
- Scalable dynamic graph summarizationIoanna Tsalouchidou, Gianmarco De Francisci Morales, Francesco Bonchi, Ricardo A. Baeza-Yates. 1032-1039 [doi]
- Extreme scale breadth-first search on supercomputersKoji Ueno, Toyotaro Suzumura, Naoya Maruyama, Katsuki Fujisawa, Satoshi Matsuoka. 1040-1047 [doi]
- Three-hop distance estimation in social graphsPascal Welke, Alexander Markowetz, Torsten Suel, Maria Christoforaki. 1048-1055 [doi]
- Incremental learning for matrix factorization in recommender systemsTong Yu, Ole J. Mengshoel, Alvin Jude, Eugen Feller, Julien Forgeat, Nimish Radia. 1056-1063 [doi]
- Parallel clustering method for non-disjoint partitioning of large-scale data based on spark frameworkAbir Zayani, Chiheb-Eddine Ben N'cir, Nadia Essoussi. 1064-1069 [doi]
- Point of interest recommendation with social and geographical influenceDa-Chuan Zhang, Mei Li, Chang-Dong Wang. 1070-1075 [doi]
- On robust truth discovery in sparse social media sensingDaniel Yue Zhang, Rungang Han, Dong Wang, Chao Huang. 1076-1081 [doi]
- On the feasibility of an embedded machine learning processor for intrusion detectionRajesh Sankaran, Ricardo A. Calix. 1082-1089 [doi]
- Android malware development on public malware scanning platforms: A large-scale data-driven studyHeqing Huang, Cong Zheng, Junyuan Zeng, Wu Zhou, Sencun Zhu, Peng Liu 0005, Suresh Chari, Ce Zhang. 1090-1099 [doi]
- Improving the utility in differential private histogram publishing: Theoretical study and practiceHui Li, JiangTao Cui, Xiaobin Lin, Jianfeng Ma. 1100-1109 [doi]
- DistSD: Distance-based social discovery with personalized posterior screeningXiao Pan, Jiawei Zhang, Fengjiao Wang, Philip S. Yu. 1110-1119 [doi]
- H2O: A hybrid and hierarchical outlier detection method for large scale data protectionQuan Zhang, Mu Qiao, Ramani R. Routray, Weisong Shi. 1120-1129 [doi]
- Scalable attack propagation model and algorithms for honeypot systemsAriel Bar, Bracha Shapira, Lior Rokach, Moshe Unger. 1130-1135 [doi]
- Local subspace-based outlier detection using global neighbourhoodsBas van Stein, Matthijs van Leeuwen, Thomas Bäck. 1136-1142 [doi]
- Protecting the location privacy of mobile social media usersShuo Wang, Richard O. Sinnott, Surya Nepal. 1143-1150 [doi]
- Enabling factor analysis on thousand-subject neuroimaging datasetsMichael J. Anderson, Mihai Capota, Javier S. Turek, Xia Zhu, Theodore L. Willke, Yida Wang, Po-Hsuan Chen, Jeremy R. Manning, Peter J. Ramadge, Kenneth A. Norman. 1151-1160 [doi]
- Shooting a moving target: Motion-prediction-based transmission for 360-degree videosYanan Bao, Huasen Wu, Tianxiao Zhang, Albara Ah Ramli, Xin Liu. 1161-1170 [doi]
- Lazer: Distributed memory-efficient assembly of large-scale genomesSayan Goswami, Arghya Kusum Das, Richard Platania, Kisung Lee, Seung-Jong Park. 1171-1181 [doi]
- Leveraging multi-granularity energy data for accurate energy demand forecast in smart gridsZhichuan Huang, Ting Zhu. 1182-1191 [doi]
- Learning large-scale plantation mapping from imperfect annotatorsXiaowei Jia, Ankush Khandelwal, James Gerber, Kimberly Carlson, Paul West, Vipin Kumar. 1192-1201 [doi]
- Ad allocation with secondary metricsDarja Krushevskaja, William Simpson, S. Muthukrishnan. 1202-1211 [doi]
- Embedding feature selection for large-scale hierarchical classificationAzad Naik, Huzefa Rangwala. 1212-1221 [doi]
- Network analysis for identifying and characterizing disease outbreak influence from voluminous epidemiology dataNaman Shah, Harshil Shah, Matthew Malensek, Sangmi Lee Pallickara, Shrideep Pallickara. 1222-1231 [doi]
- Scalable genomics: From raw data to aligned reads on Apache YARNFrancesco Versaci, Luca Pireddu, Gianluigi Zanetti. 1232-1241 [doi]
- Real-time full correlation matrix analysis of fMRI dataYida Wang, Bryn Keller, Mihai Capota, Michael J. Anderson, Narayanan Sundaram, Jonathan D. Cohen, Kai Li, Nicholas B. Turk-Browne, Theodore L. Willke. 1242-1251 [doi]
- When remote sensing data meet ubiquitous urban data: Fine-grained air quality inferenceYanan Xu, Yanmin Zhu. 1252-1261 [doi]
- Buyer targeting optimization: A unified customer segmentation perspectiveJingyuan Yang, Chuanren Liu, Mingfei Teng, March Liao, Hui Xiong. 1262-1271 [doi]
- Using machine learning to identify major shifts in human gut microbiome protein family abundance in diseaseMehrdad Yazdani, Bryn C. Taylor, Justine W. Debelius, Weizhong Li, Rob Knight, Larry Smarr. 1272-1280 [doi]
- Online inference for time-varying temporal dependency discovery from time seriesChunqiu Zeng, Qing Wang, Wentao Wang, Tao Li, Larisa Shwartz. 1281-1290 [doi]
- Automated IT system failure prediction: A deep learning approachKe Zhang, Jianwu Xu, Martin Renqiang Min, Guofei Jiang, Konstantinos Pelechrinis, Hui Zhang. 1291-1300 [doi]
- Estimating human interactions with electrical appliances for activity-based energy savings recommendationsHông-Ân Cao, Tri Kurniawan Wijaya, Karl Aberer, Nuno Nunes. 1301-1308 [doi]
- Scalable nearest neighbor based hierarchical change detection framework for crop monitoringZexi Chen, Ranga Raju Vatsavai, Bharathkumar Ramachandra, Qiang Zhang, Nagendra Singh, Sreenivas Sukumar. 1309-1314 [doi]
- Optimizing callout in unified ad marketsAman Gupta, S. Muthukrishnan, Smita Wadhwa. 1315-1321 [doi]
- Application-driven sensing data reconstruction and selection based on correlation mining and dynamic feedbackZhichuan Huang, Tiantian Xie, Ting Zhu, Jianwu Wang, Qingquan Zhang. 1322-1327 [doi]
- Identifying dynamic changes with noisy labels in spatial-temporal data: A study on large-scale water monitoring applicationXiaowei Jia, Xi C. Chen, Anuj Karpatne, Vipin Kumar. 1328-1333 [doi]
- A strategic approach for visualizing the value of big data (SAVV-BIGD) frameworkMike Lakoju, Alan Serrano. 1334-1339 [doi]
- A scalable approach for location-specific detection of Santa Ana conditionsMai H. Nguyen, Dylan Uys, Daniel Crawl, Charles Cowart, Ilkay Altintas. 1340-1345 [doi]
- Experiences with smart city traffic pilotSusanna Pirttikangas, Ekaterina Gilman, Xiang Su, Teemu Leppänen, Anja Keskinarkaus, Mika Rautiainen, Mikko Pyykkönen, Jukka Riekki. 1346-1352 [doi]
- How interesting images are: An atypicality approach for social networksElyas Sabeti, Anders Høst-Madsen. 1353-1358 [doi]
- Exploring memory hierarchy and network topology for runtime AMR data sharing across scientific applicationsWenzhao Zhang, Houjun Tang, Stephen Ranshous, Surendra Byna, Daniel F. Martin, Kesheng Wu, Bin Dong, Scott Klasky, Nagiza F. Samatova. 1359-1366 [doi]
- Pitfalls of long-term online controlled experimentsPavel Dmitriev, Brian Frasca, Somit Gupta, Ron Kohavi, Garnet Vaz. 1367-1376 [doi]
- An architecture for the deployment of statistical models for the big data eraJuergen Heit, Jiayi Liu, Mohak Shah. 1377-1384 [doi]
- Information retrieval, fusion, completion, and clustering for employee expertise estimationRaya Horesh, Kush R. Varshney, Jinfeng Yi. 1385-1393 [doi]
- Empirical evaluations of preprocessing parameters' impact on predictive coding's effectivenessRishi Chhatwal, Nathaniel Huber-Fliflet, Robert Keeling, Jianping Zhang, Haozhen Zhao. 1394-1401 [doi]
- LogProv: Logging events as provenance of big data analytics pipelines with trustworthinessRuoyu Wang, Daniel Sun, Guoqiang Li 0001, Muhammad Atif, Surya Nepal. 1402-1411 [doi]
- Pattern recognition and classification of HVAC rule-based faults in commercial buildingsBradford Littooy, Sophie Loire, Michael Georgescu, Igor Mezic. 1412-1421 [doi]
- Deep parallelization of parallel FP-growth using parent-child MapReduceAdetokunbo Makanju, Zahra Farzanyar, Aijun An, Nick Cercone, Zane Zhenhua Hu, Yonggang Hu. 1422-1431 [doi]
- The state of SQL-on-Hadoop in the cloudNicolás Poggi, Josep Lluis Berral, Thomas Fenech, David Carrera, José A. Blakeley, Umar Farooq Minhas, Nikola Vujic. 1432-1443 [doi]
- Detecting fraud, corruption, and collusion in international development contracts: The design of a proof-of-concept automated systemEmily Grace, Ankit Rai, Elissa M. Redmiles, Rayid Ghani. 1444-1453 [doi]
- Automatic generation of relational attributes: An application to product returnsMichele Samorani, Farrukh Ahmed, Osmar R. Zaïane. 1454-1463 [doi]
- Data-at-rest security for sparkSyed Yousaf Shah, Brent Paulovicks, Petros Zerfos. 1464-1473 [doi]
- Do we trust image measurements? Variability, accuracy and traceability of image featuresMylene Simon, Joe Chalfoun, Mary Brady, Peter Bajcsy. 1474-1482 [doi]
- Mini-apps for high performance data analysisSreenivas R. Sukumar, Michael A. Matheson, Ramakrishnan Kannan, Seung-Hwan Lim. 1483-1492 [doi]
- Predicting annual average daily highway traffic from large data and very few measurementsTomasz Tajmajer, Malwina Splawinska, Piotr Wasilewski, Stan Matwin. 1493-1501 [doi]
- Fast, lenient and accurate: Building personalized instant search experience at LinkedInGanesh Venkataraman, Abhimanyu Lad, Lin Guo, Shakti Sinha. 1502-1511 [doi]
- Diversifying trending topic discovery via Semidefinite ProgrammingHui Wu, Yi Fang, Huming Wu, Shenhong Zhu. 1512-1521 [doi]
- Storytelling in heterogeneous Twitter entity network based on hierarchical cluster routingXuchao Zhang, Zhiqian Chen, Weisheng Zhong, Arnold P. Boedihardjo, Chang-Tien Lu. 1522-1531 [doi]
- Quantifying skill relevance to job titlesWenjun Zhou, Yun Zhu, Faizan Javed, Mahmudur Rahman, Janani Balaji, Matt Mcnair. 1532-1541 [doi]
- SmartCache: Application layer caching to improve performance of large-scale memory mappingZhenyun Zhuang, Haricharan Ramachandra, Badri Sridharan, Brandon Duncan, Kishore Gopalakrishna, Jean-Francois Im. 1542-1550 [doi]
- Hidden Markov based anomaly detection for water supply systemsZahra Zohrevand, Uwe Glässer, Hamed Yaghoubi Shahir, Mohammad A. Tayebi, Robert Costanzo. 1551-1560 [doi]
- Advancing NLP via a distributed-messaging approachIlaria Bordino, Andrea Ferretti, Marco Firrincieli, Francesco Gullo, Marcello Paris, Stefano Pascolutti, Gianluca Sabena. 1561-1568 [doi]
- Automated port traffic statistics: From raw data to visualisationLuca Cazzanti, Antonio Davoli, Leonardo M. Millefiori. 1569-1573 [doi]
- UStore: An optimized storage system for enterprise data warehouses at UnionPayHongfeng Chai, Hao Liu, Xibo Zhou, Yanjun Xu, Shuo He, Jinzhi Hua, Dongjie He, Weihuai Liu. 1574-1578 [doi]
- Extensive large-scale study of error surfaces in sampling-based distinct value estimators for databasesVinay Deolalikar, Hernan Laffitte. 1579-1586 [doi]
- Forecasting squatting of demand in display advertisingAmita Gajewar, Lizhong Wu, Jignesh Parmar, Ramana Yerneni. 1587-1594 [doi]
- Data quality: Experiences and lessons from operationalizing big dataArchana Ganapathi, Yanpei Chen. 1595-1602 [doi]
- KDD meets Big DataNancy W. Grady. 1603-1608 [doi]
- Classification of massive mobile web log URLs for customer profiling & analyticsRajaraman Kanagasabai, Anitha Veeramani, Shangfeng Hu, Sangaralingam Kajanan, Giuseppe Manai. 1609-1614 [doi]
- Company recommendation for new graduates via implicit feedback multiple matrix factorization with Bayesian optimizationMasahiro Kazama, Issei Sato, Haruaki Yatabe, Tairiku Ogihara, Tetsuro Onishi, Hiroshi Nakagawa. 1615-1620 [doi]
- Human network usage patterns revealed by telecom dataYiming Kong, Hui Zang, Xiaoli Ma. 1621-1626 [doi]
- A distributed approach to estimating sea port operational regions from lots of AIS dataLeonardo M. Millefiori, Dimitrios Zissis, Luca Cazzanti, Gianfranco Arcieri. 1627-1632 [doi]
- Uniformization, organization, association and use of metadata from multiple content providers and manufacturers: A close look at the Building Automation System (BAS) sectorThibaud Nesztler, Don Kasper, Michael Georgescu, Sophie Loire, Igor Mezic. 1633-1638 [doi]
- QED: Groupon's ETL management and curated feature catalog system for machine learningDerrick C. Spell, Ling-Yong Wang, Richard T. Shomer, Bahador Nooraei, Jarrell Waggoner, Xiao Han T. Zeng, Jae Young Chung, Kai-Chen Cheng, Daniel Kirsche. 1639-1646 [doi]
- Big-data-driven anomaly detection in industry (4.0): An approach and a case studyLjiljana Stojanovic, Marko Dinic, Nenad Stojanovic, Aleksandar Stojadinovic. 1647-1652 [doi]
- Cross-modal event summarization: A network of networks approachJieJun Xu, Samuel D. Johnson, Kang-Yu Ni. 1653-1657 [doi]
- Managing a complicated workflow based on dataflow-based workflow schedulerTeruyoshi Zenmyo, Satoshi Iijima, Ichiro Fukuda. 1658-1663 [doi]
- An edge-set based large scale graph processing systemLi Zhou, Yinglong Xia, Hui Zang, Jian Xu, Mingzhen Xia. 1664-1669 [doi]
- Event detection from social network streams using frequent pattern mining with dynamic support valuesNora Alkhamees, Maria Fasli. 1670-1679 [doi]
- Big data analytics in cloud gaming: Players' patterns recognition using artificial neural networksVictor Perazzolo Barros, Pollyana Notargiacomo. 1680-1689 [doi]
- MapReduce-based deep learning with handwritten digit recognition case studyNada Basit, Yutong Zhang, Hao Wu, Haoran Liu, Jieming Bin, Yijun He, Abdeltawab M. Hendawi. 1690-1699 [doi]
- Text mining and sentiment extraction in central bank documentsGiuseppe Bruno. 1700-1708 [doi]
- To link or not to link: Ranking hyperlinks in Wikipedia using collective attentionPhilip Thruesen, Jaroslav Cechak, Blandine Seznec, Roel Castalio, Nattiya Kanhabua. 1709-1718 [doi]
- An overview of studies about students' performance analysis and learning analytics in MOOCsIsmail Duru, Gulustan Dogan, Banu Diri. 1719-1723 [doi]
- Smart online vehicle tracking system for security applicationsBrahim Hnich, Faisal R. Al-Osaimi, Ata Sasmaz, Ozkan Sayin, Amine Lamine, Majid Alotaibi. 1724-1733 [doi]
- An optimized frequent pattern mining algorithm with multiple minimum supportsHsiao-Wei Hu, Hao-Chen Chang, Wen-Shiu Lin. 1734-1741 [doi]
- Improving item-based recommendation accuracy with user's preferences on Apache MahoutAmmar Jabakji, Hasan Dag. 1742-1749 [doi]
- Change detection and classification of digital collectionsSampath Jayarathna, Faryaneh Poursardar. 1750-1759 [doi]
- A feature selection method based on Lorentzian metricYerzhan Kerimbekov, Hasan Sakir Bilge. 1760-1767 [doi]
- A survey on semantic Web and big data technologies for social network analysisSercan Kulcu, Erdogan Dogdu, A. Murat Ozbayoglu. 1768-1777 [doi]
- Table classification using both structure and content information: A case study of financial documentsQuanzhi Li, Sameena Shah, Rui Fang. 1778-1783 [doi]
- Patient-record level integration of de-identified healthcare big databasesXiao Li, Reza Sharifi Sedeh, Liao Wang, Yang Yang. 1784-1786 [doi]
- A Bayesian predictor of airline class seats based on multinomial event modelBingchuan Liu, Yudong Tan, Huimin Zhou. 1787-1791 [doi]
- Identifying trolls and determining terror awareness level in social networks using a scalable frameworkBusra Mutlu, Merve Mutlu, Kasim Oztoprak, Erdogan Dogdu. 1792-1798 [doi]
- DelayRadar: A multivariate predictive model for transit systemsAparna Oruganti, Fangzhou Sun, Hiba Baroud, Abhishek Dubey. 1799-1806 [doi]
- A real-time autonomous highway accident detection model based on big data processing and computational intelligenceA. Murat Ozbayoglu, Yusuf Gökhan Küçükayan, Erdogan Dogdu. 1807-1813 [doi]
- Subgroup discovery on big data: Pruning the search space on exhaustive search algorithmsFrancisco Padillo, José María Luna, Sebastián Ventura. 1814-1823 [doi]
- The difference-of-datasets framework: A statistical method to discover insightPaul Raff, Ze Jin. 1824-1831 [doi]
- Online trajectory segmentation and summary with applications to visualization and retrievalYehezkel S. Resheff. 1832-1840 [doi]
- Skeleton decomposition analysis for subspace clusteringAli Sekmen, Akram Aldroubi, Ahmet Bugra Koku. 1841-1848 [doi]
- An extended IoT framework with semantics, big data, and analyticsOmer Berat Sezer, Erdogan Dogdu, A. Murat Ozbayoglu, Aras Onal. 1849-1856 [doi]
- Event segmentation using MapReduce based big data clusteringM. Omair Shafiq. 1857-1866 [doi]
- User and entity behavior analytics for enterprise securityMadhu Shashanka, Min-Yi Shen, Jisheng Wang. 1867-1874 [doi]
- Swarm Intelligence (SI) based profiling and scheduling of big data applicationsThamarai Selvi Somasundaram, Kannan Govindarajan, Vivekanandan Suresh Kumar. 1875-1880 [doi]
- Improving clustering efficiency by SimHash-based K-Means algorithm for big data analyticsJenq-Haur Wang, Jia-Zhi Lin. 1881-1888 [doi]
- The effect of pets on happiness: A data-driven approach via large-scale social mediaYuchen Wu, Jianbo Yuan, Quanzeng You, Jiebo Luo. 1889-1894 [doi]
- Intelligent authorship identification with using Turkish newspapers metadataOzlem Yavanoglu. 1895-1900 [doi]
- Solving cold-start problem in large-scale recommendation engines: A deep learning approachJianbo Yuan, Walid Shalaby, Mohammed Korayem, David Lin, Khalifeh AlJadda, Jiebo Luo. 1901-1910 [doi]
- Urban human mobility data mining: An overviewKai Zhao, Sasu Tarkoma, Siyuan Liu, Huy T. Vo. 1911-1920 [doi]
- Fine-grained mining of illicit drug use patterns using social multimedia data from instagramYiheng Zhou, Numair Sani, Jiebo Luo. 1921-1930 [doi]
- Research on the big data system of massive open online courseZhenwei Du, Haopeng Chen, Jian-wei Jiang. 1931-1936 [doi]
- Clinical named entity recognition: Challenges and opportunitiesSrinivasa Rao Kundeti, J. Vijayananda, Srikanth Mujjiga, M. Kalyan. 1937-1945 [doi]
- Very fast frequent itemset mining: Simplicial complex methods (Extended abstract)Tsau Young Lin. 1946-1949 [doi]
- Online anomaly detection using non-parametric technique for big data streams in cloud collaborative environmentG. S. Smrithy, Sathyan Munirathinam, Ramadoss Balakrishnan. 1950-1955 [doi]
- A proposal of a privacy-preserving questionnaire by non-deterministic information and its analysisShusaku Tsumoto, Michinori Nakata, Hiroshi Sakai, Chenxi Liu. 1956-1965 [doi]
- Prediction of Indian election using sentiment analysis on Hindi TwitterParul Sharma, Teng-Sheng Moh. 1966-1971 [doi]
- Construction of clinical pathway from histories of clinical actions in hospital information systemShusaku Tsumoto, Shoji Hirano, Haruko Iwata. 1972-1981 [doi]
- Mining process for improvement of clinical process qualityShusaku Tsumoto, Shoji Hirano, Haruko Iwata, Norio Yoshimoto, Tomohiro Kimura. 1982-1990 [doi]
- Multi-layer text classification with voting for consumer reviewsYan Zhu, Melody Moh, Teng-Sheng Moh. 1991-1999 [doi]
- SCEM: Smart & effective crowd management with a novel scheme of big data analyticsShakti Awaghad. 2000-2003 [doi]
- A system and architecture for reusable abstractions of manufacturing processesAlexander Brodsky, Mohan Krishnamoorthy, William Z. Bernstein, M. Omar Nachawati. 2004-2013 [doi]
- Evaluation of a PMML-based GPR scoring engine on a cloud platform and microcomputer board for smart manufacturingMax Ferguson, Kincho H. Law, Raunak Bhinge, David Dornfeld, Jinkyoo Park, Yung-Tsun Tina Lee. 2014-2023 [doi]
- Predicting rare failure events using classification trees on large scale manufacturing data with complex interactionsJeff Hebert. 2024-2028 [doi]
- Using big data to enhance the bosch production line performance: A Kaggle challengeAnkita Mangal, Nishant Kumar. 2029-2035 [doi]
- Bayesian optimization for predicting rare internal failures in manufacturing processesAbhinav Maurya. 2036-2045 [doi]
- Machine learning, linear and Bayesian models for logistic regression in failure detection problemsBohdan Pavlyshenko. 2046-2050 [doi]
- Convergence and divergence in academic and industrial interests on IOT based manufacturingSrinivasan Radhakrishnan, Sagar Kamarthi. 2051-2056 [doi]
- Complexity-entropy feature plane for gear fault detectionSrinivasan Radhakrishnan, Sagar Kamarthi. 2057-2061 [doi]
- Cloud-based machine learning for predictive analytics: Tool wear prediction in millingDazhong Wu, Connor Jennings, Janis P. Terpenny, Soundar Kumara. 2062-2069 [doi]
- Predict failures in production lines: A two-stage approach with clustering and supervised learningDarui Zhang, Bin Xu, Jasmine Wood. 2070-2074 [doi]
- Holistic disaster recovery approach for big data NoSQL workloadsAharon Abadi, Ashraf Haib, Roie Melamed, Alaa Nassar, Aidan Shribman, Hisham Yasin. 2075-2080 [doi]
- Data-driven cloud-based IT services performance forecastingGenady Ya. Grabarnik, Mauro Tortonesi, Larisa Shwartz. 2081-2086 [doi]
- On-demand data analytics in HPC environments at leadership computing facilities: Challenges and experiencesJohn Harney, Seung-Hwan Lim, Sreenivas S. Sukumar, Dale Stansberry, Peter Xenopoulos. 2087-2096 [doi]
- Intercloud brokerages based on PLS method for deploying infrastructures for big data analyticsKatsunori Miura, Tazro Ohta, Courtney Powell, Masaharu Munetomo. 2097-2102 [doi]
- Motivating dynamic features for resolution time estimation within IT operations managementKayhan Moharreri, Jayashree Ramanathan, Rajiv Ramnath. 2103-2108 [doi]
- Identifying performance bottlenecks in Hive: Use of processor countersAlexander C. Shulyak, Lizy K. John. 2109-2114 [doi]
- Leveraging large sensor streams for robust cloud controlAlok Singh, Eric G. Stephan, Todd Elsethagen, Matt Macduff, Bibi Raju, Malachi Schram, Kerstin Kleese van Dam, Darren J. Kerbyson, Ilkay Altintas. 2115-2120 [doi]
- Fine-grained power analysis of emerging graph processing workloads for cloud operations managementShuang Song, Xinnian Zheng, Andreas Gerstlauer, Lizy K. John. 2121-2126 [doi]
- Open big data infrastructures to everyoneKonstantinos Tsakalozos, Cory Johns, Kevin Monroe, Pete VanderGiessen, Andrew Mcleod, Antonio Rosales. 2127-2129 [doi]
- Spatial-crowd: A big data framework for efficient data visualizationShahbaz Atta, Bilal Sadiq, Akhlaq Ahmad, Sheikh Nasir Saeed, Emad Felemban. 2130-2138 [doi]
- Multi-scalar analysis of geospatial agricultural data for sustainabilityAnne M. Denton, Mostofa Ahsan, David W. Franzen, John Nowatzki. 2139-2146 [doi]
- A framework for evaluating urban land use mix from crowd-sourcing dataLuciano Gervasoni, Martí Bosch, Serge Fenet, Peter Sturm. 2147-2156 [doi]
- Crowdsensing and analyzing micro-event tweets for public transportation insightsThong Hoang, Pei Hua Cher, Philips Kokoh Prasetyo, Ee-Peng Lim. 2157-2166 [doi]
- A study for understanding of tourist person trip pattern based on log data of Wi-Fi access pointsYu Ichifuji, Yoshihide Matsuo, Noriaki Koide, Nobuhiro Akashi, Yoshitaka Terai, Toru Kobayashi. 2167-2174 [doi]
- Estimation of national tourism statistics based on Wi-Fi association log dataNoriaki Koide, Yu Ichifuji, Hideki Yoshii, Noboru Sonehara. 2175-2179 [doi]
- Peer-to-peer microlending platforms: Characterization of online traitsGaurav Paruthi, Enrique Frías-Martínez, Vanessa Frías-Martínez. 2180-2189 [doi]
- Network optimization of food flows in the U.SCaleb Robinson, Arezoo Shirazi, Mengmeng Liu, Bistra Dilkina. 2190-2198 [doi]
- Measuring activities and values of industrial clusters based on job opportunity data collected from an internet Japanese job matching siteAki-Hiro Sato, Tsutomu Watanabe. 2199-2208 [doi]
- Solar irradiance forecasting by machine learning for solar car racesXiaoyan Shao, Siyuan Lu, Theodore G. van Kessel, Hendrik F. Hamann, Leda Daehler, Jeffrey Cwagenberg, Alan Li. 2209-2216 [doi]
- Hotel plan popularity factor analysis of hotels in the Keihanshin regionHiroshi Tsuda, Masakazu Ando, Yu Ichifuji. 2217-2224 [doi]
- Mixed data and classification of transit stopsLaura L. Tupper, David S. Matteson, John C. Handley. 2225-2232 [doi]
- A scalable and composable map-reduce systemMahwish Arif, Hans Vandierendonck, Dimitrios S. Nikolopoulos, Bronis R. de Supinski. 2233-2242 [doi]
- A workload aware model of computational resource selection for big data applicationsAmit Gupta, Weijia Xu, Natalia Ruiz-Juri, Kenneth Perrine. 2243-2250 [doi]
- Evaluation of K-means data clustering algorithm on Intel Xeon PhiSunWoo Lee, Wei-keng Liao, Ankit Agrawal, Nikos Hardavellas, Alok N. Choudhary. 2251-2260 [doi]
- Materials discovery: Understanding polycrystals from large-scale electron patternsRuoqian Liu, Ankit Agrawal, Wei-keng Liao, Alok N. Choudhary, Marc De Graef. 2261-2269 [doi]
- Building a research data science platform from industrial machinesFang Cherry Liu, Fu Shen, Duen Horng Chau, Neil Bright, Mehmet Belgin. 2270-2275 [doi]
- Visually programming dataflows for distributed data analyticsLauritz Thamsen, Thomas Renner, Marvin Byfeld, Markus Paeschke, Daniel Schroder, Felix Bohm. 2276-2285 [doi]
- Big data analytics on HPC architectures: Performance and costPeter Xenopoulos, Jamison Daniel, Michael Matheson, Sreenivas Sukumar. 2286-2295 [doi]
- Supporting large scale connected vehicle data analysis using HIVEWeijia Xu, Natalia Ruiz-Juri, Amit Gupta, Amanda Deering, Chandra Bhat, James Kuhr, Jackson Archer. 2296-2304 [doi]
- Legion-based scientific data analytics on heterogeneous processorsLina Yu, Hongfeng Yu. 2305-2314 [doi]
- Accelerating mathematical knot simulations with R on the webJuan Lin, Di Zhong, Yiwen Zhong, Hui Zhang 0006. 2315-2321 [doi]
- A geohydrologie data visualization framework with an extendable user interface designYanfu Zhou, Jieting Wu, Lina Yu, Hongfeng Yu, Zhenghong Tang. 2322-2331 [doi]
- Efficient portfolio allocation with sparse volatility estimation for high-frequency financial dataJian Zou, Chuqin Huang. 2332-2341 [doi]
- Dask & Numba: Simple libraries for optimizing scientific python codeJames Crist. 2342-2343 [doi]
- A big data platform integrating compressed linear algebra with columnar databasesVishnu Gowda Harish, Vinay Kumar Bingi, John A. Miller. 2344-2352 [doi]
- PinterNet: A thematic label curation tool for large image datasetsRuoqian Liu, Diana Palsetia, Arindam Paul, Reda Al-Bahrani, Dipendra Jha, Wei-keng Liao, Ankit Agrawal, Alok N. Choudhary. 2353-2362 [doi]
- Implementing dictionary learning in Apache Flink, Or: How I learned to relax and love iterationsGeoffrey Mon, Milad Makkie, Xiang Li, Tianming Liu, Shannon Quinn. 2363-2367 [doi]
- Making massive computational experiments painlessHatef Monajemi, David L. Donoho, Victoria Stodden. 2368-2373 [doi]
- Too big to mail: On the way to publish large-scale mobile analytics dataElla Peltonen, Eemil Lagerspetz, Petteri Nurmi, Sasu Tarkoma. 2374-2377 [doi]
- Content-based recommendation for podcast audio-items using natural language processing techniquesZhou Xing, Marzieh Parandehgheibi, Fei Xiao, Nilesh Kulkarni, Chris Pouliot. 2378-2383 [doi]
- A glue language for event stream processingSylvain Hallé, Sébastien Gaboury, Raphaël Khoury. 2384-2391 [doi]
- Real-time processing of proteomics data: The internet of things and the connected laboratoryChristopher Hillman, Karen Petrie, Andrew Cobley, Mark Whitehorn. 2392-2399 [doi]
- Predicting the shape and peak time of news article viewsYaser Keneshloo, Shuguang Wang, Eui-Hong Sam Han, Naren Ramakrishnan. 2400-2409 [doi]
- An FPGA-based low-latency network processing for spark streamingKohei Nakamura, Ami Hayashi, Hiroki Matsutani. 2410-2415 [doi]
- Handling delayed labels in temporally evolving data streamsJoshua Plasse, Niall M. Adams. 2416-2424 [doi]
- A multi-layer software architecture framework for adaptive real-time analyticsAthena Vakali, Paschalis Korosoglou, Pavlos Daoglou. 2425-2430 [doi]
- Implementing trajectory data stream analysis in parallelYongyi Xian, Chuanfei Xu, Yan Liu. 2431-2436 [doi]
- Language independent big-data system for the prediction of user location on TwitterJaime Alonso-Lorenzo, Enrique Costa-Montenegro, Milagros Fernández Gavilanes. 2437-2446 [doi]
- Forecasting Nike's sales using Facebook dataLinda Camilla Boldt, Vinothan Vinayagamoorthy, Florian Winder, Melanie Schnittger, Mats Ekran, Raghava Rao Mukkamala, Niels Buus Lassen, Benjamin Flesch, Abid Hussain, Ravi Vatrapu. 2447-2456 [doi]
- Finding informative comments for video viewingSeung-Woo Choi, Aviv Segev. 2457-2465 [doi]
- Prediction of information diffusion in social networks using dynamic carrying capacityAnahita Davoudi, Mainak Chatterjee. 2466-2469 [doi]
- When do luxury cars hit the road? Findings by a big data approachYang Feng, Jiebo Luo. 2470-2474 [doi]
- Tweet sentiment as proxy for political campaign momentumDavid Watts, K. M. George, Ashwin Kumar T. K, Zenia Arora. 2475-2484 [doi]
- A new approach to building the interindustry input-output table using block estimation techniquesRyohei Hisano. 2485-2494 [doi]
- Nowcast of firm sales using POS data toward stock market stabilityAtushi Ishikawa, Shouji Fujimoto, Takayuki Mizuno. 2495-2499 [doi]
- Uncovering information flow among users by time-series retweet data: Who is a friend of whom on Twitter?Yuka Kamiko, Mitsuo Yoshida, Hirotada Ohashi, Fujio Toriumi. 2500-2504 [doi]
- Quantifying moral foundations from various topics on Twitter conversationsRishemjit Kaur, Kazutoshi Sasahara. 2505-2512 [doi]
- Application of an integer-valued autoregressive model to hit phenomenaYasuko Kawahata, Tamio Koyama. 2513-2517 [doi]
- Analytical method of web user behavior using Hidden Markov ModelHirotaka Kawazu, Fujio Toriumi, Masanori Takano, Kazuya Wada, Ichiro Fukuda. 2518-2524 [doi]
- Leveraging social big data for performance evaluation of E-commerce websitesEyad Makki, Lin-Ching Chang. 2525-2534 [doi]
- User-generated content curation with deep convolutional neural networksRubén Tous, Otto Wüst, Mauro Gomez, Jonatan Poveda, Marc Elena, Jordi Torres, Mouna Makni, Eduard Ayguadé. 2535-2540 [doi]
- Pricing the woman card: Gender politics between hillary clinton and donald trumpYu Wang, Yang Feng, Jiebo Luo, Xiyang Zhang. 2541-2544 [doi]
- Automated classification of extremist Twitter accounts using content-based and network-based featuresDaniel Xie, JieJun Xu, Tsai-Ching Lu. 2545-2549 [doi]
- Towards a heterogeneous, polystore-like data architecture for the US Department of Veteran Affairs (VA) enterprise analyticsEdmon Begoli, Derek Kistler, Jack Bates. 2550-2554 [doi]
- Analytics-driven data ingestion and derivation in the AWESOME polystoreSubhasis Dasgupta, Kevin Coakley, Amarnath Gupta. 2555-2564 [doi]
- A semantic approach to polystoresEvgeny Kharlamov, Theofilos P. Mailis, Konstantina Bereta, Dimitris Bilidas, S. Brandt, Ernesto Jiménez-Ruiz, Steffen Lamparter, Christian Neuenstadt, Özgür L. Özçep, Ahmet Soylu, Christoforos Svingos, G. Xiao, Dmitriy Zheleznyakov, Diego Calvanese, Ian Horrocks, Martin Giese, Yannis E. Ioannidis, Yannis Kotidis, R. Moller, Arild Waaler. 2565-2573 [doi]
- Benchmarking polystores: The CloudMdsQL experienceBoyan Kolev, Raquel Pau, Oleksandra Levchenko, Patrick Valduriez, Ricardo Jiménez-Peris, José Orlando Pereira. 2574-2579 [doi]
- Digree: A middleware for a graph databases polystoreVasilis Spyropoulos, Christina Vasilakopoulou, Yannis Kotidis. 2580-2589 [doi]
- Hobbits: Hadoop and Hive based Internet traffic analysisAbdeltawab M. Hendawi, Fatemah Alali, Xiaoyu Wang, Yunfei Guan, Tianshu Zhou, Xiao Liu, Nada Basit, John A. Stankovic. 2590-2599 [doi]
- URBAN-NET: A network-based infrastructure monitoring and analysis system for emergency management and public safetySangKeun Lee, Liangzhe Chen, Sisi Duan, Supriya Chinthavali, Mallikarjun Shankar, B. Aditya Prakash. 2600-2609 [doi]
- Unravelling the Myth of big data and artificial intelligence in sustainable natural resource developmentGandhi Sivakumar, Drew Johnson, Rashida Hodge. 2610-2615 [doi]
- Big data computation of taxi movement in New York CityJoya A. Deri, Franz Franchetti, José M. F. Moura. 2616-2625 [doi]
- Linked data view methodology and application to BIM alignment and interoperabilityHolly Ferguson, Charles Vardeman, Jarek Nabrzyski. 2626-2635 [doi]
- The SMART approach to comprehensive quality assessment of site-based spatial-temporal dataRafal A. Angryk, Douglas E. Galarus. 2636-2645 [doi]
- Adapting K-means clustering to identify spatial patterns in stormsUpa Gupta, Kulsawasd Jitkajornwanich, Ramez Elmasri, Leonidas Fegaras. 2646-2654 [doi]
- Determining feature extractors for unsupervised learning on satellite imagesBehnam Hedayatnia, Mehrdad Yazdani, Mai H. Nguyen, Jessica Block, Ilkay Altintas. 2655-2663 [doi]
- An experimental study of big spatial data systemsAndrew Hulbert, Thomas Kunicki, James N. Hughes, Anthony D. Fox, Christopher N. Eichelberger. 2664-2671 [doi]
- IBM PAIRS curated big data service for accelerated geospatial data analytics and discoverySiyuan Lu, Xiaoyan Shao, Marcus Freitag, Levente J. Klein, Jason Renwick, Fernando J. Marianno, Conrad M. Albrecht, Hendrik F. Hamann. 2672-2675 [doi]
- A comparative study of dual-tree algorithm implementations for computing 2-body statistics in spatial dataChengcheng Mou, Shaoping Chen, Yi-Cheng Tu. 2676-2685 [doi]
- Towards a provenance-aware spatial-temporal architectural framework for massive data integration and analysisIvens Portugal, Paulo S. C. Alencar, Donald D. Cowan. 2686-2691 [doi]
- Using parallel hierarchical clustering to address spatial big data challengesAlan Woodley, Ling-Xiang Tang, Shlomo Geva, Richi Nayak, Timothy Chappell. 2692-2698 [doi]
- Big data development platform for engineering applicationsChien-Heng Wu, Franco Lin, Wen-Yi Chang, Whey-Fone Tsai, Hsi-Ching Lin, Chao-Tung Yang. 2699-2702 [doi]
- Large-scale solar panel mapping from aerial images using deep convolutional networksJiangye Yuan, Hsiu-Han Lexie Yang, Olufemi A. Omitaomu, Budhendra L. Bhaduri. 2703-2708 [doi]
- Symmetric repositioning of bisecting K-means centers for increased reduction of distance calculations for big data clusteringYu Zhuang. 2709-2715 [doi]
- Evaluating machine learning algorithms for anomaly detection in cloudsAnton Gulenko, Marcel Wallschläger, Florian Schmidt, Odej Kao, Feng Liu. 2716-2721 [doi]
- Preliminary big data in a 5G test networkTeemu Kanstrén, Jussi Liikka, Jukka Mäkelä, Markus Luoto, Jarmo Prokkola. 2722-2727 [doi]
- Quick model fitting using a classifying engineYiming Kong, Hui Zang, Xiaoli Ma. 2728-2733 [doi]
- Spark-based rare association rule mining for big datasetsRuilin Liu, Kai Yang, Yanjia Sun, Tao Quan, Jin Yang. 2734-2739 [doi]
- WHAT: A big data approach for accounting of modern web servicesMartino Trevisan, Idilio Drago, Marco Mellia, Han Hee Song, Mario Baldi. 2740-2745 [doi]
- BINARY: A framework for big data integration for ad-hoc queryingAzadeh Eftekhari, Farhana H. Zulkernine, Patrick Martin. 2746-2753 [doi]
- Container-based virtualization for byte-addressable NVM data storageEllis R. Giles. 2754-2763 [doi]
- NoSQL schema evolution and big data migration at scaleMeike Klettke, Uta Störl, Manuel Shenavai, Stefanie Scherzinger. 2764-2774 [doi]
- Scheduling big data workflows in the cloud under budget constraintsAravind Mohan, Mahdi Ebrahimi, Shiyong Lu, Alexander Kotov. 2775-2784 [doi]
- Big data availability: Selective partial checkpointing for in-memory database queriesDaniel Playfair, Amitabh Trehan, Barry McLarnon, Dimitrios S. Nikolopoulos. 2785-2794 [doi]
- The digital transformation and smart data analytics: An overview of enabling developments and application areasNico Rödder, David Dauer, Kevin Laubis, Paul Karaenke, Christof Weinhardt. 2795-2802 [doi]
- Towards an integrated health research process: A cloud-based approachMatthieu-P. Schapranow, Matthias Uflacker, Murat Sariyar, Sebastian C. Semler, Johannes Fichte, Dietmar Schielke, Kismet Ekinci, Thomas Zahn. 2813-2818 [doi]
- Model-driven deployment and management of workflows on analytics frameworksMerlijn Sebrechts, Sander Borny, Thomas Vanhove, Gregory van Seghbroeck, Tim Wauters, Bruno Volckaert, Filip De Turck. 2819-2826 [doi]
- Is elasticity of scalable databases a Myth?Daniel Seybold, Nicolas Wagner, Benjamin Erb, Jörg Domaschka. 2827-2836 [doi]
- Analyzing the performance of data replication and data partitioning in the cloud: The BEOWULF approachAlexander Stiemer, Ilir Fetai, Heiko Schuldt. 2837-2846 [doi]
- Understanding performance interference in multi-tenant cloud databases and web applicationsMiguel G. Xavier, Kassiano J. Matteussi, Fabian Lorenzo, César A. F. De Rose. 2847-2852 [doi]
- Evaluation-driven research in data science: Leveraging cross-field methodologiesBonnie J. Dorr, Peter C. Fontana, Craig S. Greenberg, Marion Le Bras, Mark A. Przybocki. 2853-2862 [doi]
- Bad big data scienceFrank S. Haug. 2863-2871 [doi]
- Big data team process methodologies: A literature review and the identification of key factors for a project's successJeffrey S. Saltz, Ivan Shamshurin. 2872-2879 [doi]
- Progression analysis of signals: Extending CRISP-DM to stream analyticsPankush Kalgotra, Ramesh Sharda. 2880-2885 [doi]
- Software engineering for big data projects: Domains, methodologies and gapsVijay Dipti Kumar, Paulo S. C. Alencar. 2886-2895 [doi]
- Non-deep CNN for multi-modal image classification and feature learning: An Azure-based modelSohini Roychowdhury, Johnny Ren. 2893-2812 [doi]
- Not all software engineers can become good data engineersJeffrey S. Saltz, Sibel Yilmazel, Ozgur Yilmazel. 2896-2901 [doi]
- A hacking toolset for big tabular files (Codenames: Bin4tsv, Kabutomushi)Toshiyuki Shimono. 2902-2910 [doi]
- Distributed and cloud-based multi-model analytics experiments on large volumes of climate change data in the earth system grid federation eco-systemSandro Fiore, Marcin Plóciennik, Charles M. Doutriaux, Cosimo Palazzo, J. Boutte, Tomasz Zok, Donatello Elia, Michal Owsiak, Alessandro D'Anca, Z. Shaheen, Riccardo Bruno, Marco Fargetta, Miguel Caballer, Germán Moltó, Ignacio Blanquer, Roberto Barbera, Mário David, Giacinto Donvito, Dean N. Williams, V. Anantharaj, Davide Salomoni, Giovanni Aloisio. 2911-2918 [doi]
- Modeling martian thermal inertia in a distributed memory high performance computing environmentJason Laura, Robin L. Fergason. 2919-2928 [doi]
- Where big data meets linked data: Applying standard data models to environmental data streamsAdam M. Leadbetter, Damian Smyth, Robert Fuller, Eoin O'Grady, Adam Shepherd. 2929-2937 [doi]
- Three-dimensional spatial join count exploiting CPU optimized STR R-treeRyuya Mitsuhashi, Hideyuki Kawashima, Takahiro Nishimichi, Osamu Tatebe. 2938-2947 [doi]
- Implementing connected component labeling as a user defined operator for SciDBAmidu Oloso, Kwo-Sen Kuo, Thomas L. Clune, Paul Brown, Alex Poliakov, Hongfeng Yu. 2948-2952 [doi]
- A new parallel python tool for the standardization of earth system model dataKevin Paul, Sheri A. Mickelson, John M. Dennis. 2953-2959 [doi]
- Using cloud bursting to count trees and shrubs in Sub-Saharan AfricaMichael Requa, Garrison Vaughan, John David, Ben Cotton. 2960-2963 [doi]
- SciSpark: Highly interactive in-memory science data analyticsBrian Wilson, Rahul Palamuttam, Kim Whitehall, Chris Mattmann, Alex Goodman, Maziyar Boustani, Sujen Shah, Paul Zimdars, Paul M. Ramirez. 2964-2973 [doi]
- Visualization and diagnosis of earth science data through Hadoop and SparkShujia Zhou, Xiaowen Li, Toshihisa Matsui, Wei-Kuo Tao. 2974-2980 [doi]
- Persisting in-memory databases using SCMEllis Giles, Kshitij Doshi, Peter J. Varman. 2981-2990 [doi]
- SS-dedup: A high throughput stateful data routing algorithm for cluster deduplication systemZhihao Huang, Hui Li, Xin Li, Wei He. 2991-2995 [doi]
- EStore: An effective optimized data placement structure for HiveXin Li, Hui Li, Zhihao Huang, Bing Zhu, Jiawei Cai. 2996-3001 [doi]
- Towards optimizing large-scale data transfers with end-to-end integrity verificationSi Liu, Eun Sung Jung, Rajkumar Kettimuthu, Xian-He Sun, Michael E. Papka. 3002-3007 [doi]
- CoLoc: Distributed data and container colocation for data-intensive applicationsThomas Renner, Lauritz Thamsen, Odej Kao. 3008-3015 [doi]
- Linked data platform for building cloud-based smart applications and connecting API access points with data discovery techniquesHolly Ferguson, Charles Vardeman, Jarek Nabrzyski. 3016-3025 [doi]
- MetaStore: A metadata framework for scientific data repositoriesAjinkya Prabhune, Hasebullah Ansari, Anil Keshav, Rainer Stotzka, Michael Gertz, Jürgen Hesser. 3026-3035 [doi]
- Automated schema extraction for PID information typesUlrich Schwardmann. 3036-3044 [doi]
- Facilitating reproducible research by investigating computational metadataPriyaa Thavasimani, Paolo Missier. 3045-3051 [doi]
- Constellation: A science graph network for scalable data and knowledge discovery in extreme-scale scientific collaborationsSudharshan S. Vazhkudai, John Harney, Raghul Gunasekaran, Dale Stansberry, Seung-Hwan Lim, Tom Barron, Andrew Nash, Arvind Ramanathan. 3052-3061 [doi]
- Detecting spammers on social networks based on a hybrid modelGuangxia Xu, Jin Qi, Deling Huang, Mahmoud Daneshmand. 3062-3068 [doi]
- Bandwidth provision strategies for reliable data movements in dedicated networksLiudong Zuo, Mengxia Michelle Zhu. 3069-3078 [doi]
- Investigation of forecasting methods for the hourly spot price of the day-ahead electric power marketsRadhakrishnan Angamuthu Chinnathambi, Prakash Ranganathan. 3079-3086 [doi]
- Leveraging user expertise in collaborative systems for annotating energy datasetsHông-Ân Cao, Felix Rauchenstein, Tri Kurniawan Wijaya, Karl Aberer, Nuno Nunes. 3087-3096 [doi]
- Temporal association rules for electrical activity detection in residential homesHông-Ân Cao, Tri Kurniawan Wijaya, Karl Aberer, Nuno Nunes. 3097-3106 [doi]
- Leveraging cloud computing to convert the non-intrusive load monitor into a powerful framework for grid-responsive buildingsSaman Mostafavi, Benjamin Futrell, John Troxler, Robert W. Cox. 3107-3114 [doi]
- Big data, better energy management and control decisions for distribution systems in smart gridShady S. Refaat, Haitham Abu-Rub, Amira Mohamed. 3115-3120 [doi]
- Detecting non-technical energy losses through structural periodic patterns in AMI dataViktor Botev, Magnus Almgren, Vincenzo Gulisano, Olaf Landsiedel, Marina Papatriantafilou, Joris van Rooij. 3121-3130 [doi]
- Lossless compression of high-frequency voltage and current data in smart gridsAndreas Unterweger, Dominik Engel. 3131-3139 [doi]
- Indexing spatiotemporal relations in solar event datasetsBerkay Aydin, Ahmet Küçük, Rafal A. Angryk. 3140-3148 [doi]
- Spatio-temporal interpolation methods for solar events metadataSoukaina Filali Boubrahimi, Berkay Aydin, Dustin Kempton, Rafal A. Angryk. 3149-3157 [doi]
- Processing and managing the Kepler mission's treasure trove of stellar and exoplanet dataJon M. Jenkins. 3158-3167 [doi]
- Describing solar images with sparse coding for similarity searchDustin J. Kempton, Michael A. Schuh, Rafal A. Angryk. 3168-3176 [doi]
- A data-driven analysis of interplanetary coronal mass ejecta and magnetic flux ropesRuizhe Ma, Rafal A. Angryk, Pete Riley. 3177-3186 [doi]
- Running scientific algorithms as array database operators: Bringing the processing power to the dataSimon Marcin, André Csillaghy. 3187-3193 [doi]
- The best of both worlds: Using automatic detection and limited human supervision to create a homogenous magnetic catalog spanning four solar cyclesA. Munoz-Jaramillo, Z. A. Werginz, J. P. Vargas-Acosta, M. D. DeLuca, J. C. Windmueller, J. Zhang, D. W. Longcope, D. A. Lamb, C. E. DeForest, S. Vargas-Dominguez, J. W. Harvey, P. C. H. Martens. 3194-3203 [doi]
- An input catalog and target selection for the transiting exoplanet survey satelliteRyan J. Oelkers, Keivan G. Stassun, Joshua A. Pepper, Nathan M. De Lee, Martin A. Paegert. 3204-3213 [doi]
- Method for estimating cycle lengths from multidimensional time series: Test cases and application to a massive "in silico" datasetN. Olspert, M. J. Kapyla, J. Pelti. 3214-3223 [doi]
- Opening up dark digital archives through the use of analytics to identify sensitive contentBennett B. Borden, Jason R. Baron. 3224-3229 [doi]
- Mining and analysing one billion requests to linguistic servicesMarco Büchler, Greta Franzini, Emily Franzini, Thomas Eckart. 3230-3239 [doi]
- Mind the explanatory gap: Quality from quantityJenny Bunn. 3240-3244 [doi]
- Exploring archives with probabilistic models: Topic modelling for the valorisation of digitised archives of the European CommissionSimon Hengchen, Mathias Coeckelbergs, Seth van Hooland, Ruben Verborgh, Thomas Steiner. 3245-3249 [doi]
- Understanding computational web archives research methods using research objectsEmily Maemura, Christoph Becker, Ian Milligan. 3250-3259 [doi]
- Traces through time: A probabilistic approach to connected archival dataSonia Ranade. 3260-3265 [doi]
- Computational provenance: DataONE and implications for cultural heritage institutionsRobert J. Sandusky. 3266-3271 [doi]
- Appraising digital archives with ArchivematicaMichael Shallcross. 3272-3276 [doi]
- Breaking down the invisible wall to enrich archival science and practiceKenneth Thibodeau. 3277-3282 [doi]
- Content-based comparison for collections identificationWeijia Xu, Ruizhu Huang, Maria Esteva, Jawon Song, Ramona L. Walls. 3283-3289 [doi]
- Deep topology classification: A new approach for massive graph classificationStephen Bonner, John Brennan, Georgios Theodoropoulos, Ibad Kureshi, Andrew Stephen McGough. 3290-3297 [doi]
- GFP-X: A parallel approach to massive graph comparison using sparkStephen Bonner, John Brennan, Georgios Theodoropoulos, Ibad Kureshi, Andrew Stephen McGough. 3298-3307 [doi]
- Fast distributed k-nn graph updateThibault Debatty, Fabio Pulvirenti, Pietro Michiardi, Wim Mees. 3308-3317 [doi]
- An incremental local-first community detection method for dynamic graphsHiroki Kanezashi, Toyotaro Suzumura. 3318-3325 [doi]
- Massive graph processing on nanocomputersBryan Rainey, David F. Gleich. 3326-3335 [doi]
- GraphFlow: Workflow-based big graph processingSara Riazi, Boyana Norris. 3336-3343 [doi]
- On the hyperbolicity of large-scale networks and its estimationW. Sean Kennedy, Iraj Saniee, Onuttom Narayan. 3344-3351 [doi]
- Parallel graph mining with dynamic load balancingNilothpal Talukder, Mohammed J. Zaki. 3352-3359 [doi]
- Distributed exact subgraph matching in small diameter dynamic graphsCharith Wickramaarachchi, Rajgopal Kannan, Charalampos Chelmis, Viktor K. Prasanna. 3360-3369 [doi]
- Fast reachability query computation on big attributed graphsDuncan Yung, Shi-Kuo Chang. 3370-3380 [doi]
- Drug target path discovery on semantic biomedical big dataFang Du, Ting Li, Yingjie Shi, Lijuan Song, Xiaojun Gu. 3381-3386 [doi]
- A framework to predict outcome for cancer patients using data from a nursing EHRMuhammad Kamran Lodhi, Rashid Ansari, Yingwei Yao, Gail M. Keenan, Diana J. Wilkie, Ashfaq A. Khokhar. 3387-3395 [doi]
- Distributed rank-1 dictionary learning: Towards fast and scalable solutions for fMRI big data analyticsMilad Makkie, Xiang Li, Tianming Liu, Shannon Quinn, Binbin Lin, Jieping Ye. 3396-3403 [doi]
- Mortality prediction of ICU patients using lab test data by feature vector compaction & classificationMohammad M. Masud, Abdel Rahman Al Harahsheh. 3404-3411 [doi]
- Iterative unified clustering in big dataVasundhara Misal, Vandana P. Janeja, Sai C. Pallaprolu, Yelena Yesha, Raghu Chintalapati. 3412-3421 [doi]
- Application of big data analytics for automated estimation of CT image qualityMaitham D. Naeemi, Johnny Ren, Nathan Hollcroft, Adam M. Alessio, Sohini Roychowdhury. 3422-3431 [doi]
- Wearable sensor based human posture recognitionJianwu Wang, Zhichuan Huang, Wenbin Zhang, Ankita Patil, Ketan Patil, Ting Zhu, Eric J. Shiroma, Mitchell A. Schepps, Tamara B. Harris. 3432-3438 [doi]
- Simple and effective pre-processing for automated melanoma discrimination based on cytological findingsTakuya Yoshida, M. Emre Celebi, Gerald Schaefer, Hitoshi Iyatomi. 3439-3442 [doi]
- Big data approach in healthcare used for intelligent design - Software as a serviceWeider D. Yu, Jaspal Singh Gill, Maulin Dalal, Piyush Jha, Sajan Shah. 3443-3449 [doi]
- Interactive personalized interesting pattern discoveryMansurul Alam Bhuiyan, Mohammad Al Hasan. 3450-3456 [doi]
- Android malware detection with weak ground truth dataJordan DeLoach, Doina Caragea, Xinming Ou. 3457-3464 [doi]
- Probabilistic parallelisation of blocking non-matched records for big dataChenxiao Dou, Daniel Sun, Yi-Cheng Chen, Guoqiang Li 0001, Jianquan Liu. 3465-3473 [doi]
- Universal data discovery using atypicalityAnders Høst-Madsen, Elyas Sabeti, Chad Walton, Su Jun Lim. 3474-3483 [doi]
- A Markov chain collaborative filtering model for course enrollment recommendationsElham Sahebkar Khorasani, Zhao Zhenge, John Champaign. 3484-3490 [doi]
- Predicting traffic of online advertising in real-time bidding systems from perspective of demand-side platformsHsu-Chao Lai, Wen-Yueh Shih, Jiun-Long Huang, Yi-Cheng Chen. 3491-3498 [doi]
- Leveraging cloud data to mitigate user experience from 'breaking bad'Nicholas A. James, Arun Kejariwal, David S. Matteson. 3499-3508 [doi]
- Topic modeling for management sciences: A network-based approachMax Menenberg, Surya Pathak, Hari P. Udyapuram, Srinagesh Gavirneni, Sohini Roychowdhury. 3509-3518 [doi]
- The technical hashtag in Twitter data: A hadoop experienceIzabela Moise. 3519-3528 [doi]
- Using semantic-based approach to manage perspectives of process mining: Application on improving learning process domain dataKingsley Okoye, Abdel-Rahman H. Tawil, Usman Naeem, Syed Islam, Elyes Lamine. 3529-3538 [doi]
- Label propagation in big data to detect remote access TrojansSai C. Pallaprolu, Josephine M. Namayanja, Vandana P. Janeja, C. T. Sai Adithya. 3539-3547 [doi]
- A novel big-data processing framwork for healthcare applications: Big-data-healthcare-in-a-boxFuad Rahman, Marvin J. Slepian, Ari Mitra. 3548-3555 [doi]
- An efficient parallel topic-sensitive expert finding algorithm using sparkYao-Ming Yang, Chang-Dong Wang, Jian-Huang Lai. 3556-3562 [doi]
- Exploring the utilization of places through a scalable "Activities in Places" analysis mechanismLinlin You, Bige Tunçer. 3563-3572 [doi]
- Robust K-subspaces recovery with combinatorial initializationJun He, Yue Zhang, Jiye Wang, Nan Zeng, HanYong Hao. 3573-3582 [doi]
- TSmap3D: Browser visualization of high dimensional time series dataSupun Kamburugamuve, Pulasthi Wickramasinghe, Saliya Ekanayake, Chathuri Wimalasena, Milinda Pathirage, Geoffrey C. Fox. 3583-3592 [doi]
- On the theory and practice of high-dimensional data indexing with iDistanceMichael A. Schuh, Rafal A. Angryk. 3593-3600 [doi]
- "Influence sketching": Finding influential samples in large-scale regressionsMichael Wojnowicz, Ben Cruz, Xuan Zhao, Brian Wallace, Matt Wolff, Jay Luan, Caleb Crable. 3601-3612 [doi]
- Minimum density hyperplanes in the feature spaceKatie R. Yates, Nicos G. Pavlidis. 3613-3618 [doi]
- Structure preserving dimension reduction with 2D images as predictorsBo Zhang, Liwei Wang. 3619-3624 [doi]
- Memory access pattern based insider threat detection in big data systemsSantosh Aditham, Nagarajan Ranganathan, Srinivas Katkoori. 3625-3628 [doi]
- Automated big security text pruning and classificationKhudran Alzhrani, Ethan M. Rudd, C. Edward Chow, Terrance E. Boult. 3629-3637 [doi]
- Big data analytics as-a-service: Issues and challengesClaudio Agostino Ardagna, Paolo Ceravolo, Ernesto Damiani. 3638-3644 [doi]
- Data privacy for IoT systems: Concepts, approaches, and research directionsElisa Bertino. 3645-3647 [doi]
- Towards an effective and efficient malware detection systemChia-Tien Dan Lo, Pablo Ordóñez, Carlos Cepeda Mora. 3648-3655 [doi]
- Private databases on the cloud: Models, issues and research perspectivesAlfredo Cuzzocrea, Carlo Mastroianni, Giorgio Mario Grasso. 3656-3661 [doi]
- Concise essence-preserving big data representationPhilip Derbeko, Shlomi Dolev, Ehud Gudes, Jeffrey D. Ullman. 3662-3665 [doi]
- Trusted cloud SQL DBS with on-the-fly AES decryption/encryptionSushil Jajodia, Witold Litwin, Thomas J. E. Schwarz. 3666-3675 [doi]
- An entropy-based analytic model for the privacy-preserving in open dataSoo-Hyung Kim, Changwook Jung, Yoon-Joon Lee. 3676-3684 [doi]
- Phishing detection based on newly registered domainsXueni Li, Guanggang Geng, Zhiwei Yan, Yong Chen, Xiaodong Lee. 3685-3692 [doi]
- Security and privacy for big data: A systematic literature reviewBoel Nelson, Tomas Olovsson. 3693-3702 [doi]
- Phishing through social bots on TwitterMohammad Shafahi, Leon Kempers, Hamideh Afsarmanesh. 3703-3712 [doi]
- Reverse engineering smart card malware using side channel analysis with machine learning techniquesHippolyte Djonon Tsague, Bheki Twala. 3713-3721 [doi]
- S3C: An architecture for space-efficient semantic search over encrypted data in the cloudJason Woodworth, Mohsen Amini Salehi, Vijay Raghavan. 3722-3731 [doi]
- A systems approach to big data technology applied to supply chainTomohiro Fukui. 3732-3736 [doi]
- Optimizing performance of sentiment analysis through design of experimentsGary S. W. Goh, Andy J. L. Ang, Allan N. Zhang. 3737-3742 [doi]
- Analysis for supply hub in industrial cluster: Classic vs. new perspectiveVahid Kayvanfar, S. M. Moattar Husseini, Behrooz Karimi, Mohsen S. Sajadieh, Tan Wen Jun. 3743-3748 [doi]
- A DEA approach for Supplier Selection with AHP and risk considerationJasmine J. Lim, Allan N. Zhang. 3749-3758 [doi]
- Deep learning in the automotive industry: Applications and toolsAndré Luckow, Matthew Cook, Nathan Ashcraft, Edwin Weill, Emil Djerekarov, Bennie Vorster. 3759-3768 [doi]
- The Bayesian estimators of polytomous item response theory models with approximated conditional likelihood and their mathematical optimalitiesKazumasa Mori, Takuya Ohmori. 3769-3772 [doi]
- Data blending in manufacturing and supply chainsB. Y. Ong, Rong Wen, Allan N. Zhang. 3773-3778 [doi]
- Adaptive resilient strategies for supply chain networksWen Jun Tan, Wentong Cai, Zhengping Li. 3779-3784 [doi]
- Prediction of regional goods demand incorporating the effect of weatherTakuya Watanabe, Hiroaki Muroi, Motoki Naruke, Kyoto Yono, Gen Kobayashi, Masanori Yamasaki. 3785-3791 [doi]
- Weighted clustering of spatial pattern for optimal logistics hub deploymentRong Wen, Wenjing Yan, Allan N. Zhang. 3792-3797 [doi]
- Vessel movement analysis and pattern discovery using density-based clustering approachWenjing Yan, Rong Wen, Allan N. Zhang, Dazhi Yang. 3798-3806 [doi]
- Spatial data dimension reduction using quadtree: A case study on satellite-derived solar radiationDazhi Yang, Gary S. W. Goh, Siwei Jiang, Allan N. Zhang. 3807-3812 [doi]
- Forecast UPC-level FMCG demand, Part III: Grouped reconciliationDazhi Yang, Gary S. W. Goh, Siwei Jiang, Allan N. Zhang. 3813-3819 [doi]
- Word embeddings for Arabic sentiment analysisA. Aziz Altowayan, Lixin Tao. 3820-3825 [doi]
- Giving voice to office customers: Best practices in how office handles verbatim text feedbackMichael Bentley, Soumya Batra. 3826-3832 [doi]
- Unlock big data emotions: Weighted word embeddings for sentiment classificationXiangfeng Dai, Robert Prout. 3833-3838 [doi]
- Big social data analytics of changes in consumer behaviour and opinion of a TV broadcasterAnna Hennig, Anne-Sofie Amodt, Henrik Hernes, Helene Mejer Nygardsmoen, Peter Arenfeldt Larsen, Raghava Rao Mukkamala, Benjamin Flesch, Abid Hussain, Ravi Vatrapu. 3839-3848 [doi]
- TV ratings vs. social media engagement: Big social data analytics of the Scandinavian TV talk show SkavlanHenrikke Hovda Larsen, Johanna Margareta Forsberg, Sigrid Viken Hemstad, Raghava Rao Mukkamala, Abid Hussain, Ravi Vatrapu. 3849-3858 [doi]
- Totally automated keyword extractionTayfun Pay. 3859-3863 [doi]
- Efficient natural language pre-processing for analyzing large data setsBelainine Billal, Alexsandro Fonseca, Fatiha Sadat. 3864-3871 [doi]
- A grapheme-level approach for constructing a Korean morphological analyzer without linguistic knowledgeJihun Choi, Jonghem Youn, Sang-goo Lee. 3872-3879 [doi]
- lexiDB: A scalable corpus database management systemMatthew Coole, Paul Rayson, John A. Mariani. 3880-3884 [doi]
- Large-scale taxonomy categorization for noisy product listingsPradipto Das, Yandi Xia, Aaron Levine, Giuseppe Di Fabbrizio, Ankur Datta. 3885-3894 [doi]
- Scaling character-based morphological tagging to fourteen languagesGeorg Heigold, Josef van Genabith, Günter Neumann. 3895-3902 [doi]
- Lightweight system for NE-tagged news headlines corpus creationAvinash Kumar, Dhaval Patel, Nikita Jain. 3903-3912 [doi]
- Domain-specific user preference prediction based on multiple user activitiesYunfei Long, Qin Lu, Yue Xiao, Minglei Li, Chu-Ren Huang. 3913-3921 [doi]
- Document classification through image-based character embedding and wildcard trainingDaiki Shimada, Ryunosuke Kotani, Hitoshi Iyatomi. 3922-3927 [doi]
- Large-scale text processing pipeline with Apache SparkA. Svyatkovskiy, K. Imai, M. Kroeger, Y. Shiraito. 3928-3935 [doi]
- Automatic classification of securities using hierarchical clustering of the 10-KsHoseong Yang, Hye-Jin Lee, Sungzoon Cho, Eugene Cho. 3936-3943 [doi]
- Max-node sampling: An expansion-densification algorithm for data collectionKatchaguy Areekijseree, Ricky Laishram, Sucheta Soundarajan. 3944-3946 [doi]
- Real-time sentiment analysis of Saudi dialect tweets using SPARKAdel Assiri, Ahmed Emam, Hmood Al-Dossari. 3947-3950 [doi]
- Modeling, validation and verification of cell-scaffold contact measurements over terabyte-sized 3D image collectionPeter Bajcsy, Soweon Yoon, Mylene Simon, Mary Brady, Ram D. Sriram, Nathan Hotaling, Nicholas Schaub, Carl G. Simon, Piotr M. Szczypinski, Stephen J. Florczyk. 3951-3953 [doi]
- An integrated assessment approach to different collaborative filtering algorithmsRaja Sarath Kumar Boddu. 3954-3956 [doi]
- Sequential randomized matrix factorization for Gaussian processesShaunak D. Bopardikar, George S. Eskander Ekladious. 3957-3959 [doi]
- Comparison of lossless video and image compression codecs for medical computed tomography datasetsVy Bui, Lin-Ching Chang, Dunling Li, Li-yueh Hsu, Marcus Y. Chen. 3960-3962 [doi]
- ORANGE: Spatial big data analysis platformSunghwan Cho, Sunghal Hong, Changsoo Lee. 3963-3965 [doi]
- Accessing and distributing large volumes of NetCDF dataRanjeet Devarakonda, Yaxing Wei, Michele Thornton. 3966-3967 [doi]
- Next-gen tools for big scientific data: ARM data center exampleRanjeet Devarakonda, Kyle Dumas, Sheman Beus, Everett Rush, Bhargavi Krishna, Rob Records, Giri Prakash. 3968-3970 [doi]
- Correlation between weather and weather-related tweets - A preliminary studySrabasti Dutta, Sumantro Ray, S. Roy. 3971-3973 [doi]
- Fall recognition using wearable technologies and machine learning algorithmsAustin Harris, Hanna True, Zhen Hu, Jin Cho, Nancy Fell, Mina Sartipi. 3974-3976 [doi]
- "What makes a pro eating disorder hashtag": Using hashtags to identify pro eating disorder tumblr posts and Twitter usersLing He, Jiebo Luo. 3977-3979 [doi]
- Evaluation of distributed processing of caffe framework using poor performance deviceAyae Ichinose, Masato Oguchi, Atsuko Takefusa, Hidemoto Nakada. 3980-3982 [doi]
- Fast and space-efficient secure frequent pattern mining by FHEHiroki Imabayashi, Yu Ishimaki, Akira Umayabara, Hayato Yamana. 3983-3985 [doi]
- Analysis of Pokémon GO using sociophysics approachAkira Ishii, Masanori Ajito, Yasuko Kawahata. 3986-3988 [doi]
- Privacy-preserving string search for genome sequences with FHE bootstrapping optimizationYu Ishimaki, Hiroki Imabayashi, Kana Shimizu, Hayato Yamana. 3989-3991 [doi]
- Harmonization of methods to facilitate reproducibility in medical data processing: Applications to diffusion tensor magnetic resonance imagingJeffrey Jenkins, Lin-Ching Chang, Elizabeth B. Hutchinson, M. Okan Irfanoglu, Carlo Pierpaoli. 3992-3994 [doi]
- TPR∗-tree Performance improvement for big tactical moving objectsSeungwoo Jeon, Jaegi Hong, Bonghee Hong, Chumsu Kim. 3995-3997 [doi]
- A data analysis and visualization system for large-scale e-bike dataXiaoxia Jia, Peng Cheng, Jiming Chen. 3998-4000 [doi]
- Big data application in job trend analysisPriyanka Kale, Shilpa Balan. 4001-4003 [doi]
- Nowcasting with social media dataDavid L. Kimmey, Jin S. Yoo. 4004 [doi]
- CareerMapper: An automated resume evaluation toolVivian Lai, Kyong Jin Shim, Richard Jayadi Oentaryo, Philips Kokoh Prasetyo, Casey Vu, Ee-Peng Lim, David Lo. 4005-4007 [doi]
- Predicted max degree sampling: Sampling in directed networks to maximize node coverage through crawlingRicky Laishram, Katchaguy Areekijseree, Sucheta Soundarajan. 4008-4010 [doi]
- A generator of test data set for tactical moving objects based on velocityJiwan Lee, Jaegi Hong, Bonghee Hong, Jinsu Ahn. 4011-4013 [doi]
- Using paraphrases to improve tweet classification: Comparing WordNet and word embedding approachesQuanzhi Li, Sameena Shah, Mohammad Mahdi Ghassemi, Rui Fang, Armineh Nourbakhsh, Xiaomo Liu. 4014-4016 [doi]
- A framework for large-scale bacterial motility behavior analysisXiaomeng Liang, Lin-Ching Chang, Arash Massoudieh. 4017-4019 [doi]
- Inferring relations in knowledge graphs with tensor decompositionsAnkur Padia, Konstantinos Kalpakis, Tim Finin. 4020-4022 [doi]
- Towards a more meterless parking system: Understanding meter payment behavior and trends in Washington, DCBenito O. Perez, Yiwei Ma, Mengran Wang, Xiaomeng Liang, Negin Askarzadeh. 4023-4025 [doi]
- HPC infrastructure to support the next-generation ARM facility data operationsGiri Prakash, Jitendra Kumar, Everett Rush, Robert Records, Anthony Clodfelter, Jimmy W. Voyles. 4026-4028 [doi]
- Using automated enforcement data to achieve vision zero goals: A case studyJ. M. Rogers, S. S. Dey, R. Retting, R. Jain, X. Liang, N. Askarzadeh. 4029-4031 [doi]
- Analysis of teamwork dialogue: A data mining approachAntonette Shibani, Elizabeth Koh, Vivian Lai, Kyong Jin Shim. 4032-4034 [doi]
- Meta-analysis of big data security and privacy: Scholarly literature gapsKenneth David Strang, Zhaohao Sun. 4035-4037 [doi]
- An approach for extracting big micro-scale severe weather region trajectories automatically from meteorological radar dataXingang Wang, Zhigang Gai, Suiping Qi. 4038-4039 [doi]
- An improved social spammer detection based on tri-trainingGuangxia Xu, Jingteng Zhao, Deling Huang. 4040-4042 [doi]