Abstract is missing.
- Human-in-the-loop applied machine learningCarla E. Brodley. 1 [doi]
- A more open efficient future for AI development and data science with an introduction to JuliaAlan Edelman. 2 [doi]
- Contextual reinforcement learningJohn Langford 0001. 3 [doi]
- Large-scale graph representation learningJure Leskovec. 4 [doi]
- Being "BYTES-oriented" in HPC leads to an open big data/AI ecosystem and further advances into the post-moore eraSatoshi Matsuoka. 5 [doi]
- TextScope: Enhance human perception via text miningChengXiang Zhai. 6 [doi]
- Collective subjective logic: Scalable uncertainty-based opinion inferenceFeng Chen, Chunpai Wang, Jin-Hee Cho. 7-16 [doi]
- Quality-aware aggregation & predictive analytics at the edgeNatascha Harth, Christos Anagnostopoulos. 17-26 [doi]
- Robust multi-label semi-supervised classificationSheng Li 0001, Yun Fu. 27-36 [doi]
- Lifelong multi-task multi-view learning using latent spacesXiaoli Li 0013, Sai Nivedita Chandrasekaran, Jun Huan. 37-46 [doi]
- Compact multi-class boosted treesNatalia Ponomareva, Thomas Colthurst, Gilbert Hendry, Salem Haykal, Soroush Radpour. 47-56 [doi]
- Constraint-aware dynamic truth discovery in big data social media sensingDaniel Yue Zhang, Dong Wang, Yang Zhang. 57-66 [doi]
- Standardizing big earth datacubesPeter Baumann. 67-73 [doi]
- Enhancing data quality by cleaning inconsistent big RDF dataSalima Benbernou, Mourad Ouziri. 74-79 [doi]
- Iterative matrix correlation for bisection clusteringByron J. Gao, Robert Tung, Yong Yang. 80-87 [doi]
- Entropic determinants of massive matricesDiego Granziol, Stephen J. Roberts. 88-93 [doi]
- Big active learningEr-Chen Huang, Hsing-Kuo Pao, Yuh-Jye Lee. 94-101 [doi]
- A novel approach to optimization of iterative machine learning algorithms: Over heap structureHasan Kurban, Mehmet M. Dalkilic. 102-109 [doi]
- Multi-view graph learning with adaptive label propagationSheng Li 0001, Hongfu Liu, Zhiqiang Tao, Yun Fu. 110-115 [doi]
- Exponential random graph models with big networks: Maximum pseudolikelihood estimation and the parametric bootstrapChristian S. Schmid, Bruce A. Desmarais. 116-121 [doi]
- Automated industry classification with deep learningSam Wood, Rohit Muthyala, Yi Jin, Yixing Qin, Nilaj Rukadikar, Amit Rai, Hua Gao. 122-129 [doi]
- Jointly optimizing task granularity and concurrency for in-memory mapreduce frameworksJonghyun Bae, Hakbeom Jang, Wenjing Jin, Jun Heo, Jaeyoung Jang, Joo Young Hwang, Sangyeun Cho, Jae W. Lee. 130-140 [doi]
- How fast can one scale down a distributed file system?Nathanael Cheriere, Gabriel Antoniu. 141-150 [doi]
- ATM: A distributed, collaborative, scalable system for automated machine learningThomas Swearingen, Will Drevo, Bennett Cyphers, Alfredo Cuesta-Infante, Arun Ross, Kalyan Veeramachaneni. 151-162 [doi]
- A decision tree based approach towards adaptive modeling of big data applicationsIoannis Giannakopoulos, Dimitrios Tsoumakos, Nectarios Koziris. 163-172 [doi]
- Characterizing and accelerating indexing techniques on distributed ordered tablesShashank Gugnani, Xiaoyi Lu, Houliang Qi, Li Zha, Dhabaleswar K. Panda. 173-182 [doi]
- ooc_cuDNN: Accommodating convolutional neural networks over GPU memory capacityYuki Ito, Ryo Matsumiya, Toshio Endo. 183-192 [doi]
- A semantics-aware storage framework for scalable processing of knowledge graphs on HadoopHyeongSik Kim, Padmashree Ravindra, Kemafor Anyanwu. 193-202 [doi]
- Elastic management of cloud applications using adaptive reinforcement learningKonstantinos Lolos, Ioannis Konstantinou, Verena Kantere, Nectarios Koziris. 203-212 [doi]
- Performance characterization and acceleration of big data workloads on OpenPOWER systemXiaoyi Lu, Haiyang Shi, Dipti Shankar, Dhabaleswar K. Panda. 213-222 [doi]
- Low-latency multi-threaded ensemble learning for dynamic big data streamsDiego Marron, Eduard Ayguadé, José R. Herrero, Jesse Read, Albert Bifet. 223-232 [doi]
- I/O load balancing for big data HPC applicationsArnab Kumar Paul, Arpit Goyal, Feiyi Wang, Sarp Oral, Ali Raza Butt, Michael J. Brim, Sangeetha B. Srinivasa. 233-242 [doi]
- HarpLDA+: Optimizing latent dirichlet allocation for parallel efficiencyBo Peng, Bingjing Zhang, Langshi Chen, Mihai Avram, Robert Henschel, Craig A. Stewart, Shaojuan Zhu, Emily McCallum, Lisa Smith, Tom Zahniser, Jon Omer, Judy Qiu. 243-252 [doi]
- Fast access to columnar, hierarchically nested data via code transformationJim Pivarski, Peter Elmer, Brian Bockelman, Zhe Zhang. 253-262 [doi]
- Sanzu: A data science benchmarkAlex Watson, Deepigha Shree Vittal Babu, Suprio Ray. 263-272 [doi]
- Scaling up data-parallel analytics platforms: Linear algebraic operation casesLuna Xu, Seung-Hwan Lim, Min Li, Ali Raza Butt, Ramakrishnan Kannan. 273-282 [doi]
- Robotomata: A framework for approximate pattern matching of big data on an automata processorXiaodong Yu, Kaixi Hou, Hao Wang, Wu-chun Feng. 283-292 [doi]
- Making caches work for graph analyticsYunming Zhang, Vladimir Kiriansky, Charith Mendis, Saman P. Amarasinghe, Matei Zaharia. 293-302 [doi]
- On the usability of Hadoop MapReduce, Apache Spark & Apache flink for data scienceBilal Akil, Ying Zhou, Uwe Röhm. 303-310 [doi]
- Energy efficient stochastic-based deep spiking neural networks for sparse datasetsMohammed Alawad, Hong-Jun Yoon, Georgia D. Tourassi. 311-318 [doi]
- External memory pipelining made easy with TPIELars Arge, Mathias Rav, Svend C. Svendsen, Jakob Truelsen. 319-324 [doi]
- Compressed domain-specific data processing and analysisDapeng Dong, John Herbert. 325-330 [doi]
- Understanding and optimizing the performance of distributed machine learning applications on apache sparkCelestine Dünner, Thomas P. Parnell, Kubilay Atasu, Manolis Sifalakis, Haralampos Pozidis. 331-338 [doi]
- Optimal reducer placement to minimize data transfer in MapReduce-style processingXiao Meng, Lukasz Golab. 339-346 [doi]
- Big data and HPC collocation: Using HPC idle resources for Big Data analyticsMichael Mercier, David Glesser, Yiannis Georgiou, Olivier Richard. 347-352 [doi]
- eTRIKS analytical environment: A modular high performance framework for medical data analysisAxel Oehmichen, Florian Guitton, Kai Sun, Jean Grizet, Thomas Heinis, Yike Guo. 353-360 [doi]
- Multi-objective optimization of scheduling dataflows on heterogeneous cloud resourcesIlia Pietri, Yannis Chronis, Yannis E. Ioannidis. 361-368 [doi]
- NVMD: Non-volatile memory assisted design for accelerating MapReduce and DAG execution frameworks on HPC systemsMd. Wasi-ur-Rahman, Nusrat Sharmin Islam, Xiaoyi Lu, Dhabaleswar K. Panda. 369-374 [doi]
- Towards memory and computation efficient graph processing on sparkXinhui Tian, Yuanqing Guo, Jianfeng Zhan, Lei Wang 0004. 375-382 [doi]
- Sandpiper: Scaling probabilistic inferencing to large scale graphical modelsAlexander Ulanov, Manish Marwah, Mijung Kim, Roshan Dathathri, Carlos Zubieta, Jun Li. 383-388 [doi]
- Dione: Profiling spark applications exploiting graph similarityNikos Zacheilas, Stathis Maroulis, Vana Kalogeraki. 389-394 [doi]
- On on-line task assignment in spatial crowdsourcingMohammad Asghari, Cyrus Shahabi. 395-404 [doi]
- QuAD: A quorum protocol for adaptive data management in the cloudIlir Fetai, Alexander Stiemer, Heiko Schuldt. 405-414 [doi]
- Sequential algorithms to split and merge ultra-high resolution 3D imagesValérie Hayot-Sasson, Yongping Gao, Yuhong Yan, Tristan Glatard. 415-424 [doi]
- Spatiotemporal range pattern queries on large-scale co-movement pattern datasetsShahab Helmi, Farnoush Banaei Kashani. 425-434 [doi]
- Towards robust models of food flows and their role in invasive species spreadSrinivasan Venkatramanan, Sichao Wu, Bowen Shi, Achla Marathe, Madhav Marathe, Stephen Eubank, Lalit P. Sah, A. P. Giri, Luke A. Colavito, K. S. Nitin, V. Sridhar, R. Asokan, Rangaswamy Muniappan, G. Norton, Abhijin Adiga. 435-444 [doi]
- A single-node datastore for high-velocity multidimensional sensor dataJuan A. Colmenares, Reza Dorrigiv, Daniel G. Waddington. 445-452 [doi]
- Model driven reverse engineering of NoSQL property graph databases: The case of Neo4jIsabelle Comyn-Wattiau, Jacky Akoka. 453-458 [doi]
- Universal distant reading through metadata proxies with archivesparkHelge Holzmann, Vinay Goel, Emily Novak Gustainis. 459-464 [doi]
- Big data transfer optimization based on offline knowledge discovery and adaptive samplingMd S. Q. Zulkar Nine, Kemal Guner, Ziyun Huang, Xiangyu Wang, Jinhui Xu, Tevfik Kosar. 465-472 [doi]
- A closed-loop deep learning architecture for robust activity recognition using wearable sensorsRamyar Saeedi, Skyler Norgaard, Assefaw Hadish Gebremedhin. 473-479 [doi]
- CStorage: An efficient classification-based image storage system in cloud datacentersHaiying Shen, Heng Zhou. 480-485 [doi]
- In-depth exploration of single-snapshot lossy compression techniques for N-body simulationsDingwen Tao, Sheng Di, Zizhong Chen, Franck Cappello. 486-493 [doi]
- Reliable fake review detection via modeling temporal and behavioral patternsXian Wu, Yuxiao Dong, Jun Tao, Chao Huang, Nitesh V. Chawla. 494-499 [doi]
- Efficient diversified set monitoring for mobile sensor stream environmentsMasahiro Yokoyama, Takahiro Hara, Sanjay Kumar Madria. 500-507 [doi]
- Low-rank singular value thresholding for recovering missing air quality dataYangwen Yu, James J. Q. Yu, Victor O. K. Li, Jacqueline C. K. Lam. 508-513 [doi]
- Visual analytics with unparalleled variety scaling for big earth dataLina Yu, Michael L. Rilee, Yu Pan, Feiyu Zhu, Kwo-Sen Kuo, Hongfeng Yu. 514-521 [doi]
- Semi-supervised convolutional neural networks for human activity recognitionMing Zeng, Tong Yu, Xiao Wang, Le T. Nguyen, Ole J. Mengshoel, Ian Lane. 522-529 [doi]
- Detecting unmetered taxi rides from trajectory dataXibo Zhou, Ye Ding, Fengchao Peng, Qiong Luo 0001, Lionel M. Ni. 530-535 [doi]
- Estimation of distance-based metrics for very large graphs with MinHash SignaturesGiambattista Amati, Simone Angelini, Giorgio Gambosi, Gianluca Rossi, Paola Vocca. 536-545 [doi]
- High-performance geometric algorithms for sparse computation in big data analyticsPhilipp Baumann, Dorit S. Hochbaum, Quico Spaen. 546-555 [doi]
- Active learning based news veracity detection with feature weighting and deep-shallow fusionSreyasee Das Bhattacharjee, Ashit Talukder, Bala Venkatram Balantrapu. 556-565 [doi]
- Exploiting visual and textual neighborhood information to improve image-tag relevanceChandramani Chaudhary, Poonam Goyal, Yi-Ping Phoebe Chen. 566-575 [doi]
- Inverse extreme learning machine for learning with label proportionsLimeng Cui, Jiawei Zhang, Zhensong Chen, Yong Shi, Philip S. Yu. 576-585 [doi]
- E-CLoG: Counting edge-centric local graphletsVachik S. Dave, Nesreen K. Ahmed, Mohammad Al Hasan. 586-595 [doi]
- Multistream regression with asynchronous concept drift detectionBo Dong, Yifan Li, Yang Gao, Ahsanul Haque, Latifur Khan, Mohammad M. Masud. 596-605 [doi]
- Bias correction in clustering coefficient estimationRoohollah Etemadi, Jianguo Lu. 606-615 [doi]
- Closed walk sampler: An efficient method for estimating the spectral radius of large graphsGuyue Han, Harish Sethu. 616-625 [doi]
- Online city-scale hyper-local event detection via analysis of social media and human mobilityJun Hu, Yuxin Wang, Ping Li. 626-635 [doi]
- Drum: A rhythmic approach to interactive analytics on large dataJianfeng Jia, Chen Li 0001, Michael J. Carey 0001. 636-645 [doi]
- Detecting changes in streaming data with information-theoretic windowingRyoya Kaneko, Kohei Miyaguchi, Kenji Yamanishi. 646-655 [doi]
- Hybrid algorithms for subgraph pattern queries in graph databasesFoteini Katsarou, Nikos Ntarmos, Peter Triantafillou. 656-665 [doi]
- Domain-specific hierarchical subgraph extraction: A recommendation use caseSarasi Lalithsena, Sujan Perera, Pavan Kapanipathi, Amit P. Sheth. 666-675 [doi]
- COEUS: Community detection via seed-set expansion on graph streamsPanagiotis Liakos, Alexandros Ntoulas, Alex Delis. 676-685 [doi]
- Rhea: Adaptively sampling authoritative content from social activity streamsPanagiotis Liakos, Alexandros Ntoulas, Alex Delis. 686-695 [doi]
- Text-based geolocation prediction of social media users with neural networksIsmini Lourentzou, Alex Morales, ChengXiang Zhai. 696-705 [doi]
- Crack random forest for arbitrary large datasetsAlessandro Lulli, Luca Oneto, Davide Anguita. 706-715 [doi]
- S-Isomap++: Multi manifold learning from streaming dataSuchismit Mahapatra, Varun Chandola. 716-725 [doi]
- A scalable model for tracking topical evolution in large document collectionsSheikh Motahar Naim, Arnold P. Boedihardjo, M. Shahriar Hossain. 726-735 [doi]
- Error-robust multi-view clusteringMehrnaz Najafi, LiFang He, Philip S. Yu. 736-745 [doi]
- Holistic and scalable ranking of RDF dataAxel-Cyrille Ngonga Ngomo, Michael Hoffmann, Ricardo Usbeck, Kunal Jha. 746-755 [doi]
- A comparative study of matrix factorization and random walk with restart in recommender systemsHaekyu Park, Jinhong Jung, U. Kang. 756-765 [doi]
- VIGAN: Missing view imputation with generative adversarial networksChao Shang, Aaron Palmer, Jiangwen Sun, Ko-Shin Chen, Jin Lu, Jinbo Bi. 766-775 [doi]
- Tiered sampling: An efficient method for approximate counting sparse motifs in massive graph streamsLorenzo De Stefani, Erisa Terolli, Eli Upfal. 776-786 [doi]
- A fast non-volatile memory aware algorithm for generating random scale-free networksCheng-Chin Tu, Mi-Yen Yeh, Tei-Wei Kuo. 787-796 [doi]
- MRAttractor: Detecting communities from large-scale graphsNguyen Vo, Kyumin Lee, Thanh Tran. 797-806 [doi]
- Potentiality of healthcare big data: Improving search by automatic query reformulationYueyao Wang, Qinmin Hu, Yang Song, Liang He. 807-816 [doi]
- Sampling algorithms to update truncated SVDIchitaro Yamazaki, Stanimire Tomov, Jack J. Dongarra. 817-826 [doi]
- Distributed Top-N local outlier detection in big dataYizhou Yan, Lei Cao, Elke A. Rundensteiner. 827-836 [doi]
- Rectangular hash table: Bloom filter and bitmap assisted hash table with high speedTong Yang, Binchao Yin, Hang Li, Muhammad Shahzad, Steve Uhlig, Bin Cm, Xiaoming Li. 837-846 [doi]
- Large-scale joint topic, sentiment & user preference analysis for online reviewsXinli Yu, Zheng Chen, Wei-Shih Yang, Xiaohua Hu, Erjia Yan, Guangrong Li. 847-856 [doi]
- ImWalkMF: Joint matrix factorization and implicit walk integrative learning for recommendationChuxu Zhang, Lu Yu, Xiangliang Zhang, Nitesh Chawla. 857-866 [doi]
- Hierarchical collaborative embedding for context-aware recommendationsLei Zheng, Bokai Cao, Vahid Noroozi, Philip S. Yu, Nianzu Ma. 867-876 [doi]
- Mining pros and cons of actions from social media for decision supportEbad Ahmadzadeh, Philip K. Chan. 877-882 [doi]
- Distributed Bayesian piecewise sparse linear modelsMasato Asahara, Ryohei Fujimaki. 883-888 [doi]
- Linear-complexity relaxed word Mover's distance with GPU accelerationKubilay Atasu, Thomas P. Parnell, Celestine Dünner, Manolis Sifalakis, Haralampos Pozidis, Vasileios Vasileiadis, Michail Vlachos, Cesar Berrospi, Abdel Labbi. 889-896 [doi]
- Quality-efficiency trade-offs in machine learning for text processingRicardo A. Baeza-Yates, Zeinab Liaghat. 897-904 [doi]
- Fast graph scan statistics optimization using algebraic fingerprintsJose Cadena, Saliya Ekanayake, Anil Vullikanti. 905-910 [doi]
- A distributed rough set theory based algorithm for an efficient big data pre-processing under the spark frameworkZaineb Chelly Dagdia, Christine Zarges, Gaël Beck, Mustapha Lebbah. 911-916 [doi]
- Judicious setting of Dynamic Time Warping's window width allows more accurate classification of time seriesHoang Anh Dau, Diego Furtado Silva, François Petitjean, Germain Forestier, Anthony Bagnall, Eamonn J. Keogh. 917-922 [doi]
- Toward granular knowledge analytics for data intelligence: Extracting granular entity-relationship graphs for knowledge profilingAlexander Denzler, Michael Kaufmann. 923-928 [doi]
- Distributed decision tree v.2.0Ankit Desai, Sanjay Chaudhary. 929-934 [doi]
- An open-source tool for the transcription of paper-spreadsheet data: Code and supplemental materials available online: Https: //github.com/deskool/images to spreadsheetsMohammad M. Ghassemi, Willow Jarvis, Tuka Alhanai, Emery N. Brown, Roger G. Mark, M. Brandon Westover. 935-941 [doi]
- AnyFI: An anytime frequent itemset mining algorithm for data streamsPoonam Goyal, Jagat Sesh Challa, Shivin Shrivastava, Navneet Goyal. 942-947 [doi]
- Discovering potential traffic risks in Japan using a supervised learning approachTatsuru Kobayashi, Shin Matsushima, Taito Lee, Kenji Yamanishi. 948-955 [doi]
- Data context informed data wranglingMartin Koehler, Alex Bogatu, Cristina Civili, Nikolaos Konstantinou 0001, Edward Abel, Alvaro A. A. Fernandes, John A. Keane, Leonid Libkin, Norman W. Paton. 956-963 [doi]
- Fishing in the stream: Similarity search over endless dataNaama Kraus, David Carmel, Idit Keidar. 964-969 [doi]
- Graphical approach for influence maximization in social networks under generic threshold-based non-submodular modelLiang Ma, Guohong Cao, Lance M. Kaplan. 970-975 [doi]
- A distributed k-core decomposition algorithm on sparkAritra Mandal, Mohammad Al Hasan. 976-981 [doi]
- Event pattern discovery by keywords in graph streamsMohammad Hossein Namaki, Peng Lin, Yinghui Wu. 982-987 [doi]
- Queryable compression on streaming social networksMichael Nelson, Sridhar Radhakrishnan, Amlan Chatterjee, Chandra N. Sekharan. 988-993 [doi]
- Event-based non-parametric clustering of team sport trajectoriesFengchao Peng, Yudian Ji, Qiong Luo 0001, Lionel M. Ni. 994-999 [doi]
- Application-specific graph sampling for frequent subgraph mining and community detectionSumit Purohit, Sutanay Choudhury, Lawrence B. Holder. 1000-1005 [doi]
- Discovering co-occurrence patterns of heterogeneous events from unevenly-distributed spatiotemporal dataHung Tran-The, Koji Zettsu. 1006-1011 [doi]
- Micro-clustering by data polishingTakeaki Uno, Hiroki Maegawa, Takanobu Nakahara, Yukinobu Hamuro, Ryo Yoshinaka, Makoto Tatsuta. 1012-1018 [doi]
- Bringing semantic structures to user intent detection in online medical queriesChenwei Zhang, Nan Du, Wei Fan, Yaliang Li, Chun-Ta Lu, Philip S. Yu. 1019-1026 [doi]
- Large-scale point-of-interest category prediction using natural language processing modelsDaniel Yue Zhang, Dong Wang, Hao Zheng, Xin Mu, Qi Li, Yang Zhang. 1027-1032 [doi]
- Shade: A differentially-private wrapper for enterprise big dataAlexander Heifetz, Vaikkunth Mugunthan, Lalana Kagal. 1033-1042 [doi]
- Group privacy-aware disclosure of association graph dataBalaji Palanisamy, Chao Li, Prashant Krishnamurthy. 1043-1052 [doi]
- Contaminant removal for Android malware detection systemsLichao Sun, Xiaokai Wei, Jiawei Zhang, LiFang He, Philip S. Yu, Witawas Srisa-an. 1053-1062 [doi]
- Boosting the phishing detection performance by semantic analysisXi Zhang, Yu Zeng, Xiao-Bo Jin, Zhiwei Yan, Guang-Gang Geng. 1063-1070 [doi]
- Setting the threshold for high throughput detectors: A mathematical approach for ensembles of dynamic, heterogeneous, probabilistic anomaly detectorsRobert A. Bridges, Jessie D. Jamieson, Joel W. Reed. 1071-1078 [doi]
- Weatherman: Exposing weather-based privacy threats in big energy dataDong Chen, David E. Irwin. 1079-1086 [doi]
- Discrimination detection by causal effect estimationJiuyong Li, Jixue Liu, Lin Liu 0003, Thuc Duy Le, Saisai Ma, Yizhao Han. 1087-1094 [doi]
- WEAC: Word embeddings for anomaly classification from event logsAmit Pande, Vishal Ahuja. 1095-1100 [doi]
- Privacy-protected place of activity mining on big location dataShuo Wang, Richard O. Sinnott, Surya Nepal. 1101-1108 [doi]
- Sensitive gazetteer discovery and protection for mobile social media usersShuo Wang, Richard O. Sinnott, Surya Nepal. 1109-1116 [doi]
- Differentially private query learning: From data publishing to model publishingTianqing Zhu, Ping Xiong, Gang Li, Wanlei Zhou, Philip S. Yu. 1117-1122 [doi]
- The ML test score: A rubric for ML production readiness and technical debt reductionEric Breck, Shanqing Cai, Eric Nielsen, Michael Salib, D. Sculley. 1123-1132 [doi]
- BTCI: A new framework for identifying congestion cascades using bus trajectory dataMeng-Fen Chiang, Ee-Peng Lim, Wang-Chien Lee, Agus Trisnajaya Kwee. 1133-1142 [doi]
- Application of big data analytics in process safety and risk managementPankaj Goel, Aniruddha Datta, M. Sam Mannan. 1143-1152 [doi]
- Enabling versatile analysis of large scale traffic video data with deep learning and HiveQLLei Huang, Weijia Xu, Si Liu, Venktesh Pandey, Natalia Ruiz-Juri. 1153-1162 [doi]
- Fast interpolation of grid data at a non-grid pointHiroshi Inoue. 1163-1172 [doi]
- Joint sparse auto-encoder: A semi-supervised spatio-temporal approach in mapping large-scale croplandsXiaowei Jia, Yifan Hu, Ankush Khandelwal, Anuj Karpatne, Vipin Kumar. 1173-1182 [doi]
- Multi-step prediction with missing smart sensor data using multi-task Gaussian processesPasan Karunaratne, Masud Moshtaghi, Shanika Karunasekera, Aaron Harwood, Trevor Cohn. 1183-1192 [doi]
- Bayesian multi-view models for member-job matching and personalized skill recommendationsAbhinav Maurya, Rahul Telang. 1193-1202 [doi]
- Automated scalable detection of location-specific Santa Ana conditions from weather data using unsupervised learningMai H. Nguyen, Daniel Crawl, Jiaxin Li, Dylan Uys, Ilkay Altintas. 1203-1212 [doi]
- HealthEdge: Task scheduling for edge computing with health emergency and human behavior consideration in smart homesHaoyu Wang, Jiaqi Gong, Yan Zhuang, Haiying Shen, John Lach. 1213-1222 [doi]
- Connecting emerging relationships from news via tensor factorizationJingyuan Zhang, Chun-Ta Lu, Bokai Cao, Yi Chang, Philip S. Yu. 1223-1232 [doi]
- LSTM for septic shock: Adding unreliable labels to reliable predictionsYuan Zhang, Chen Lin, Min Chi, Julie S. Ivy, Muge Capan, Jeanne M. Huddleston. 1233-1242 [doi]
- A data-driven congestion diffusion model for characterizing traffic in metrocity scalesBaoxin Zhao, Chengzhong Xu, Siyuan Liu. 1243-1252 [doi]
- Analysis of the term 'big data': Usage in biomedical publicationsAllard J. Van Altena, Perry D. Moerland, Aeilko H. Zwinderman, Sílvia D. Olabarriaga. 1253-1258 [doi]
- Predicting treatment repetitions in the implant denture therapy processMarzieh Bakhshandeh, Dennis M. M. Schunselaar, Henrik Leopold, Hajo A. Reijers. 1259-1264 [doi]
- Personalized flight recommendations via paired choice modelingJian Cao, Fangzhou Yang, Yuchang Xu, Yudong Tan, Quan-Wu Xiao. 1265-1270 [doi]
- Seq2Img: A sequence-to-image based approach towards IP traffic classification using convolutional neural networksZhitang Chen, Ke He, Jian Li, Yanhui Geng. 1271-1276 [doi]
- OReONet: Deep convolutional network for oil reservoir optimizationChung-Ming Cheung, Palash Goyal, Viktor K. Prasanna, Arash Saber Tehrani. 1277-1282 [doi]
- A data-driven approach to predict NOx-emissions of gas turbinesGiuseppe Cuccu, Somayeh Danafar, Philippe Cudré-Mauroux, Martin Gassner, Stefano Bernero, Krzysztof Kryszczuk. 1283-1288 [doi]
- Two-level clustering fast betweenness centrality computation for requirement-driven approximationAngelo Furno, Nour-Eddin El Faouzi, Rajesh Sharma, Eugenio Zimeo. 1289-1294 [doi]
- Cellular network configuration via online learning and joint optimizationXueying Guo, George Trimponias, Xiaoxiao Wang, Zhitang Chen, Yanhui Geng, Xin Liu. 1295-1300 [doi]
- T-BMIRT: Estimating representations of student knowledge and educational components in online educationJiankun Huang, Wenjun Wu. 1301-1306 [doi]
- Forecasting the rise and fall of volatile point-of-interestsXinjiang Lu, Zhiwen Yu, Chuanren Liu, Yanchi Liu, Hui Xiong, Bin Guo. 1307-1312 [doi]
- Predicting regional economic indices using big data of individual bank card transactionsStanislav Sobolevsky, Emanuele Massaro, Iva Bojic, Juan Murillo Arias, Carlo Ratti. 1313-1318 [doi]
- Travel purpose inference with GPS trajectories, POIs, and geo-tagged social media dataChuishi Meng, Yu Cui, Qing He, Lu Su, Jing Gao. 1319-1324 [doi]
- Discovering scientific influence using cross-domain dynamic topic modelingJennifer Sleeman, Milton Halem, Tim Finin, Mark Cane. 1325-1332 [doi]
- RePAIR: Recommend political actors in real-time from news websitesMohiuddin Solaimani, Sayeed Salam, Latifur Khan, Patrick T. Brandt, Vito D'Orazio. 1333-1340 [doi]
- Personalized travel mode detection with smartphone sensorsXing Su, Yuan Yao, Qing He, Jie Lu, Hanghang Tong. 1341-1348 [doi]
- A comparative analysis of state-of-the-art SQL-on-Hadoop systems for interactive analyticsAshish Tapdiya, Daniel Fabbri. 1349-1356 [doi]
- Identifying and quantifying nonlinear structured relationships in complex manufactural systemsTingyang Xu, Tan Yan, Dongjin Song, Wei Cheng, Haifeng Chen, Geoff Jiang, Jinbo Bi. 1357-1362 [doi]
- OTPS: A decision support service for optimal airfare Ticket PurchaseYuchang Xu, Jian Cao. 1363-1368 [doi]
- Product function need recognition via semi-supervised attention networkHu Xu, Sihong Xie, Lei Shu, Philip S. Yu. 1369-1374 [doi]
- Exploring the dynamics of surge pricing in mobility-on-demand taxi servicesWenbo Zhang, Dheeraj Kumar, Satish V. Ukkusuri. 1375-1380 [doi]
- Application of dynamic logistic regression with unscented Kalman filter in predictive codingYihua Shi Astle, Xuning Tang, Craig Freeman. 1381-1389 [doi]
- RAVEN: Web-based smart home exploration system through interactive pattern discoveryMansurul Alam Bhuiyan, Mohammad Al Hasan. 1390-1399 [doi]
- Implementing scalable structured machine learning for big data in the SAKE projectSimon Bin, Patrick Westphal, Jens Lehmann, Axel Ngonga. 1400-1407 [doi]
- Fast botnet detection from streaming logs using online lanczos methodZheng Chen, Xinli Yu, Chi Zhang, Jin Zhang, Cui Lin, Bo Song, Jianliang Gao, Xiaohua Hu, Wei-Shih Yang, Erjia Yan. 1408-1417 [doi]
- Representativeness of latent dirichlet allocation topics estimated from data samples with application to common crawlYuheng Du, Alexander Herzog, André Luckow, Ramu Nerella, Christopher Gropp, Amy W. Apon. 1418-1427 [doi]
- Empirical evaluations of active learning strategies in legal document reviewRishi Chhatwal, Nathaniel Huber-Fliflet, Robert Keeling, Jianping Zhang, Haozhen Zhao. 1428-1437 [doi]
- Topic models for RFID data modeling and localizationT. F. Kennedy, Robert S. Provence, James L. Broyan, Patrick W. Fink, Phong H. Ngo, Lazaro D. Rodriguez. 1438-1446 [doi]
- What is skipped: Finding desirable items in e-commerce search by discovering the worst title tokensIshita K. Khan, Prathyusha Senthil Kumar, Daniel Miranda, David Goldberg. 1447-1456 [doi]
- Ranking the importance of ontology concepts using document summarization techniquesYoungho Kim, Petros Zerfos, Vadim Sheinin, Nancy Greco. 1457-1466 [doi]
- Performance optimization in scale-out storage using design of experiment as heuristicLay Wai Kong. 1467-1474 [doi]
- A study on intelligent personalized push notification with user historyHyunjong Lee, Youngin Jo, Sanghyuk Chun, Kwangseob Kim. 1475-1482 [doi]
- Reuters tracer: Toward automated news production using large scale social media dataXiaomo Liu, Armineh Nourbakhsh, Quanzhi Li, Sameena Shah, Robert Martin, John Duprey. 1483-1493 [doi]
- Integrated access to big data polystores through a knowledge-driven frameworkJustin McHugh, Paul E. Cuddihy, Jenny Weisenberg Williams, Kareem S. Aggour, Vijay S. Kumar, Varish Mulwad. 1494-1503 [doi]
- Predicting over-indebtedness on batch and streaming dataJacob Montiel, Albert Bifet, Talel Abdessalem. 1504-1513 [doi]
- APP-SON: Application characteristics-driven SON to optimize 4G/5G network performance and quality of experienceYe Ouyang, Zhongyuan Li, Le Su, Wenyuan Lu, Zhenyi Lin. 1514-1523 [doi]
- A configurable, big data system for on-demand healthcare cost predictionKarthikeyan Natesan Ramamurthy, Dennis Wei, Emily Ray, Moninder Singh, Vijay Iyengar, Dmitriy A. Katz-Rogozhnikov, Jingwei Yang, Kevin N. Tran, Gigi Y. Yuen-Reed. 1524-1533 [doi]
- Dependency analysis of cloud applications for performance monitoring using recurrent neural networksSyed Yousaf Shah, Zengwen Yuan, Songwu Lu, Petros Zerfos. 1534-1543 [doi]
- Help me find a job: A graph-based approach for job recommendation at scaleWalid Shalaby, BahaaEddin AlAila, Mohammed Korayem, Layla Pournajaf, Khalifeh AlJadda, Shannon Quinn, Wlodek Zadrozny. 1544-1553 [doi]
- Flux: Groupon's automated, scalable, extensible machine learning platformDerrick C. Spell, Xiao Han T. Zeng, Jae Young Chung, Bahador Nooraei, Richard T. Shomer, Ling-Yong Wang, James C. Gibson, Daniel Kirsche. 1554-1559 [doi]
- A data-driven approach for multivariate contextualized anomaly detection: Industry use caseNenad Stojanovic, Marko Dinic, Ljiljana Stojanovic. 1560-1569 [doi]
- A cognitive assistant for risk identification and modelingDharmashankar Subramanian, Debarun Bhattacharjya, Ruben Rodriguez Torrado, Jeffrey O. Kephart, Vijil Chenthamarakshan, Jesus Rios. 1570-1579 [doi]
- Scalable time-versioning support for property graph databasesWarut D. Vijitbenjaronk, Jinho Lee, Toyotaro Suzumura, Gabriel Tanase. 1580-1589 [doi]
- Trendi: Tracking stories in news and microblogs via emerging, evolving and fading topicsXuchao Zhang, Liang Zhao, Zhiqian Chen, Arnold P. Boedihardjo, Jing Dai, Chang-Tien Lu. 1590-1599 [doi]
- SMART: Sponsored mobile app recommendation by balancing app downloads and appstore profitZhiwei Zhang, Ning Chen, Jun Wang, Luo Si. 1600-1609 [doi]
- A gamma-based regression for winning price estimation in real-time bidding advertisingWen-Yuan Zhu, Wen-Yueh Shih, Ying-Hsuan Lee, Wen-Chih Peng, Jiun-Long Huang. 1610-1619 [doi]
- Demystifying dark matter for online experimentationNirupama Appiktala, Miao Chen, Michael Natkovich, Joshua J. Walters. 1620-1626 [doi]
- Detecting and summarizing emergent events in microblogs and social media streams by dynamic centralitiesNeela Avudaiappan, Alexander Herzog, Sneha Kadam, Yuheng Du, Jason Thatche, Ilya Safro. 1627-1634 [doi]
- Faster online experimentation by eliminating traditional A/A validationRussell Chen, Miao Chen, Mahendrasinh Ramsinh Jadav, Joonsuk Bae, Don Matheson. 1635-1641 [doi]
- BBC: A DSL for designing cloud-based heterogeneous bigdata pipelinesFerosh Jacob, Ilamgumaran Karunanithi, Pramod Salian, Ravi Sambhu. 1642-1645 [doi]
- Architectural considerations for highly scalable computing to support on-demand video analyticsGeorge Mathew. 1646-1649 [doi]
- Scalable distributed change detection and its application to maritime trafficLeonardo M. Millefiori, Paolo Braca, Gianfranco Arcieri. 1650-1657 [doi]
- Connected health: Opportunities and challengesAnkita R. Nambiar, Nikitha Reddy, Debojyoti Dutta. 1658-1662 [doi]
- Predictive edge computing for time series of industrial IoT and large scale critical infrastructure based on open-source software analytic of big dataEmmanuel Oyekanlu. 1663-1669 [doi]
- Linking many unusual co-incidencesKevin B. Pratt. 1670-1675 [doi]
- On event-driven knowledge graph completion in digital factoriesMartin Ringsquandl, Evgeny Kharlamov, Daria Stepanova, Steffen Lamparter, Raffaello Lepratti, Ian Horrocks, Peer Kröger. 1676-1681 [doi]
- Knowledge extraction from maritime spatiotemporal data: An evaluation of clustering algorithms on Big DataGiannis Spiliopoulos, Konstantinos Chatzikokolakis, Dimitrios Zissis, Evmorfia Biliri, Dimitrios Papaspyros, Giannis Tsapelas, Spyros Mouzakitis. 1682-1687 [doi]
- TRACES: Generating Twitter stories via shared subspace and temporal smoothnessXuchao Zhang, Zhiqian Chen, Liang Zhao, Arnold P. Boedihardjo, Chang-Tien Lu. 1688-1693 [doi]
- Tracking and predicting the evolution of research topics in scientific literatureChristine Balili, Aviv Segev, Uichin Lee. 1694-1697 [doi]
- Towards a semantic keyword search over industrial knowledge graphs (extended abstract)Gong Cheng, Evgeny Kharlamov. 1698-1700 [doi]
- Designing a high performance cluster for large-scale SQL-on-hadoop analyticsAjay Dholakia, Prasad Venkatachar, Kshitij Doshi, Ravikanth Durgavajhala, Stewart Tate, Berni Schiefer, Matthew Sheard, Ramnath Sai Sagar. 1701-1703 [doi]
- Real time semantic enrichment of broadcast content in the big data ageMaurizio Montagnuolo, Alberto Messina, Nicolo Bidotti, Paolo Platter, Alessio Bosca. 1704-1708 [doi]
- On the improvement of classifying EEG recordings using neural networksYiran Zhao, Shuochao Yao, Shaohan Hu, Shiyu Chang, Raghu K. Ganti, Mudhakar Srivatsa, Shen Li, Tarek F. Abdelzaher. 1709-1711 [doi]
- A robust internet abuse detection methodZhou Fa, Guang-Gang Geng, Zhiwei Yan, Xiaodong Lee. 1712-1715 [doi]
- Manufacturing and contract service networks: Composition, optimization and tradeoff analysis based on a reusable repository of performance modelsAlexander Brodsky 0001, Mohan Krishnamoorthy, M. Omar Nachawati, William Z. Bernstein, Daniel A. Menascé. 1716-1725 [doi]
- Automatic localization of casting defects with convolutional neural networksMax Ferguson, Ronay Ak, Yung-Tsun Tina Lee, Kincho H. Law. 1726-1735 [doi]
- A data-driven approach for improving sustainability assessment in advanced manufacturingYunpeng Li, Heng Zhang, Utpal Roy, Y. Tina Lee. 1736-1745 [doi]
- Issues in synthetic data generation for advanced manufacturingDon Libes, David Lechevalier, Sanjay Jain. 1746-1754 [doi]
- Estimation of online tool wear in turning processes using recurrence quantification analysis (RQA)Srinivasan Radhakrishnan, Yung-Tsun Tina Lee, Sagar Kamarthi. 1755-1759 [doi]
- Statistically-substantiated density characterizations of additively manufactured steel alloys through verification, validation, and uncertainty quantificationHeather M. Reed, Richard P. Vinci, Corbin Robeck, Trevor Verdonik, Michael Pires, Maria Castro, Wojciech Z. Misiolek, Christina Viau Haden. 1760-1768 [doi]
- Hybrid datafication of maintenance logs from AI-assisted human tagsThurston Sexton, Michael P. Brundage, Michael Hoffman, K. C. Morris. 1769-1777 [doi]
- Data treatment from the viewpoint of granular computingAkinori Abe, Yuki Hayashi. 1778-1785 [doi]
- Big-data-enabled modelling and optimization of granular speed-based vessel schedule recovery problemFatemeh Cheraghchi, Ibrahim Y. Abualhaol, Rafael Falcon, Rami S. Abielmona, Bijan Raahemi, Emil M. Petriu. 1786-1794 [doi]
- Improving text classification with word embeddingLihao Ge, Teng-Sheng Moh. 1796-1805 [doi]
- On the role of feature space granulation in feature selection processesMarek Grzegorowski, Andrzej Janusz, Dominik Slezak, Marcin S. Szczuka. 1806-1815 [doi]
- Quasi-erasable itemset miningTzung-Pei Hong, Lu-Hung Chen, Shyue-Liang Wang, Chun-Wei Lin, Bay Vo. 1816-1820 [doi]
- Secure information flow and file movements: A topological theory of discretionary access controlsTsau-Young T. Y. Lin, Pierre Vachon. 1821-1829 [doi]
- Unsupervised deep embedding for novel class detection over data streamAhmad M. Mustafa, Gbadebo Ayoade, Khaled Al-Naami, Latifur Khan, Kevin W. Hamlen, Bhavani M. Thuraisingham, Frederico Araujo. 1830-1839 [doi]
- Scalable cyber-security analytics with a new summary-based approximate query engineDominik Slezak, Agnieszka Chadzynska-Krasowska, Joel Holland, Piotr Synak, Rick Glick, Marcin Perkowski. 1840-1849 [doi]
- Mining text for disease diagnosis in hospital information systemShusaku Tsumoto, Tomohiro Kimura, Haruko Iwata, Shoji Hirano. 1850-1859 [doi]
- Noise self-filtering K-nearest neighbors algorithmsShuyin Xia, Guoyin Wang, Yunsheng Liur, Qun Liu, Hong Yu. 1860-1965 [doi]
- A preliminary study on deep learning for predicting social insurance payment behaviorJosh Jia-Ching Ying, Po-Yu Huang, Chih-Kai Chang, Don-Lin Yang. 1866-1875 [doi]
- Effects of language processing in Turkish authorship attributionHayri Volkan Agun, Sibel Yilmazel, Ozgur Yilmazel. 1876-1881 [doi]
- Event detection from time-series streams using directional change and dynamic thresholdsNora Alkhamees, Maria Fasli. 1882-1891 [doi]
- Real-time Lexicon-based sentiment analysis experiments on Twitter with a mild (more information, less data) approachYusuf Arslan, Aysenur Birturk, Bekjan Djumabaev, Dilek Küçük. 1892-1897 [doi]
- A comparative study on learning to rank with computational methodsInci Batmaz, Pinar Karagoz, Gulsah Serdar. 1898-1906 [doi]
- Semi-supervised learning and social media text analysis towards multi-labeling categorizationBelainine Billal, Alexsandro Fonseca, Fatiha Sadat, Hakim Lounis. 1907-1916 [doi]
- B3SafirBiyo: Genomic variant analysis with big data technologiesTugce Dongel, Yasemin Timar. 1917-1925 [doi]
- A data-driven approach to help understanding the preferences of public transport usersVasco Furtado, Elizabeth Furtado, Carlos Caminha, André Lopes, Victor Dantas, Caio Ponte, Sofia Cavalcante. 1926-1935 [doi]
- Recovering loss to followup information using denoising autoencodersLovedeep Gondara, Ke Wang. 1936-1945 [doi]
- A recommender model based on trust value and time decay: Improve the quality of product rating score in E-commerce platformsMuhittin Isik, Hasan Dag. 1946-1955 [doi]
- Focus location extraction from political news reports with bias correctionMaryam Bahojb Imani, Swarup Chandra, Samuel Ma, Latifur Khan, Bhavani M. Thuraisingham. 1956-1964 [doi]
- Augmenting word embeddings through external knowledge-base for biomedical applicationKishlay Jha, Guangxu Xun, Vishrawas Gopalakrishnan, Aidong Zhang. 1965-1974 [doi]
- Big data impact on stability and reliability improvement of smart gridShady S. Refaat, Amira Mohamed, Haitham Abu-Rub. 1975-1982 [doi]
- A deep learning model for air quality prediction in smart citiesIbrahim Kok, Mehmet Ulvi Simsek, Suat Özdemir. 1983-1990 [doi]
- Graph-based information exploration over structured and unstructured dataGiannis V. Koumoutsos, Maria Fasli, Ian Lewin, David Milward. 1991-2000 [doi]
- Convolutional neural network for clinical narrative categorizationPaula Lauren, Guangzhi Qu, Paul Watta. 2001-2008 [doi]
- ClusTop: A clustering-based topic modelling algorithm for twitter using word networksKwan Hui Lim, Shanika Karunasekera, Aaron Harwood. 2009-2018 [doi]
- A natural language normalization approach to enhance social media text reasoningLong Hoang Nguyen, Andrew Salopek, Liang Zhao, Fang Jin. 2019-2026 [doi]
- Using meta-learning for model type selection in predictive big data analyticsMustafa V. Nural, Hao Peng, John A. Miller. 2027-2036 [doi]
- Weather data analysis and sensor fault detection using an extended IoT framework with semantics, big data, and machine learningAras Can Onal, Omer Berat Sezer, A. Murat Ozbayoglu, Erdogan Dogdu. 2037-2046 [doi]
- Understanding what affects career progression using linkedin and twitter dataYiming Pan, Xuefeng Peng, Tianran Hu, Jiebo Luo. 2047-2055 [doi]
- A distributed proximal gradient descent method for tensor completionT. Papastergiou, V. Megalooikonomou. 2056-2065 [doi]
- Predicting high taxi demand regions using social media check-insXuefeng Peng, Yiming Pan, Jiebo Luo. 2066-2075 [doi]
- Sleep-deprived fatigue pattern analysis using large-scale selfies from social mediaXuefeng Peng, Jiebo Luo, Catherine Glenn, Li-Kai Chi, Jingyao Zhan. 2076-2084 [doi]
- Mathematical programming for social network analysisHarun Pirim. 2085-2088 [doi]
- Unsupervised deep learning for subspace clusteringAli Sekmen, Ahmet Bugra Koku, Mustafa Parlaktuna, Ayad Abdul-Malek, Nagendrababu Vanamala. 2089-2094 [doi]
- Principal coordinate clusteringAli Sekmen, Akram Aldroubi, Ahmet Bugra Koku, Keaton Hamm. 2095-2101 [doi]
- Estimation of parameters for the free-form machining with deep neural networkGokberk Serin, M. Ugur Gudelek, A. Murat Ozbayoglu, Hakki Özgür Ünver. 2102-2111 [doi]
- Towards MapReduce based Bayesian deep learning network for monitoring big data applicationsM. Omair Shafiq, Eric Torunski. 2112-2121 [doi]
- Mined semantic analysis: A new concept space model for semantic representation of textual dataWalid Shalaby, Wlodek Zadrozny. 2122-2131 [doi]
- Online video ad measurement for political science researchAdisak Sukul, Baskar Gopalakrishnan, Wallapak Tavanapong, David A. M. Peterson. 2132-2140 [doi]
- DxNAT - Deep neural networks for explaining non-recurring traffic congestionFangzhou Sun, Abhishek Dubey, Jules White. 2141-2150 [doi]
- A filter-based feature selection model for anomaly-based intrusion detection systemsImtiaz Ullah, Qusay H. Mahmoud. 2151-2159 [doi]
- A hybrid model for anomaly-based intrusion detection in SCADA networksImtiaz Ullah, Qusay H. Mahmoud. 2160-2167 [doi]
- What's trending tomorrow, today: Using early adopters to discover popular posts on TumblrDaniel Xie, JieJun Xu, Tsai-Ching Lu. 2168-2176 [doi]
- Harvey flooding rescue in social mediaZhou Yang, Long Hoang Nguyen, Joshua Stuve, Guofeng Cao, Fang Jin. 2177-2185 [doi]
- A review on cyber security datasets for machine learning algorithmsOzlem Yavanoglu, Murat Aydos. 2186-2193 [doi]
- One-shot learning for fine-grained relation extraction via convolutional siamese neural networkJianbo Yuan, Han Guo, Zhiwei Jin, Hongxia Jin, Xianchao Zhang, Jiebo Luo. 2194-2199 [doi]
- SpEnD portal: Linked data discovery using SPARQL endpointsSemih Yumusak, Riza Emre Aras, Elif Uysal, Erdogan Dogdu, Halife Kodaz, Kasim Oztoprak. 2200-2202 [doi]
- Modeling self-service machine-learning agents for distributed stream processingPhilipp Zehnder, Dominik Riemer. 2203-2212 [doi]
- The cybernetics thought collective project: Using computational methods to reveal intellectual context in archival materialBethany G. Anderson, Christopher J. Prom, Kevin Hamilton, James A. Hutchinson, Mark Sammons, Alex Dolski. 2213-2218 [doi]
- Identifying epochs in text archivesTobias Blanke, Jon Wilson. 2219-2224 [doi]
- GraphQL for archival metadata: An overview of the EHRI GraphQL APIMike Bryant. 2225-2230 [doi]
- Building new knowledge from distributed scientific corpus: HERBADROP & EUROPEANA: Two concrete case studies for exploring big archival dataPascal Dugenie, Nuno Freire, Daan Broeder. 2231-2239 [doi]
- Towards automated quality curation of video collections from a realistic perspectiveTodd Richard Goodall, Maria Esteva, Sandra Sweat, Alan C. Bovik. 2240-2245 [doi]
- What can a knowledge complexity approach reveal about big data and archival practice?Nicola Horsley. 2246-2250 [doi]
- Protecting privacy in the archives: Preliminary explorations of topic modeling for born-digital collectionsTim Hutchinson. 2251-2255 [doi]
- Line detection in binary document scans: A case study with the international tracing service archivesBenjamin Charles Germain Lee. 2256-2261 [doi]
- Heuristics for assessing Computational Archival Science (CAS) research: The case of the human face of big data projectMyeong Lee, Yuheng Zhang, Shiyun Chen, Edel Spencer, Jhon Dela Cruz, Hyeonggi Hong, Richard Marciano. 2262-2270 [doi]
- A typology of blockchain recordkeeping solutions and some reflections on their implications for the future of archival preservationVictoria L. Lemieux. 2271-2278 [doi]
- An infrastructure and application of computational archival science to enrich and integrate big digital archival data: Using Taiwan Indigenous Peoples Open Research Data (TIPD) as an exampleJi-Ping Lin. 2279-2287 [doi]
- Auto-categorization methods for digital archivesNathaniel Payne, Jason R. Baron. 2288-2298 [doi]
- The blockchain litmus testT. D. Smith. 2299-2308 [doi]
- Computational curation of a digitized record series of WWII Japanese-American InternmentWilliam Underwood, Richard Marciano, Sandra Laib, Carl Apgar, Luis Beteta, Waleed Falak, Marisa Gilman, Riss Hardcastle, Keona Holden, Yun Huang, David Baasch, Brittni Ballard, Tricia Glaser, Adam Gray, Leigh Plummer, Zeynep Diker, Mayanka Jha, Aakanksha Singh, Namrata Walanj. 2309-2313 [doi]
- Towards a requirements engineering artefact model in the context of big data software development projects: Research in progressDarlan Arruda, Nazim H. Madhavji. 2314-2319 [doi]
- Predicting outcomes for big data projects: Big Data Project Dynamics (BDPD): Research in progressDavid K. Becker. 2320-2330 [doi]
- Agile big data analytics: AnalyticsOps for data scienceNancy W. Grady, Jason A. Payne, Huntley Parker. 2331-2339 [doi]
- Saving costs with a big data strategy frameworkMike Lakoju, Alan Serrano. 2340-2347 [doi]
- Does pair programming work in a data science context? An initial case studyJeffrey S. Saltz, Ivan Shamshurin. 2348-2354 [doi]
- The ambiguity of data science team roles and the need for a data science workforce frameworkJeffrey S. Saltz, Nancy W. Grady. 2355-2361 [doi]
- Make accumulated data in companies eloquent by SQL statement constructorsToshiyuki Shimono. 2362-2369 [doi]
- Online mining for association rules and collective anomalies in data streamsShaaban Abbady, Cheng-Yuan Ke, Jennifer Lavergne, Jian Chen, Vijay V. Raghavan 0001, Ryan Benton. 2370-2379 [doi]
- ABC: A practicable sketch framework for non-uniform multisetsJunzhi Gong, Tong Yang 0003, Yang Zhou, Dongsheng Yang, Shigang Chen, Bin Cui 0001, Xiaoming Li. 2380-2389 [doi]
- Harnessing the power of hashtags in tweet analyticsVibhuti Gupta, Rattikorn Hewett. 2390-2395 [doi]
- A study of a video analysis framework using Kafka and spark streamingAyae Ichinose, Atsuko Takefusa, Hidemoto Nakada, Masato Oguchi. 2396-2401 [doi]
- Towards a unified storage and ingestion architecture for stream processingOvidiu-Cristian Marcu, Alexandru Costan, Gabriel Antoniu, María S. Pérez-Hernández, Radu Tudoran, Stefano Bortoli, Bogdan Nicolae. 2402-2407 [doi]
- Smart distributed query execution over data streamsSalman Ahmed Shaikh, Hiroyuki Kitagawa. 2408-2413 [doi]
- RASP: Real-time network analytics with distributed NoSQL stream processingGeorgios Touloupas, Ioannis Konstantinou, Nectarios Koziris. 2414-2419 [doi]
- Predicting concept drift via dynamic Naïve BayesQian Zhao, Christian Klaue, Chih Lai. 2420-2425 [doi]
- Leveraging distributed big data storage support in CLAaaS for WINGS workflow management systemHadeel Alghamdi, Farhana H. Zulkernine, Patrick Martin. 2426-2432 [doi]
- Online machine learning for cloud resource provisioning of microservice backend systemsHanieh Alipour, Yan Liu. 2433-2441 [doi]
- Trilogy: Data placement to improve performance and robustness of cloud computingChin-Jung Hsu, Vincent W. Freeh, Flavio Villanustre. 2442-2451 [doi]
- Closing the loop - Finding lung cancer patients using NLPBipin Karunakaran, Debdipto Misra, Kyle Marshall, Dhruv Mathrawala, Shravan Kethireddy. 2452-2461 [doi]
- Uncovering the evolution history of data lakesMeike Klettke, Hannes Awolin, Uta Störl, Daniel Müller 0004, Stefanie Scherzinger. 2462-2471 [doi]
- Highly consolidated servers with container-based virtualizationJoichiro Kon, Naoki Mizusawa, Ayaka Umezawa, Saneyasu Yamaguchi, Jian Tao. 2472-2479 [doi]
- Dynamic data transformation for low latency querying in big data systemsLeandro Ordoñez-Ante, Thomas Vanhove, Gregory van Seghbroeck, Tim Wauters, Bruno Volckaert, Filip De Turck. 2480-2489 [doi]
- Icarus: Towards a multistore database systemMarco Vogt, Alexander Stiemer, Heiko Schuldt. 2490-2499 [doi]
- Improving user interaction in mobile-cloud database query processingChenxiao Wang, Jason Arenson, Florian Helff, Le Gruenwald, Laurent d'Orazio. 2500-2507 [doi]
- Understanding and improving disk-based intermediate data caching in SparkKaihui Zhang, Yusuke Tanimura, Hidemoto Nakada, Hirotaka Ogawa. 2508-2517 [doi]
- Improving the functionality of tamura directionality on solar imagesAzim Ahmadzadeh, Dustin J. Kempton, Michael A. Schuh, Rafal A. Angryk. 2518-2526 [doi]
- Parallel computation of magnetic field parameters from HMI active region patchesSunitha Basodi, Berkay Aydin, Rafal A. Angryk. 2527-2532 [doi]
- On the prediction of >100 MeV solar energetic particle events using GOES satellite dataSoukaina Filali Boubrahimi, Berkay Aydin, Petrus C. Martens, Rafal A. Angryk. 2533-2542 [doi]
- A time series classification-based approach for solar flare predictionShah Muhammad Hamdi, Dustin Kempton, Ruizhe Ma, Soukaina Filali Boubrahimi, Rafal A. Angryk. 2543-2551 [doi]
- Multi-wavelength solar event detection using faster R-CNNAhmet Küçük, Berkay Aydin, Rafal A. Angryk. 2552-2558 [doi]
- Improving expectation maximization algorithm over stellar dataHasan Kurban, Can Kockan, Mark Jenne, Mehmet M. Dalkilic. 2559-2568 [doi]
- Solar flare prediction using multivariate time series decision treesRuizhe Ma, Soukaina Filali Boubrahimi, Shah Muhammad Hamdi, Rafal A. Angryk. 2569-2578 [doi]
- Accelerating scientific algorithms in array databases with GPUsSimon Marcin, André Csillaghy. 2579-2587 [doi]
- Identifying and mitigating risks to the quality of open data in the post-truth eraAdrienne Colborne, Michael Smit. 2588-2594 [doi]
- Generative adversarial networks for increasing the veracity of big dataMatthew L. Dering, Conrad S. Tucker. 2595-2602 [doi]
- Augmentation and evaluation of training data for deep learningJunhua Ding, Xinchuan Li, Venkat N. Gudivada. 2603-2611 [doi]
- Is data quality enough for a clinical decision?: Apply machine learning and avoid biasKim Hee. 2612-2619 [doi]
- Data quality challenges with missing values and mixed types in joint sequence analysisAlina Lazar, Ling Jin, C. Anna Spurlock, Kesheng Wu, Alex Sim. 2620-2627 [doi]
- Improving data quality through high precision gender categorizationDaniel Muller, Yiea-Funk Te, Pratiksha Jain. 2628-2636 [doi]
- Collapsing corporate confusion: Leveraging network structures for effective entity resolution in relational corporate dataTim Marple, Bruce A. Desmarais, Kevin L. Young. 2637-2643 [doi]
- Toward data quality analytics in signature verification using a convolutional neural networkShahab Tayeb, Matin Pirouz, Brittany Cozzens, Richard Huang, Maxwell Jay, Kyle Khembunjong, Sahan Paliskara, Felix Zhan, Mark Zhang, Justin Zhan, Shahram Latifi. 2644-2651 [doi]
- An improved P2P file system scheme based on IPFS and BlockchainYongle Chen, Hui Li, Kejiao Li, Jiyang Zhang. 2652-2657 [doi]
- The architecture of distributed storage system under mimic defense theoryHui Li, Jiawei Hu, Huajun Ma, Ting Huang. 2658-2663 [doi]
- A scheduling strategy based on multi-queues of CassandraHaopeng Li, Hui Li. 2664-2669 [doi]
- MDFS: A mimic defense theory based architecture for distributed file systemZhili Lin, Kedan Li, Hanxu Hou, Xin Yang, Hui Li. 2670-2675 [doi]
- On the implementation of BRS codes in CephJiyang Zhang, Hanxu Hou, Kedan Li, Hui Li. 2676-2681 [doi]
- Detecting polarization in ratings: An automated pipeline and a preliminary quantification on several benchmark data setsMahsa Badami, Olfa Nasraoui, Wenlong Sun, Patrick Shafto. 2682-2690 [doi]
- Evaluating the quality of graph embeddings via topological feature reconstructionStephen Bonner, John Brennan, Ibad Kureshi, Georgios Theodoropoulos, Andrew Stephen McGough, Boguslaw Obara. 2691-2700 [doi]
- Using sentiment analysis to explore the degree of risk in sharing economyWei-Lun Chang. 2701-2709 [doi]
- PSEISMIC: A personalized self-exciting point process model for predicting tweet popularityHsin-Yu Chen, Cheng-Te Li. 2710-2713 [doi]
- Detection of profile injection attacks in social recommender systems using outlier analysisAnahita Davoudi, Mainak Chatterjee. 2714-2719 [doi]
- A big social media data study of the 2017 german federal election based on social set analysis of political party Facebook pages with SoSeViBenjamin Flesch, Ravi Vatrapu, Raghava Rao Mukkamala. 2720-2729 [doi]
- Using an asset price bubble model in tweet analyticsK. M. George. 2730-2739 [doi]
- Topic life cycle extraction from big Twitter data based on community detection in bipartite networksTakako Hashimoto, Hiroshi Okamoto, Tetsuji Kuboyama, Kilho Shin. 2740-2745 [doi]
- Ticket-purchase behavior under the effects of marketing campaigns on facebook fan pagesHsiao-Wei Hu, Ching-Han Cheng, Yun-Chu Chung, Chia-Yu Lee. 2746-2751 [doi]
- Language identification in multilingual, short and noisy texts using common N-gramsDijana Kosmajac, Vlado Keselj. 2752-2759 [doi]
- Characterization of daily tourism behaviors based on place sequence analysis from photo sharing websitesThomas-Joseph Loiseau, Sonia Djebali, Thomas Raimbault, Bérengère Branchet, Gaël Chareyron. 2760-2765 [doi]
- Digital content recommendation system using implicit feedback dataGang Wu, Viswanathan Swaminathan, Saayan Mitra, Ratnesh Kumar 0001. 2766-2771 [doi]
- Big social data analytics for public health: Comparative methods study and performance indicators of health care content on FacebookNadiya Straton, Raghava Rao Mukkamala, Ravi Vatrapu. 2772-2777 [doi]
- Outbound behavior analysis through social network data: A case study of Chinese people in JapanTianqi Xia, Xuan Song, Dou Huang, Satoshi Miyazawa, Zipei Fan, Renhe Jiang, Ryosuke Shibasaki. 2778-2786 [doi]
- Towards online graph processing with spark streamingTariq Abughofa, Farhana H. Zulkernine. 2787-2794 [doi]
- SUDS: System for uncertainty decision supportMaaike de Boer, Barry Nouwt, Michael van Bekkum. 2795-2803 [doi]
- Big data processing: Is there a framework suitable for economists and statisticians?Giuseppe Bruno, Demetrio Condello, Alberto Falzone, Andrea Luciani. 2804-2811 [doi]
- A performance study of AsterixDBKeren Ouaknine, Michael J. Carey 0001. 2812-2820 [doi]
- Plug and play bench: Simplifying big data benchmarking using containersSheriffo Ceesay, Adam Barker, Blesson Varghese. 2821-2828 [doi]
- Enhancing the MapReduce training of BP neural networks based on local weight matrix evolutionWanghu Chen, Xintian Li, Jing Li, Jianwu Wang. 2829-2835 [doi]
- CloudEC: A MapReduce-based algorithm for correcting errors in next-generation sequencing big dataWei-Chun Chung, Jan-Ming Ho, Chung-Yen Lin, D. T. Lee. 2836-2842 [doi]
- Quantifying volume, velocity, and variety to support (Big) data-intensive application developmentRustem Dautov, Salvatore Distefano. 2843-2852 [doi]
- Tula: A disk latency aware balancing and block placement strategy for HadoopJanakiram Dharanipragada, Srikant Padala, Balaji Kammili, Vikram Kumar. 2853-2858 [doi]
- Efficient incremental data analytics with apache sparkSina Gholamian, Wojciech Golab, Paul A. S. Ward. 2859-2868 [doi]
- A comparison of big data application programming approaches: A travel companion case studyPei Guo, Jianwu Wang, Zhiyuan Chen. 2869-2878 [doi]
- Adaptive scalable pipelines for political event data generationAndrew Halterman, Jill Irvine, Manar Landis, Phanindra Jalla, Yan Liang, Christan Grant, Mohiuddin Solaimani. 2879-2883 [doi]
- Imbalance in the cloud: An analysis on Alibaba cluster traceChengzhi Lu, Kejiang Ye, Guoyao Xu, Cheng-Zhong Xu, Tongxin Bai. 2884-2892 [doi]
- Scaling point set registration in 3D across thread counts on multicore and hardware accelerator platforms through autotuning for large scale analysis of scientific point cloudsPiotr Luszczek, Jakub Kurzak, Ichitaro Yamazaki, David Keffer, Jack J. Dongarra. 2893-2902 [doi]
- Performance evaluation of multiple sports player tracking system based on graph optimizationYuri Nishikawa, Hitoshi Sato, Jun Ozawa. 2903-2910 [doi]
- A performance study of big data analytics platformsPouria Pirzadeh, Michael J. Carey 0001, Till Westmann. 2911-2920 [doi]
- Schema design support for semi-structured data: Finding the sweet spot between NF and De-NFVincent Reniers, Dimitri Van Landuyt, Ansar Rafique, Wouter Joosen. 2921-2930 [doi]
- A novel compression algorithm decision method for spark shuffle processShanshan Huang, Jungang Xu, Renfeng Liu, Husheng Liao. 2931-2940 [doi]
- ECL-watch: A big data application performance tuning tool in the HPCC systems platformLili Xu, Edin Muharemagic, Amy W. Apon. 2941-2950 [doi]
- Finding the best box-cox transformation from massive datasets on sparkHuayi Fang, Baijian Yang, Tonglin Zhang. 2951-2960 [doi]
- Community-based self generation of policies and processes for assets: Concepts and research directionsElisa Bertino, Geeth de Mel, Alessandra Russo, Seraphin B. Calo, Dinesh C. Verma. 2961-2969 [doi]
- Research challenges in dynamic policy-based autonomous securitySeraphin B. Calo, Emil Lupu, Elisa Bertino, Saritha Arunkumar, Gregory H. Cirincione, Brian Rivera, Alan Cullen. 2970-2973 [doi]
- My (fair) big dataTiziana Catarci, Monica Scannapieco, Marco Console, Camil Demetrescu. 2974-2979 [doi]
- LightSpy: Optical eavesdropping on displays using light sensors on mobile devicesSupriyo Chakraborty, Wentao Robin Ouyang, Mani B. Srivastava. 2980-2989 [doi]
- Combining semantic web and IoT to reason with health and safety policiesEmre Göynügür, Murat Sensoy, Geeth de Mel. 2990-2997 [doi]
- Improving data sharing in data rich environmentsErisa Karafili, Emil C. Lupu, Alan Cullen, Bill Williams, Saritha Arunkumar, Seraphin B. Calo. 2998-3005 [doi]
- Identifying sensor accesses from service descriptionsAntara Palit, Mudhakar Srivatsa, Raghu K. Ganti, Christopher Simpkin. 3006-3011 [doi]
- Edge computing architecture for applying AI to IoTSeraphin B. Calo, Maroun Touma, Dinesh C. Verma, Alan Cullen. 3012-3016 [doi]
- Policy enabled caching for distributed AIDinesh C. Verma, Graham Bent. 3017-3023 [doi]
- Case: Big geosciences data validation challenges and achievementsHussain Z. Al-Ajmi. 3024-3030 [doi]
- Why-Diff: Explaining differences amongst similar workflow runs by exploiting scientific metadataPriyaa Thavasimani, Jacek Cala, Paolo Missier. 3031-3041 [doi]
- Using machine learning methods to identify atrocity perpetratorsBenjamin E. Bagozzi, Ore Koren. 3042-3051 [doi]
- Comparison between spatial distributions of tweet base and population in JapanShouji Fujimoto, Atushi Ishikawa, Takayuki Mizuno. 3052-3057 [doi]
- Evaluating funding programs through network centrality measures of co-author networks of technical papersMasanori Fujita, Hiroto Inoue, Takao Terano. 3058-3063 [doi]
- Analysis of twitter messages about the osaka metropolis plan in JapanKouki Hayashi, Eiichi Umehara, Yuuki Ogawa. 3064-3070 [doi]
- Analyzing regional characteristics of living activities of elderly people from large survey data with probabilistic latent spatial semantic structure modelingAyae Ide, Kazuya Yamashita, Yoichi Motomura, Takao Terano. 3071-3077 [doi]
- Position-sensitive propagation of information on social media using social physics approachAkira Ishii, Takayuki Mizuno, Yasuko Kawahata. 3078-3085 [doi]
- Time dependent analysis of financial networks using supervised latent feature relational modelsShotaro Ito, Koji Eguchi. 3086-3090 [doi]
- A statistical analysis of behavioral bursts occurring in a social networking gameMitsuki Murase, Masanori Takano, Reiji Suzuki, Takaya Arita. 3091-3097 [doi]
- Bias reduction of peer influence effects with latent coordinates and community membershipDaniel Rajchwald, Natasha Markuzon, Edoardo M. Airoldi. 3098-3103 [doi]
- Cross-national measurement of polarization in political discourse: Analyzing floor debate in the U.S. the Japanese legislaturesTakuto Sakamoto, Hiroki Takikawa. 3104-3110 [doi]
- Mining social media for disaster management: Leveraging social media data for community recoveryYuya Shibuya. 3111-3118 [doi]
- When do users change their profile information on twitter?Jinsei Shima, Mitsuo Yoshida, Kyoji Umemura. 3119-3122 [doi]
- Facebook and public health: A study to understand facebook post performance with organizations' strategyNadiya Straton, Ravi Vatrapu, Raghava Rao Mukkamala. 3123-3132 [doi]
- Develop method to predict the increase in the Nikkei VI indexHirohiko Suwa, Yuki Ogawa, Eiichi Umehara, Kento Kakigi, Keiichi Yasumoto, Tatsuo Yamashita, Kota Tsubouchi. 3133-3138 [doi]
- Analysis of the changes in listening trends of a music streaming serviceMasanori Takano, Hiroki Mizukami, Fujio Toriumi, Makoto Takeuchi, Kazuya Wada, Masahiro Yasuda, Ichiro Fukiida. 3139-3142 [doi]
- Political polarization in social media: Analysis of the "Twitter political field" in JapanHiroki Takikawa, Kikuko Nagayoshi. 3143-3150 [doi]
- Analysis of EXILE TRIBE in the music scene using mathematical model of hit phenomenonToshimichi Wakabayashi, Yasuko Kawahata, Akira Ishii. 3151-3155 [doi]
- Relationships between market impact characteristics and order book propertiesKenta Yamada, Takayuki Mizuno. 3156-3161 [doi]
- Detecting two types of seasonal words using simple autocorrelation analysisKenta Yamada. 3162-3167 [doi]
- Inference of personal attributes from tweets using machine learningTake Yo, Kazutoshi Sasahara. 3168-3174 [doi]
- Managing massive multi-dimensional array data with TileDB: - Invited demo paperJacob Bolewski, Stavros Papadopoulos 0001. 3175-3176 [doi]
- Generating polystore ingestion plans - A demonstration with the AWESOME systemSubhasis Dasgupta, Charles McKay, Amarnath Gupta. 3177-3179 [doi]
- Polystore mathematics of relational algebraHayden Jananthan, Ziqi Zhou, Vijay Gadepally, Dylan Hutchison, Suna Kim, Jeremy Kepner. 3180-3189 [doi]
- Querying web polystoresYasar Khan, Antoine Zimmermann, Alokkumar Jha, Dietrich Rebholz-Schuhmann, Ratnesh Sahay. 3190-3195 [doi]
- A novel object placement protocol for minimizing the average response time of get operations in distributed key-value storesAntonios Makris, Konstantinos Tserpes, Dimosthenis Anagnostopoulos. 3196-3205 [doi]
- SciDB: An array-native computational database for heterogeneous, multi-dimensional data setsJonathan Rivers. 3206-3210 [doi]
- Enabling query processing across heterogeneous data models: A surveyRan Tan, Rada Chirkova, Vijay Gadepally, Timothy G. Mattson. 3211-3220 [doi]
- An apache calcite-based polystore variation for federated querying of heterogeneous healthcare sourcesAshwin Kumar Vajantri, Kunwar Deep Singh Toor, Edmon Begoli, Jack Bates. 3221-3227 [doi]
- A detection mechanism with text mining cross correlation approachJose Luis, Guerrero-Cusumano. 3228-3232 [doi]
- Text mining analysis of wind turbine accidents: An ontology-based frameworkGürdal Ertek, Xu Chi, Allan N. Zhang, Sobhan Asian. 3233-3241 [doi]
- A model for analysing a disrupted supply chain's time-to-recovery under uncertaintyA. J. L. Lee, D. Paul, W. J. Yan, A. N. Zhang, Mark Goh. 3242-3247 [doi]
- Application of deep neural network and generative adversarial network to industrial maintenance: A case study of induction motor fault detectionYong Oh Lee, Jun Jo, Jongwoon Hwang. 3248-3253 [doi]
- Learning automata based method for solving demand and supply problem with periodic behaviorsHaoye Lu, Anand Srinivasan, Amiya Nayak. 3254-3260 [doi]
- Forecast and analysis of food donations using support vector regressionNigel Pugh, Lauren B. Davis. 3261-3267 [doi]
- Association analysis of supply chain risk and company salesMurat Mustafa Tunc, Alexandru Valcov, Allan N. Zhang, Wenjing Yan, Rong Wen. 3268-3277 [doi]
- Adaptive spatio-temporal mining for route planning and travel time estimationRong Wen, Wenjing Yan, Allan N. Zhang. 3278-3284 [doi]
- Streaming analytics processing in manufacturing performance monitoring and predictionYi-Hsin Wu, Sheng-De Wang, Li-Jung Chen, Cheng-Juei Yu. 3285-3289 [doi]
- Performing literature review using text mining, Part I: Retrieving technology infrastructure using Google Scholar and APIsDazhi Yang, Allan N. Zhang, Wenjing Yan. 3290-3296 [doi]
- Performing literature review using text mining, Part II: Expanding domain knowledge with abbreviation identificationDazhi Yang, Jihoon Hong. 3297-3301 [doi]
- GPU-based parallel algorithm for generating massive scale-free networks using the preferential attachment modelMaksudul Alam, Kalyan S. Perumalla. 3302-3311 [doi]
- A parallel algorithm for generating a random graph with a prescribed degree sequenceMd Hasanuzzaman Bhuiyan, Maleq Khan, Madhav Marathe. 3312-3321 [doi]
- Discovering interesting patterns in large graph cubesFlorian Demesmaeker, Amine Ghrab, Siegfried Nijssen, Sabri Skhiri. 3322-3331 [doi]
- Distributed memory parallel Markov random fields using graph partitioningC. Heinemann, Talita Perciano, Daniela Ushizima, E. Wes Bethel. 3332-3341 [doi]
- A generalized incremental bottom-up community detection framework for highly dynamic graphsWeiyi Liu, Toyotaro Suzumura, Lingli Chen, Guangmin Hu. 3342-3351 [doi]
- Regular decomposition of large graphs and other structures: Scalability and robustness towards missing dataHannu Reittu, Ilkka Norros. 3352-3357 [doi]
- R: Massive and distributed RDF graph stream reasoningXiangnan Ren, Olivier Curé, Hubert Naacke, Jérémy Lhez, Li Ke. 3358-3367 [doi]
- Practical approach to evacuation planning via network flow and deep learningAkira Tanaka, Nozomi Hata, Nariaki Tateiwa, Katsuki Fujisawa. 3368-3377 [doi]
- Techniques for efficient detection of rapid weather changes and analysis of their impacts on a highway networkAdil Alim, Aparna Joshi, Feng Chen 0001, Catherine T. Lawson. 3378-3387 [doi]
- SQL versus NoSQL databases for geospatial applicationsElena Baralis, Andrea Dalla Valle, Paolo Garza, Claudio Rossi, Francesco Scullino. 3388-3397 [doi]
- Spatiotemporal visualization of traffic paths using color space time curveSavitha Baskaran, Shiaofen Fang, Shenhui Jiang. 3398-3405 [doi]
- All in One: Encoding spatio-temporal big data in XML, JSON, and RDF without information lossPeter Baumann 0001, Eric Hirschorn, Joan Masó-Pau, Vlad Merticariu, Dimitar Misev. 3406-3415 [doi]
- Spaten: A spatio-temporal and textual big data generatorThaleia Dimitra Doudali, Ioannis Konstantinou, Nectarios Koziris. 3416-3421 [doi]
- Multiscale graph theoretical tools reveal subtle patterns in big geospatial dataRonald D. Hagan, Charles A. Phillips, Michael A. Langston, Bradley J. Rhodes. 3422-3425 [doi]
- Optimal viewpoint finding for 3D visualization of spatio-temporal vehicle trajectories on caution crossroads detected from vehicle recorder big dataMasahiko Itoh, Daisaku Yokoyama, Masashi Toyoda, Masaru Kitsuregawa. 3426-3434 [doi]
- Road map extraction from satellite imagery using connected component analysis and landscape metricsKulsawasd Jitkajornwanich, Peerapon Vateekul, Teerapong Panboonyuen, Siam Lawawirojwong, Siwapon Srisonphan. 3435-3442 [doi]
- Scalable parallel data loading in SciDBSangchul Kim, Junhee Lee 0003, Taehoon Kim, Bongki Moon. 3443-3446 [doi]
- Discovering dynamic patterns of urban space via semi-nonnegative matrix factorizationZhicheng Liu, Jun Cao, Junyan Yang, Qiao Wang. 3447-3453 [doi]
- Identifying coherent anomalies in multi-scale spatio-temporal data using Markov random fieldsAdway Mitra. 3454-3460 [doi]
- A tale of two cities: Analyzing road accidents with big spatial dataRene Richard, Suprio Ray. 3461-3470 [doi]
- Challenges and trends about smart big geospatial data: A position paperVictor Saquicela, Luis Manuel Vilches Blázquez, Andres Tello. 3471-3475 [doi]
- Towards development of spark based agricultural information system including geo-spatial dataPurnima Shah, Deepak B. Hiremath, Sanjay Chaudhary. 3476-3481 [doi]
- A map-based visual analysis method for patterns discovery of mobile learning in education with big dataDongbo Zhou, Hao Li, Sannyuya Liu, Bo Song, Tony Xiaohua Hu. 3482-3491 [doi]
- Big data machine learning using apache spark MLlibMehdi Assefi, Ehsun Behravesh, Guangchi Liu, Ahmad Pahlavan Tafti. 3492-3498 [doi]
- Return of experience on the mean-shift clustering for heterogeneous architecture use caseChristophe Cérin, Jean-Luc Gaudiot, Mustapha Lebbah, Fouste Yuehgoh. 3499-3507 [doi]
- Cloud big data decision support system for machine learning on AWS: Analytics of analyticsAlex Kaplunovich, Yelena Yesha. 3508-3516 [doi]
- Divide-and-conquer strategies for large-scale simulations in RHui Zhang, Yiwen Zhong, Juan Lin. 3517-3523 [doi]
- Map-scan node accelerator for big-dataMihaela Malita, Gheorghe M. Stefan. 3524-3529 [doi]
- Ranked time series matching by interleaving similarity distancesCuong Nguyen, Charles Lovering, Rodica Neamtu. 3530-3539 [doi]
- Kernel bandwidth selection for SVDD: The sampling peak criterion method for large dataSergiy Peredriy, Deovrat Kakde, Arin Chaudhuri. 3540-3549 [doi]
- An online spatio-temporal model for inference and predictions of taxi demandHong Yan, Zhongqiang Zhang, Jian Zou. 3550-3557 [doi]
- Machine learning for early detection of autism (and other conditions) using a parental questionnaire and home video screeningHalim Abbas, Ford Garberson, Eric Glover, Dennis P. Wall. 3558-3561 [doi]
- Artificial intelligence applied to challenges in the fields of operations and customer supportRavi Santosh Arvapally, Hasan Hicsasmaz, Wally Lo Faro. 3562-3569 [doi]
- Semantic search (invited talk)Ricardo A. Baeza-Yates. 3570 [doi]
- Artificial intelligence(AI), automation, and its impact on data scienceRichard Boire. 3571-3574 [doi]
- A hybrid bipartite graph based recommendation algorithm for mobile gamesYong Cai, Shaorong Liu, Jinlong Hu, Guihong Bai, Shoubin Dong. 3575-3582 [doi]
- Estimating skill fungibility and forecasting services labor demandBrian Johnston, Benjamin Zweig, Michael Peran, Charlie Wang, Rachel Rosenfeld. 3583-3585 [doi]
- Innovation in big data analytics: Applications of mathematical programming in medicine and healthcareEva K. Lee. 3586-3595 [doi]
- Automated knowledge extraction from the federal acquisition regulations system (FARS)Srishty Saha, Karuna P. Joshi, Renee Frank, Michael Aebig, Jiayong Lin. 3596-3603 [doi]
- A comparative sequence analysis of career paths among knowledge workers in a multinational bankPaul Squires, Harold G. Kaufman, Julian Togelius, Catalina M. Jaramillo. 3604-3612 [doi]
- Hitting your number or not? A robust & intelligent sales forecast systemXin Xu Lei, Tang Venkat Rangan. 3613-3622 [doi]
- Governance framework for enterprise analytics and dataAtsushi Yamada, Michael Peran. 3623-3631 [doi]
- Forensics analysis of Wi-Fi communication traces in mobile devicesAnja Evelyn Amundsen, Kenneth M. Ovens. 3632-3637 [doi]
- Identifying extremism in social media with multi-view context-aware subset optimizationSreyasee Das Bhattacharjee, Bala Venkatram Balantrapu, William Tolone, Ashit Talukder. 3638-3647 [doi]
- Extracting cyber threat intelligence from hacker forums: Support vector machines versus convolutional neural networksIsuf Deliu, Carl Leichter, Katrin Franke. 3648-3656 [doi]
- Exploratory studies into forensic logs for criminal investigation using case studies in industrial control systems in the power sectorAsif Iqbal, Mathias Ekstedt, Hanan Alobaidli. 3657-3661 [doi]
- Neural reputation models learned from passive DNS dataPierre Lison, Vasileios Mavroeidis. 3662-3671 [doi]
- Cyber crime investigations in the era of big dataAndrii Shalaginov, Jan William Johnsen, Katrin Franke. 3672-3676 [doi]
- Topical behavior prediction from massive logsShih-Chieh Su. 3677-3683 [doi]
- Introducing DeepBalance: Random deep belief network ensembles to address class imbalancePeter Xenopoulos. 3684-3689 [doi]
- A first estimation of the proportion of cybercriminal entities in the bitcoin ecosystem using supervised machine learningHaohua Sun Yin, Ravi Vatrapu. 3690-3699 [doi]
- Forensic database reconstructionJoshua Sablatura, Bing Zhou. 3700-3704 [doi]
- Coupling early warning services, crowdsourcing, and modelling for improved decision support and wildfire emergency managementConrad Bielski, V. O'Brien, C. Whitmore, K. Ylinen, I. Juga, P. Nurmi, J. Kilpinen, I. Porras, J. M. Sole, P. Gamez, M. Navarro, A. Alikadic, A. Gobbi, C. Furlanello, Gunter Zeug, M. Weirathe, J. Martinez, R. Yuste, S. Castro, V. Moreno, T. Velin, Claudio Rossi 0003. 3705-3712 [doi]
- Summarization of emergency news articles driven by relevance feedbackLuca Cagliero. 3713-3721 [doi]
- All in a twitter: Self-tuning strategies for a deeper understanding of a crisis tweet collectionEvelina Di Corso, Francesco Ventura, Tania Cerquitelli. 3722-3726 [doi]
- Gamified crowdsourcing for disaster risk managementAntonella Frisiello, Quynh Nhu Nguyen, Claudio Rossi. 3727-3733 [doi]
- A heat wave forecast system for EuropeAndrea Gobbi, Azra Alikadic, Kaisa Ylinen, Federico Angaramo, Cesare Furlanello. 3734-3738 [doi]
- A language-agnostic approach to exact informative tweets during emergency situationsJacopo Longhini, Claudio Rossi, Claudio Casetti, Federico Angaramo. 3739-3475 [doi]
- River segmentation for flood monitoringLaura Lopez-Fuentes, Claudio Rossi, Harald Skinnemoen. 3746-3749 [doi]
- A comparison of classification models for natural disaster and critical event detection from newsTimothy Nugent, Fabio Petroni, Natraj Raman, Lucas Carstens, Jochen L. Leidner. 3750-3759 [doi]
- Optimal geospatial volunteer allocation needs realistic distancesJasmin Pielorz, Matthias Prandtstetter, Markus Straub, Christoph H. Lampert. 3760-3763 [doi]
- Crowd control and evacuation guidance based on simulationsTomoichi Takahashi, Katsuki Ichinose. 3764-3768 [doi]
- The role of unstructured data in real-time disaster-related social media monitoringFrancesco Tarasconi, Michela Farina, Antonio Mazzei, Alessio Bosca. 3769-3778 [doi]
- Analyzing spatial data from twitter during a disasterLuca Venturini, Evelina Di Corso. 3779-3783 [doi]
- Comparison of different driving style analysis approaches based on trip segmentation over GPS informationMarco Brambilla, Paolo Mascetti, Andrea Mauri. 3784-3791 [doi]
- Understanding data quality: Ensuring data quality by design in the rail industryQian Fu, John M. Easton. 3792-3799 [doi]
- Track geometry big data analysis: A machine learning approachEmmanuel Nii Martey, Lasisi Ahmed, Nii O. Attoh-Okine. 3800-3809 [doi]
- Application of machine learning for fuel consumption modelling of trucksFederico Perrotta, Tony Parry, Luis C. Neves. 3810-3815 [doi]
- Privacy-preserving trajectory classification of driving trip data based on pattern discovery techniquesGene P. K. Wu, Keith C. C. Chan. 3816-3825 [doi]
- Predictive analytics for litigation case managementJerzy Bala, Michael Kellar, Fred Ramberg. 3826-3830 [doi]
- Using google analytics to support cybersecurity forensicsHan Qin, Kit Riehle, Haozhen Zhao. 3831-3834 [doi]
- A feasibility experiment on the application of predictive coding to instant messaging corporaThanasis Schoinas, Ghulam Qadir. 3835-3840 [doi]
- Patient-individual morphological anomaly detection in multi-lead electrocardiography data streamsAlexander Acker, Florian Schmidt, Anton Gulenko, Reinhard Kietzmann, Odej Kao. 3841-3846 [doi]
- Predicting efficacy of therapeutic services for autism spectrum disorder using scientific workflowsFahima Amin Bhuyan, Shiyong Lu, Ishtiaq Ahmed, Jia Zhang. 3847-3856 [doi]
- A multimedia big data retrieval framework to detect dyslexia among childrenElham Hassanain. 3857-3860 [doi]
- Mining accompanying relationships between diseases from patient recordsWei Hong Lee, En Tzu Wang, Arbee L. P. Chen. 3861-3868 [doi]
- Explainable data-driven modeling of patient satisfaction survey dataNing Liu, Soundar Kumara, Eric Reich. 3869-3876 [doi]
- A multi-task machine learning approach for comorbid patient prioritizationGoutam Mylavarapu, Johnson P. Thomas. 3877-3881 [doi]
- Visualization of non-metric relationships by adaptive learning multiple maps t-SNE regularizationXianjun Shen, Xianchao Zhu, Xingpeng Jiang, Li Gao, Tingting He, Xiaohua Hu. 3882-3887 [doi]
- bigNN: An open-source big data toolkit focused on biomedical sentence classificationAhmad Pahlavan Tafti, Ehsun Behravesh, Mehdi Assefi, Eric LaRose, Jonathan C. Badger, John Mayer, AnHai Doan, David Page, Peggy L. Peissig. 3888-3896 [doi]
- Toward predicting medical conditions using k-nearest neighborsShahab Tayeb, Matin Pirouz, Johann Sun, Kaylee Hall, Andrew Chang, Jessica Li, Connor Song, Apoorva Chauhan, Michael Ferra, Theresa Sager, Justin Zhan, Shahram Latifi. 3897-3903 [doi]
- A medical price prediction system using hierarchical decision treesAnuja Tike, Sanket Tavarageri. 3904-3913 [doi]
- High dimensional data processing for fetal activity evaluationIulian Voicu, Denis Kouame. 3914-3915 [doi]
- iVAR: Interactive visual analytics of radiomics features from large-scale medical imagesLina Yu, Hengle Jiang, Hongfeng Yu, Chi Zhang, Josiah Mcallister, Dandan Zheng. 3916-3923 [doi]
- Big data technology and ethics considerations in customer behavior and customer feedback miningXin Deng. 3924-3927 [doi]
- Customer churn prediction in an internet service providerDuyen Do, Phuc Huynh, Phuong Vo, Tu Vu. 3928-3933 [doi]
- Training on the poles for review sentiment polarity classificationMichael Kranzlein, Dan Chia-Tien Lo. 3934-3937 [doi]
- Understanding rating behavior based on moral foundations: The case of Yelp reviewsPegah Nokhiz, FengJun Li. 3938-3945 [doi]
- A scalable sequential principal component analysis algorithm (SeqPCA) with application to user access control analysisYixuan Qiu, Wutao Wei. 3946-3754 [doi]
- Towards an ethical application of customer feedback dataRoss Smith. 3955-3957 [doi]
- Dynamic Bayesian predictive model for box office forecastingWutao Wei, Le Zhang, Qi Ding, Bingrou Zhou. 3958-3964 [doi]
- A big data analytics framework for forecasting rare customer complaints: A use case of predicting MA members' complaints to CMSDonghui Wu. 3965-3967 [doi]
- Heterogeneous knowledge transfer via domain regularization for improving cross-domain collaborative filteringYizhou Zang, Xiaohua Hu. 3968-3974 [doi]
- iEnvironment: A software platform for integrated environmental monitoring and modeling of surface waterPaulo S. C. Alencar, Donald D. Cowan, Doug Mulholland, Bruce MacVicar, Simon Courtenay, Stephen Murphy, Fred McGarry. 3975-3978 [doi]
- New data paradigms: From the crowd and backRumi Chunara. 3979-3980 [doi]
- Unifying the open big data world: The possibilities∗ of apache BEAMHolden Karau. 3981 [doi]
- Deep learning enabled national cancer surveillanceGeorgia D. Tourassi. 3982-3983 [doi]
- Preparing data managers to support open ocean science: Required competencies, assessed gaps, and the role of experiential learningLee Wilson, Adrienne Colborne, Michael Smit. 3984-3993 [doi]
- Modeling multiple subskills by extending knowledge tracing model using logistic regressionXuan Zhou, Wenjun Wu, Yong Han. 3994-4003 [doi]
- Application specific traffic control using network virtualization node in large-scale disastersTsumugi Tairaku, Akihiro Nakao, Saneyasu Yamaguchi, Masato Oguchi. 4004-4009 [doi]
- Automatic detection of DNS manipulationsMartino Trevisan, Idilio Drago, Marco Mellia, Maurizio M. Munafò. 4010-4015 [doi]
- Mining and modeling web trajectories from passive tracesLuca Vassio, Marco Mellia, Flavio Figueiredo, Ana Paula Couto da Silva, Jussara M. Almeida. 4016-4021 [doi]
- Automatic topic discovery of online hospital reviews using an improved LDA with Variational Gibbs SamplingRichard de Groof, Haiping Xu. 4022-4029 [doi]
- Fragrance to vector as scent technologyNoriaki Koide, Yu Ichifuji. 4030-4034 [doi]
- Cross-database mammographic image analysis through unsupervised domain adaptationDeepak Kumar, Chetan Kumar, Ming Shao. 4035-4042 [doi]
- GuideMe: Routes coordination of participating agents in mobile crowd sensing platformsChristine Bassem, Azer Bestavros. 4043-4049 [doi]
- A whole building fault detection using weather based pattern matching and feature based PCA methodYimin Chen, Jin Wen. 4050-4057 [doi]
- A model for the socially smart city practical uses of city-level socio-economic indicatorsDonald D. Cowan, Paulo S. C. Alencar, Kyle Young, Bryan Smale, Ryan Erb, Fred McGarry. 4058-4067 [doi]
- Using social media photos to identify tourism preferences in smart tourism destinationMickael Figueredo, Nélio Cacho, Antonio Thome, Andréa Cacho, Frederico Lopes, Maria Valeria Araujo. 4068-4073 [doi]
- Self-adaptive and resilient urban networking infrastructure for disasters and smart city servicesPaul G. Flikkema, Morgan Vigil-Hayes. 4074-4079 [doi]
- Data analysis on train transportation data with nonnegative matrix factorizationKyoichi Ito, Masaki Ito, Kosuke Miyazaki, Keishi Tanimoto, Kaoru Sezaki. 4080-4085 [doi]
- Reliability analysis of an IoT-based smart parking application for smart citiesAnderson Araujo, Rubem Kalebe, Gustavo Girão, Itamir Filho, Kayo Goncalves, Bianor Neto. 4086-4091 [doi]
- Road marking blur detection with drive recorderMakoto Kawano, Kazuhiro Mikami, Satoshi Yokoyama, Takuro Yonezawa, Jin Nakazawa. 4092-4097 [doi]
- Datafying city: Detecting and accumulating spatio-temporal events by vehicle-mounted sensorsYasue Kishino, Koh Takeuchi, Yoshinari Shirai, Futoshi Naya, Naonori Ueda. 4098-4104 [doi]
- Analytical toolbox for smart city applications: Garbage collection log use caseTakahiro Komamizu, Jin Nakazawa, Toshiyuki Amagasa, Hiroyuki Kitagawa, Hideyuki Tokuda. 4105-4110 [doi]
- City event detection from social media with neural embeddings and topic model visualizationShuhua Liu, Patrick Jansson. 4111-4116 [doi]
- Proposing an access gate to facilitate knowledge exchange for smart city servicesZohreh Pourzolfaghar, Markus Helfert, Viviana Angely Bastidas Melo, Ahmad Khalilijafarabad. 4117-4122 [doi]
- MM360: A GPS-assisted 360-degree video sharing system for participatory eventsNaoya Shibahara, Ryoma Kondo, Masayuki Iwai. 4123-4127 [doi]
- Towards building a hybrid model for predicting stock indexesJonathan Creighton, Farhana H. Zulkernine. 4128-4133 [doi]
- Agglomeration, network and urban development - - A study on newspaper connection network index of citiesDongmei Guo, Jialong Zheng, Xiaolan Yang. 4134-4141 [doi]
- An augmented fama and french three-factor model using social interactionLin Huo, Xiaoli Sun. 4142-4147 [doi]
- Stock price forecasting using support vector regression: Based on network behavior dataQuan Jin, Kun Guo, Yi Sun. 4148-4153 [doi]
- Insurance premium optimization using motor insurance policies - A business growth classification approachDaniel Muller, Yiea-Funk Te. 4154-4158 [doi]
- Predicting business performance through patent applicationsDaniel Muller, Yiea-Funk Te, Pratiksha Jain. 4159-4164 [doi]
- Forecasting tourist arrivals with machine learning and internet search indexShaolong Sun, Shouyang Wang, Yunjie Wei, Xianduan Yang, Kwok-Leung Tsui. 4165-4169 [doi]
- A new time series prediction method based on complex network theoryMinggang Wang, Andre L. M. Vilela, Lixin Tian, Hua Xu, Ruijin Du. 4170-4175 [doi]
- An enhanced LGSA-SVM for S&P 500 index forecastJinxin Wang, Wei Shang, Zhengyang Liu, Shouyang Wang. 4176-4183 [doi]
- Can search data help forecast inflation? Evidence from a 13-country panelYunjie Wei, Xun Zhang, Shouyang Wang. 4184-4188 [doi]
- Integrating heterogeneous data sources for traffic flow prediction through extreme learning machineQingqing Zhang, Darren Jian, Rui Xu, Wei Dai, Ying Liu. 4189-4194 [doi]
- The construction and application of expectations index on monetary policyGuihuan Zheng, Qikun Yao, Xingfen Wang, Zhou Yang. 4199-4203 [doi]
- Big data processing: Is there a framework suitable for economists and statisticians?Giuseppe Bruno, Demetrio Condello, Alberto Falzone, Andrea Luciani. 4204-4211 [doi]
- Cluster-overlap algorithm for assessing preprocessing choices in environmental sustainabilityAnne M. Denton, Arighna Roy. 4212-4220 [doi]
- Critical enablers of sustainable water management (SWM): Text evidences from 10 countriesChu-hua Kuei, Christian N. Madu, Picheng Lee. 4221-4227 [doi]
- Characterization of cities based on world grid square statistics about specific propertiesAki-Hiro Sato. 4228-4237 [doi]
- World grid square codes: Definition and an example of world grid square dataAki-Hiro Sato, Shoki Nishimura, Hiroe Tsubaki. 4238-4247 [doi]
- Statistical analysis of hotel plan popularity in regional tourist areasHiroshi Tsuda, Masakazu Ando, Yu Ichifuji. 4248-4254 [doi]
- Sustainable blockchain-enabled services: Smart contractsCraig Wright, Antoaneta Serguieva. 4255-4264 [doi]
- Developing sustainable trading strategies using directional changes with high frequency dataAilun Ye, V. L. Raju Chinthalapati, Antoaneta Serguieva, Edward P. K. Tsang. 4265-4271 [doi]
- SARGS method for distributed actionable pattern mining using sparkArunkumar Bagavathi, Pranava Mummoju, Katarzyna A. Tarnowska, Angelina A. Tzacheva, Zbigniew W. Ras. 4272-4281 [doi]
- Vehicle path estimation using dual-level clustering and multi-source predictionI-Cheng Chang, Yudi Pratama Halim, Chun-Man Lin. 4282-4286 [doi]
- Combining pattern matching with word embeddings for the extraction of experimental variables from scientific literatureHelena F. Deus, Corey A. Harper, Darin McBeath, Ron Daniel. 4287-4292 [doi]
- Ocean surface current prediction based on HF radar observations using trajectory-oriented association rule miningKulsawasd Jitkajornwanich, Peerapon Vateekul, Upa Gupta, Teeranai Kormongkolkul, Arnon Jirakittayakorn, Siam Lawawirojwong, Siwapon Srisonphan. 4293-4300 [doi]
- A distributed pipeline for DIDSON data processingLiling Li, Tyler Danner, Jesse Eickholt, Erin McCann, Kevin Pangle, Nicholas Johnson. 4301-4306 [doi]
- Deep model style: Cross-class style compatibility for 3D furniture within a sceneTse-Yu Pan, Yi-Zhu Dai, Wan-Lun Tsai, Min-Chun Hu. 4307-4313 [doi]
- Improving Arabic sentiment analysis with sentiment-specific embeddingsA. Aziz Altowayan, Ashraf Elnagar. 4314-4320 [doi]
- Differences in emoji sentiment perception between readers and writersJose Berengueres, Dani Castro. 4321-4328 [doi]
- Topic modelling enriched LSTM models for the detection of novel and emerging named entities from social mediaPatrick Jansson, Shuhua Liu. 4329-4336 [doi]
- An entity disambiguation method based on LeaderRankBingjing Jia, Bin Wu, Jinna Lv, Pengpeng Zhou, Yao Bu, Ying Xing. 4337-4342 [doi]
- Identifying emergency stages in facebook posts of police departments with convolutional and recurrent neural networks and support vector machinesNicolai Pogrebnyakov, Edgar A. Maldonado. 4343-4352 [doi]
- #Anorexia, #anarexia, #anarexyia: Characterizing online community practices with orthographic variationIan Stewart, Stevie Chancellor, Munmun De Choudhury, Jacob Eisenstein. 4353-4361 [doi]
- Crossing the Streams: Fuzz testing with user inputJoseph A. Cottam, Leslie Blaha, Dimitri Zarzhitsky, Mathew Thomas, Elliott Skomski. 4362-4371 [doi]
- Improving classification accuracy in crowdsourcing through hierarchical reorganizationXiaoni Duan, Keishi Tajima. 4372-4374 [doi]
- Crowd-based best-effort number estimationYuzuki Furuhashi, Masaki Matsubara, Atsuyuki Morishima. 4375-4377 [doi]
- [Research paper] formalizing interruptible algorithms for human over-the-loop analyticsAustin Graham, Yan Liang, Le Gruenwald, Christan Grant. 4378-4383 [doi]
- Clarifying the transition of workload for victims life reconstruction support programs in affected local governments using the victims master database - Comparison between the 2007 Chuetsu-oki earthquake and the 2016 Kumamoto Earthquake-Munenari Inoguchi, Keiko Tamura, Kei Horie, Haruo Hayashi. 4384-4388 [doi]
- Active preference learning for generative adversarial networksMasahiro Kazama, Viviane Takahashi. 4389-4393 [doi]
- A crowd-in-the-loop approach for generating conference programs with microtasksNaoki Kobayashi, Masaki Matsubara, Keishi Tajima, Atsuyuki Morishima. 4394-4396 [doi]
- Method to generate disaster-damage map using 3D photometry and crowd sourcingKoyo Kobayashi, Hidehiko Shishido, Yoshinari Kameda, Itaru Kitahara. 4397-4399 [doi]
- Implicit order join: Joining log data with property data by discovering implicit order-oriented keys with human assistanceTakahiro Komamizu, Toshiyuki Amagasa, Hiroyuki Kitagawa. 4400-4406 [doi]
- Conceptual design for comprehensive research support platform: Successful research data management generating big data from little dataMamiko Matsubayashi, Keiko Kurata. 4407-4409 [doi]
- A trade-off between estimation accuracy of worker quality and task complexityYoshitaka Matsuda, Yu Suzuki, Satoshi Nakamura 0001. 4410-4416 [doi]
- Collaborative filtering and rating aggregation based on multicriteria ratingHiroki Morise, Satoshi Oyama, Masahito Kurihara. 4417-4422 [doi]
- Towards predicting task performance from EEG signalsMichalis Papakostas, Konstantinos Tsiakas, Theodoros Giannakopoulos, Fillia Makedon. 4423-4425 [doi]
- Proactive preservation of world heritage by crowdsourcing and 3D reconstruction technologyHidehiko Shishido, Yutaka Ito, Youhei Kawamura, Toshiya Matsui, Atsuyuki Morishima, Itaru Kitahara. 4426-4428 [doi]
- Using categorized web browsing history to estimate the user's latent interests for web advertisement recommendationPanote Siriaraya, Yuriko Yamaguchi, Mimpei Morishita, Yoichi Inagaki, Reyn Y. Nakamoto, Jianwei Zhang 0002, Junichi Aoi, Shinsuke Nakajima. 4429-4434 [doi]
- "DEKATSU" activity of data and service collaboration among private companies and academic institutions for Tokyo metropolitan resilience projectKeiko Tamura, Naoshi Hirata. 4435-4437 [doi]
- Link before you share: Managing privacy policies through blockchainAgniva Banerjee, Karuna Pande Joshi. 4438-4447 [doi]
- Automated microsoft office macro malware detection using machine learningRuth Bearden, Dan Chai-Tien Lo. 4448-4452 [doi]
- Fighting fake news spread in online social networks: Actual trends and future research directionsAlina Campan, Alfredo Cuzzocrea, Traian Marius Truta. 4453-4457 [doi]
- Impact of security awareness training on phishing click-through ratesAnthony Carella, Murat Kotsoev, Traian Marius Truta. 4458-4466 [doi]
- Data masking techniques for NoSQL database security: A systematic reviewAlfredo Cuzzocrea, Hossain Shahriar. 4467-4473 [doi]
- Tor traffic analysis and detection via machine learning techniquesAlfredo Cuzzocrea, Fabio Martinelli, Francesco Mercaldo, Gianni Vercelli. 4474-4480 [doi]
- Modeling user communities for identifying security risks in an organizationAnirban Das, Min-Yi Shen, Jisheng Wang. 4481-4486 [doi]
- Efficient and private approximations of distributed databases calculationsPhilip Derbeko, Shlomi Dolev, Ehud Gudes, Jeffrey D. Ullman. 4487-4496 [doi]
- Collaborative caching techniques for privacy-preserving location-based services in peer-to-peer environmentsKangsoo Jung, Seog Park. 4497-4506 [doi]
- Secure power scheduling auction for smart grids using homomorphic encryptionHaya Shajaiah, Ahmed Abdelhadi, Charles Clancy. 4507-4512 [doi]
- A top-down k-anonymization implementation for apache sparkUgur Sopaoglu, Osman Abul. 4513-4521 [doi]
- Securing the positioning signals of autonomous vehiclesShahab Tayeb, Matin Pirouz, Gabriel Esguerra, Kimiya Ghobadi, Jimson Huang, Robin Hill, Derwin Lawson, Stone Li, Tiffany Zhan, Justin Zhan, Shahram Latifi. 4522-4528 [doi]
- User-profile-based analytics for detecting cloud security breachesTrishita Tiwari, Ata Turk, Alina Oprea, Katzalin Olcoz, Ayse Kivilcim Coskun. 4529-4535 [doi]
- Event clustering & event series characterization on expected frequencyConrad M. Albrecht, Marcus Freitag, Theodore G. van Kessel, Siyuan Lu, Hendrik F. Hamann. 4536-4541 [doi]
- 'Petroleum Analytics Learning Machine' for optimizing the Internet of Things of today's digital oil field-to-refinery petroleum systemRoger N. Anderson. 4542-4545 [doi]
- Developing an edge computing platform for real-time descriptive analyticsHung Cao, Monica Wachowicz, Sangwhan Cha. 4546-4554 [doi]
- Energy efficiency driven by a storage model and analytics on a multi-system semantic integrationDomitille Couloumb, Charbel El Kaed, Ayush Garg, Chris Healey, Jonathan Healey, Stuart Sheehan. 4555-4561 [doi]
- Data driven modeling for energy consumption prediction in smart buildingsAurora González-Vidal, Alfonso P. Ramallo-González, Fernando Terroso-Saenz, Antonio F. Skarmeta. 4562-4569 [doi]
- Machine learning and air quality modelingChristoph A. Keller, Mathew J. Evans, J. Nathan Kutz, Steven Pawson. 4570-4576 [doi]
- A low maintenance particle pollution sensing system using the Minimum Airflow Particle Counter (MAPC)Theodore G. van Kessel, Ramachandran Muralidhar, Josephine B. Chang, Jun-Song Wang, Michael A. Schappert, Hendrik F. Hamann. 4577-4582 [doi]
- Distributed wireless sensing for fugitive methane leak detectionLevente J. Klein, Theodore G. van Kessel, Dhruv Nair, Ramachandran Muralidhar, Nigel Hinds, Hendrik F. Hamann, Norma Sosa. 4583-4591 [doi]
- Using big data analytics and IoT principles to keep an eye on underground infrastructureJoshua Lieberman, Alan Leidner, George Percivall, Carsten Rönsdorf. 4592-4601 [doi]
- Understanding the impact of lossy compressions on IoT smart farm analyticsAekyeung Moon, Jaeyoung Kim, Jialing Zhang, Hang Liu, Seung Woo Son. 4602-4611 [doi]
- Measures of network centricity for edge deployment of IoT applicationsDinesh C. Verma, Geeth de Mel. 4612-4620 [doi]
- Source characterization of airborne emissions using a sensor network: Examining the impact of sensor quality, quantity, and wind climatologyXiaochi Zhou, Vinícius Amaral, John D. Albertson. 4621-4629 [doi]
- Sentiment analysis via multi-layer perceptron trained by meta-heuristic optimisationDabiah Ahmed Alboaneen, Huaglory Tianfield, Yan Zhang. 4630-4635 [doi]
- Detection of hacking behaviors and communication patterns on social mediaOlga Babko-Malaya, Rebecca Cathey, Steve Hinton, David Maimon, Taissa Gladkova. 4636-4641 [doi]
- Improving cyber-attack predictions through information foragingAdam Dalton, Bonnie J. Dorr, Leon Liang, Kristy Hollingshead. 4642-4647 [doi]
- Twitter-enhanced Android malware detectionJordan DeLoach, Doina Caragea. 4648-4657 [doi]
- Deriving cyber use cases from graph projections of cyber data represented as bipartite graphsMohammed Eslami, George Zheng, Hamed Eramian, Georgiy Levchuk. 4658-4663 [doi]
- Binary malware image classification using machine learning with local binary patternJhu-Sin Luo, Dan Chia-Tien Lo. 4664-4667 [doi]
- On the relevance of social media platforms in predicting the volume and patterns of web defacement attacksDavid Maimon, Andrew Fukuda, Steve Hinton, Olga Babko-Malaya, Rebecca Cathey. 4668-4673 [doi]
- Towards a definition of cyberspace tactics, techniques and proceduresFernando Maymi, Robert Bixler, Randolph Jones, Scott Lathrop. 4674-4679 [doi]
- DNS graph mining for malicious domain detectionHau Tran, An Nguyen, Phuong Vo, Tu Vu. 4680-4685 [doi]
- Network intrusion detection using word embeddingsXiaoyan Zhuo, Jialing Zhang, Seung Woo Son. 4686-4695 [doi]
- Building industry network based on business text: Corporate disclosures and newsSung Whan Jeon, Hye-Jin Lee, Sungzoon Cho. 4696-4704 [doi]
- Predicting stock movement direction with machine learning: An extensive study on S&P 500 stocksYang Jiao, Jérémie Jakubowicz. 4705-4713 [doi]
- Credit decision tool using mobile application data for microfinance in agricultureNaomi Simumba, Suguru Okami, Naohiko Kohtake. 4714-4721 [doi]
- Analysis of national election using mathematical model of hit phenomenonMasanori Ajito, Yasuko Kawahata, Akira Ishii. 4722-4724 [doi]
- Towards a big data requirements engineering artefact model in the context of big data software development projects: Poster extended abstractDarlan Arruda, Nazim H. Madhavji. 4725-4726 [doi]
- Big data analysis of youth tobacco smoking trends in the United StatesShilpa Balan, Nishant Shristiraj, Vrunda Shah, Anusha Manjappa. 4727-4729 [doi]
- Towards scalable kernel machines for streaming data analyticsShaunak D. Bopardikar, George S. Eskander Ekladious. 4730-4732 [doi]
- Large scale app recommendation in Ant FinancialChaochao Chen, Xinxing Yang, Li Wang, Jun Zhou, Xiaolong Li. 4733-4735 [doi]
- Social media based NPL system to find and retrieve ARM data: Concept paperRanjeet Devarakonda, Michael Giansiracusa, Jitendra Kumar, Harold Shanafield. 4736-4737 [doi]
- Towards a distributed infrastructure for data-driven discoveries & analysisMohammed Elshambakey, Mohamed Khalefa, William J. Tolone, Sreyasee Das Bhattacharjee, Huikyo Lee, Luca Cinquini, Shannon Schlueter, Isaac Cho, Wenwen Dou, Daniel J. Crichton. 4738-4740 [doi]
- Anomaly detection on bipartite graphs for cyber situational awareness and threat detectionMohammed Eslami, George Zheng, Hamed Eramian, Georgiy Levchuk. 4741-4743 [doi]
- Extracting route patterns of vessels from AIS data by using topic modelIwao Fujino, Christophe Claramunt, Abdel-Ouahab Boudraa. 4744-4746 [doi]
- Big data in psychology: Using word embeddings to study theory-of-mindMichel Généreux, Bryor Snejfella, Marta Maslej. 4747-4749 [doi]
- Analyzing big ocean science data with NEXUSFrank R. Greguska, Thomas Huang, Brian Wilson, Nga Quach, Joe Jacob. 4750 [doi]
- Turning big spatial data into smart routingAbdeltawab M. Hendawi, Aqeel Rustum, Mohamed H. Ali, John A. Stankovic. 4751-4753 [doi]
- Human-controlled iterative subclustering analysisMauri Kaipainen, Olli Pitkänen, Perspicamus Ab. 4754-4756 [doi]
- Consideration of parallel data processing over an apache spark clusterKasumi Kato, Atsuko Takefusa, Hidemoto Nakada, Masato Oguchi. 4757-4759 [doi]
- Analytical the large-scale collection of data on the results of the guides for foreigners visiting JapanYasuko Kawahata, Yukari Moriyama, Shinichirou Yamada, Mingyi Sun, Taketo Kawamura. 4760-4764 [doi]
- iSkin specialist - A big data based expert system for dermatologySaleena Khanna, Yuvraj S. Sethi, Akash R. Nambiar. 4765-4767 [doi]
- Data analytics for modeling soil moisture patterns across united states ecoclimatic domainsThomas Kitson, Paula Olaya, Elizabeth Racca, Michael R. Wyatt II, Mario Guevara, Rodrigo Vargas, Michela Taufer. 4768-4770 [doi]
- Generating Unified Famous Objects (UFOs) from the classified object tablesAnusha Kola, Harshal More, Sean Soderman, Michael N. Gubanov. 4771-4773 [doi]
- Energy information collection mechanism using big data correlation mapTai-Yeon Ku, Wan-Ki Park, Hoon Choi. 4774-4776 [doi]
- Anticipating human errors from periodic big survey data in nuclear power plantsHyun-Chul Lee, Tong Il Jang, Kwangsu Moon. 4777-4778 [doi]
- MapReduce-based computation of area skyline query for selecting good locations in a mapChen Li, Annisa, Asif Zaman, Yasuhiko Morimoto. 4779-4782 [doi]
- Data analysis using hadoop MapReduce environmentPrathyushaRani Merla, Yiheng Liang. 4783-4785 [doi]
- Spatial-based topic modelling using wikidata knowledge baseKwan Hui Lim, Shanika Karunasekera, Aaron Harwood, Lucia Falzon. 4786-4788 [doi]
- The influences of deep-sea vision data quality on observational analysisLixin Liu, Jun Chen. 4789-4791 [doi]
- Data-driven approach to ensuring fault tolerance and efficiency of swarm systemsAmin Majd, Elena Troubitsyna. 4792-4794 [doi]
- A SVM approach for lightpath QoT estimation in optical transport networksJavier Mata, Ignacio de Miguel, Ramón J. Durón, Juan Carlos Aguado, Noemí Merayo, Lidia Ruiz, Patricia Fernández, Rubén M. Lorenzo, Evaristo J. Abril. 4795-4797 [doi]
- 1A study on big data I/O performance with modern storage systemsKenji Nakashima, Joichiro Kon, Saneyasu Yamaguchi, Gil Jae Lee, José A. B. Fortes. 4798-4799 [doi]
- Biofeedback EEG data integration and visualization analytics for endurance exercise practices: Data integration and visualization analytics of biofeedback EEGMonika Nawrocka, Marcin Lukowski. 4800-4802 [doi]
- A performance evaluation of Apache Kafka in support of big data streaming applicationsPaul Le Noac'h, Alexandru Costan, Luc Bougé. 4803-4806 [doi]
- Hybrid.JSON: High-velocity parallel in-memory polystore JSON ingestSteven Ortiz, Caner Enbatan, Maksim Podkorytov, Dylan Soderman, Michael N. Gubanov. 4807-4809 [doi]
- Using Bi-partite graphs to cluster complex networksKaine Black, Monica Wachowicz, Alec Parise. 4810-4812 [doi]
- ART-2b: Adapted ART-2a for large scale data clustering on PM2.5 mass spectraNat Pavasant, Hiroshi Furutani, Masayuki Numao, Ken-ichi Fukui. 4813-4815 [doi]
- Automatic keyword extraction: An ensemble methodTayfun Pay, Stephen Lucci. 4816-4818 [doi]
- The case for graph-based recommendationsIulia Popescu, Kurt Portelli, Christos Anagnostopoulos, Nikos Ntarmos. 4819-4821 [doi]
- Baselines for demographic inference on a new gold standard twitter corpusJason Radford, Luke Horgan, David Lazer. 4822-4823 [doi]
- Piloting a theory-based approach to inferring gender in big dataJason Radford. 4824-4826 [doi]
- Privacy-preserving outsourced collaborative frequent itemset mining in the cloudBharath K. Samanthula. 4827-4829 [doi]
- A study on interpretability of decision of machine learningShohei Shirataki, Saneyasu Yamaguchi. 4830-4831 [doi]
- Hybrid.media: High velocity video ingestion in an in-memory scalable analytical polystoreMark Simmons, Daniel Armstrong, Dylan Soderman, Michael N. Gubanov. 4832-4834 [doi]
- EOS: A multilingual text archive of international newspaper & blog articlesLisa Singh, Raghu Pemmaraju. 4835-4837 [doi]
- Application specific traffic control in large-scale disastersTsumugi Tairaku, Akihiro Nakao, Saneyasu Yamaguchi, Masato Oguchi. 4838-4840 [doi]
- Road safety estimation utilizing big and heterogeneous vehicle recorder dataMasashi Toyoda, Daisaku Yokoyama, Junpei Komiyama, Masahiko Itoh. 4841-4842 [doi]
- Real time analytics - State of the art: Potentials and limitations in the smart factorySebastian Trinks, Carsten Felden. 4843-4845 [doi]
- MCMalloc: A scalable memory allocator for multithreaded applications on a many-core shared-memory machineAkira Umayabara, Hayato Yamana. 4846-4848 [doi]
- Scalable spam classifier for web tablesSantiago Villasenor, Tom Nguyen, Anusha Kola, Sean Soderman, Michael N. Gubanov. 4849-4851 [doi]
- Accurate signal timing from high frequency streaming dataJonathan Wang, Kesheng Wu, Alex Sim, Seongwook Hwangbo. 4852-4854 [doi]
- Understanding the impact of sampling and noise on detecting events using twitterYifang Wei, Lisa Singh. 4855-4857 [doi]
- Attribute-based proxy re-encryption method for revocation in cloud data storageYoshiko Yasumura, Hiroki Imabayashi, Hayato Yamana. 4858-4860 [doi]
- Towards constructing a driver management system based on large-scale driving operation recordsDaisaku Yokoyama, Masashi Toyoda. 4861-4862 [doi]
- Proposal of classification method of bus operation states using sensor dataTakuya Yonezawa, Ismail Arai, Toyokazu Akiyama, Kazutoshi Fujikawa. 4863-4865 [doi]
- Understanding a moderating effect of physicians' endorsement to online workload: An empirical study in online health-care communitiesHaiyan Yu, Kun Xiang, Jiang Yu. 4866-4868 [doi]
- Towards automatic infrastructure provisioning for highly dynamic streaming applicationsPhilipp Zehnder, Dominik Riemer. 4869-4871 [doi]
- Personalized search with editable profilesBinyam A. Zemede, Byron J. Gao. 4872-4874 [doi]
- Discovering the interdisciplinary nature of big data researchYin Zhang, Jiming Hu. 4875-4877 [doi]
- Big data system for information aggregation and model comparison for precison medicineZiwei Zhu, Weijia Xu, Wei He. 4878-4880 [doi]