Big Data Analytics in Cybersecurity First Edition Deng - The ebook in PDF format with all chapters is ready for download
Big Data Analytics in Cybersecurity First Edition Deng - The ebook in PDF format with all chapters is ready for download
com
https://textbookfull.com/product/big-data-analytics-in-
cybersecurity-first-edition-deng/
OR CLICK HERE
DOWLOAD EBOOK
https://textbookfull.com/product/leadership-strategies-in-the-age-of-
big-data-algorithms-and-analytics-first-edition-norton-paley/
textbookfull.com
https://textbookfull.com/product/from-big-data-to-big-profits-success-
with-data-and-analytics-1st-edition-russell-walker/
textbookfull.com
https://textbookfull.com/product/big-data-analytics-systems-
algorithms-applications-c-s-r-prabhu/
textbookfull.com
https://textbookfull.com/product/big-data-and-analytics-for-
insurers-1st-edition-boobier/
textbookfull.com
Big Data Analytics with Java 1st Edition Rajat Mehta
https://textbookfull.com/product/big-data-analytics-with-java-1st-
edition-rajat-mehta/
textbookfull.com
https://textbookfull.com/product/big-data-analytics-for-large-scale-
multimedia-search-stefanos-vrochidis/
textbookfull.com
https://textbookfull.com/product/big-data-analytics-for-intelligent-
healthcare-management-1st-edition-nilanjan-dey/
textbookfull.com
https://textbookfull.com/product/understanding-azure-data-factory-
operationalizing-big-data-and-advanced-analytics-solutions-sudhir-
rawat/
textbookfull.com
Big Data Analytics
in Cybersecurity
Data Analytics Applications
Series Editor: Jay Liebowitz
PUBLISHED
FORTHCOMING
Edited by
Onur Savas
Julia Deng
CRC Press
Taylor & Francis Group
6000 Broken Sound Parkway NW, Suite 300
Boca Raton, FL 33487-2742
This book contains information obtained from authentic and highly regarded sources. Reasonable efforts
have been made to publish reliable data and information, but the author and publisher cannot assume
responsibility for the validity of all materials or the consequences of their use. The authors and publishers
have attempted to trace the copyright holders of all material reproduced in this publication and apologize
to copyright holders if permission to publish in this form has not been obtained. If any copyright material
has not been acknowledged please write and let us know so we may rectify in any future reprint.
Except as permitted under U.S. Copyright Law, no part of this book may be reprinted, reproduced, trans-
mitted, or utilized in any form by any electronic, mechanical, or other means, now known or hereafter
invented, including photocopying, microfilming, and recording, or in any information storage or retrieval
system, without written permission from the publishers.
For permission to photocopy or use material electronically from this work, please access www.copyright
.com (http://www.copyright.com/) or contact the Copyright Clearance Center, Inc. (CCC), 222 Rosewood
Drive, Danvers, MA 01923, 978-750-8400. CCC is a not-for-profit organization that provides licenses and
registration for a variety of users. For organizations that have been granted a photocopy license by the
CCC, a separate system of payment has been arranged.
Trademark Notice: Product or corporate names may be trademarks or registered trademarks, and are
used only for identification and explanation without intent to infringe.
Preface................................................................................................................vii
About the Editors..............................................................................................xiii
Contributors....................................................................................................... xv
6 Cybersecurity Training.......................................................................115
BOB POKORNY
v
vi ◾ Contents
Index............................................................................................................329
Preface
vii
viii ◾ Preface
that the organization expects staff to apply; (2) assuming that new cybersecurity
staff who recently received degrees or certificates in cybersecurity will know what is
required; or (3) requiring cybersecurity personnel to read about new threats.
Chapter 7, “Machine Unlearning: Repairing Learning Models in Adversarial
Environments,” is written by Professor Yinzhi Cao of Lehigh University. Motivated
by the fact that today’s systems produce a rapidly exploding amount of data, and
the data further derives more data, this forms a complex data propagation network
that we call the data’s lineage. There are many reasons that users want systems to
forget certain data including its lineage for privacy, security, and usability reasons.
In this chapter, the author introduces a new concept machine unlearning, or simply
unlearning, capable of forgetting certain data and their lineages in learning models
completely and quickly. The chapter presents a general, efficient unlearning approach
by transforming learning algorithms used by a system into a summation form.
Chapter 8, “Big Data Analytics for Mobile App Security,” is written by
Professor Doina Caragea of Kansas State University, and Professor Xinming Ou of
the University of South Florida. This chapter describes mobile app security analysis,
one of the new emerging cybersecurity issues with rapidly increasing requirements
introduced by the predominant use of mobile devices in people’s daily lives, and dis-
cusses how big data techniques such as machine learning (ML) can be leveraged for
analyzing mobile applications such as Android for security problems, in particular
malware detection. This chapter also demonstrates the impact of some challenges
on some existing machine learning-based approaches, and is particularly written to
encourage the practice of employing a better evaluation strategy and better designs
of future machine learning-based approaches for Android malware detection.
Chapter 9, “Security, Privacy, and Trust in Cloud Computing,” is written by
Ruiwen Li, Songjie Cai, and Professor Yuhong Liu Ruiwen Li, and Songjie Cai of
Santa Clara University, and Professor Yan (Lindsay) Sun of the University of Rhode
Island. Cloud computing is revolutionizing the cyberspace by enabling conve-
nient, on-demand network access to a large shared pool of configurable computing
resources (e.g., networks, servers, storage, applications, and services) that can be rap-
idly provisioned and released. While cloud computing is gaining popularity, diverse
security, privacy, and trust issues are emerging, which hinders the rapid adoption of
this new computing paradigm. This chapter introduces important concepts, mod-
els, key technologies, and unique characteristics of cloud computing, which helps
readers better understand the fundamental reasons for current security, privacy, and
trust issues in cloud computing. Furthermore, critical security, privacy and trust
challenges, and the corresponding state-of-the-art solutions are categorized and dis-
cussed in detail, and followed by future research directions.
Chapter 10, “Cybersecurity in Internet of Things (IoT),” is written by Wenlin Han
and Professor Yang Xiao of the University of Alabama. This chapter introduces the
IoT as one of the most rapidly expanding cybersecurity domains, and presents the
big data challenges faced by IoT, as well as various security requirements and issues
in IoT. IoT is a giant network containing various applications and systems with
Preface ◾ xi
heterogeneous devices, data sources, protocols, data formats, and so on. Thus, the
data in IoT is extremely heterogeneous and big, and this poses heterogeneous big data
security and management problems. This chapter describes current solutions and also
outlines how big data analytics can address security issues in IoT when facing big data.
Chapter 11, “Big Data Analytics for Security in Fog Computing,” is written by
Shanhe Yi and Professor Qun Li of the College of William and Mary. Fog comput-
ing is a new computing paradigm that can provide elastic resources at the edge of
the Internet to enable many new applications and services. This chapter discusses
how big data analytics can come out of the cloud and into the fog, and how security
problems in fog computing can be solved using big data analytics. The chapter also
discusses the challenges and potential solutions of each problem and highlights
some opportunities by surveying existing work in fog computing.
Chapter 12, “Analyzing Deviant Socio-Technical Behaviors using Social
Network Analysis and Cyber Forensics-Based Methodologies,” is written by Samer
Al-khateeb, Muhammad Hussain, and Professor Nitin Agarwal of the University
of Arkansas at Little Rock. In today’s information technology age, our thinking
and behaviors are highly influenced by what we see online. However, misinfor-
mation is rampant. Deviant groups use social media (e.g., Facebook) to coordi-
nate cyber campaigns to achieve strategic goals, influence mass thinking, and steer
behaviors or perspectives about an event. The chapter employs computational social
network analysis and cyber forensics informed methodologies to study information
competitors who seek to take the initiative and the strategic message away from the
main event in order to further their own agenda (via misleading, deception, etc.).
Chapter 13, “Security Tools for Cybersecurity,” is written by Matthew Matchen
of Braxton-Grant Technologies. This chapter takes a purely practical approach to
cybersecurity. When people are prepared to apply cybersecurity ideas and theory to
practical applications in the real world, they equip themselves with tools to better
enable the successful outcome of their efforts. However, choosing the right tools
has always been a challenge. The focus of this chapter is to identify functional areas
in which cybersecurity tools are available and to list examples in each area to dem-
onstrate how tools are better suited to provide insight in one area over the other.
Chapter 14, “Data and Research Initiatives for Cybersecurity,” is written by the
editors of this book. We have been motivated by the fact that big data based cyber-
security analytics is a data-centric approach. Its ultimate goal is to utilize available
technology solutions to make sense of the wealth of relevant cyber data and turn-
ing it into actionable insights that can be used to improve the current practices
of network operators and administrators. Hence, this chapter aims at introducing
relevant data sources for cybersecurity analysis, such as benchmark datasets for
cybersecurity evaluation and testing, and certain research repositories where real
world cybersecurity datasets, tools, models, and methodologies can be found to
support research and development among cybersecurity researchers. In addition,
some insights are added for the future directions on data sharing for big data based
cybersecurity analysis.
http://taylorandfrancis.com
About the Editors
Dr. Onur Savas is a data scientist at Intelligent Automation, Inc. (IAI), Rockville,
MD. As a data scientist, he performs research and development (R&D), leads a
team of data scientists, software engineers, and programmers, and contributes to
IAI’s increasing portfolio of products. He has more than 10 years of R&D expertise
in the areas of networks and security, social media, distributed algorithms, sen-
sors, and statistics. His recent work focuses on all aspects of big data analytics and
cloud computing with applications to network management, cybersecurity, and
social networks. Dr. Savas has a PhD in electrical and computer engineering from
Boston University, Boston, MA, and is the author of numerous publications in
leading journals and conferences. At IAI, he has been the recipient of various R&D
contracts from DARPA, ONR, ARL, AFRL, CTTSO, NASA, and other federal
agencies. His work at IAI has contributed to the development and commercializa-
tion of IAI’s social media analytics tool Scraawl® (www.scraawl.com).
Dr. Julia Deng is a principal scientist and Sr. Director of Network and Security
Group at Intelligent Automation, Inc. (IAI), Rockville, MD. She leads a team of
more than 40 scientists and engineers, and during her tenure at IAI, she has been
instrumental in growing IAI’s research portfolio in networks and cybersecurity. In
her role as a principal investigator and principal scientist, she initiated and directed
numerous R&D programs in the areas of airborne networks, cybersecurity, net-
work management, wireless networks, trusted computing, embedded system, cog-
nitive radio networks, big data analytics, and cloud computing. Dr. Deng has a
PhD from the University of Cincinnati, Cincinnati, OH, and has published over
30 papers in leading international journals and conference proceedings.
xiii
http://taylorandfrancis.com
Contributors
Yi Cheng Qun Li
Intelligent Automation, Inc. College of William and Mary
Rockville, Maryland Williamsburg, Virginia
xv
xvi ◾ Contributors
Bob Pokorny
Intelligent Automation, Inc.
Rockville, Maryland
APPLYING BIG I
DATA INTO
DIFFERENT
CYBERSECURITY
ASPECTS
http://taylorandfrancis.com
Chapter 1
Contents
1.1 Introduction to Big Data Analytics...............................................................4
1.1.1 What Is Big Data Analytics?..............................................................4
1.1.2 Differences between Traditional Analytics and Big Data Analytics....4
1.1.2.1 Distributed Storage..............................................................5
1.1.2.2 Support for Unstructured Data............................................5
1.1.2.3 Fast Data Processing............................................................6
1.1.3 Big Data Ecosystem...........................................................................7
1.2 The Need for Big Data Analytics in Cybersecurity........................................8
1.2.1 Limitations of Traditional Security Mechanisms...............................9
1.2.2 The Evolving Threat Landscape Requires New Security
Approaches......................................................................................10
1.2.3 Big Data Analytics Offers New Opportunities to Cybersecurity......11
1.3 Applying Big Data Analytics in Cybersecurity............................................11
1.3.1 The Category of Current Solutions..................................................11
1.3.2 Big Data Security Analytics Architecture........................................12
1.3.3 Use Cases.........................................................................................13
1.3.3.1 Data Retention/Access.......................................................13
1.3.3.2 Context Enrichment..........................................................14
1.3.3.3 Anomaly Detection...........................................................15
1.4 Challenges to Big Data Analytics for Cybersecurity....................................18
References............................................................................................................20
3
4 ◾ Big Data Analytics in Cybersecurity
This chapter introduces big data analytics and highlights the needs and importance
of applying big data analytics in cybersecurity to fight against the evolving threat
landscape. It also describes the typical usage of big data security analytics including
its solution domains, architecture, typical use cases, and the challenges. Big data
analytics, as an emerging analytical technology, offers the capability to collect,
store, process, and visualize big data, which are so large or complex that traditional
data processing applications are inadequate to deal with them. Cybersecurity, at
the same time, is experiencing the big data challenge due to the rapidly growing
complexity of networks (e.g., virtualization, smart devices, wireless connections,
Internet of Things, etc.) and increasing sophisticated threats (e.g., malware, multi-
stage, advanced persistent threats [APTs], etc.). Accordingly, traditional cybersecu-
rity tools become ineffective and inadequate in addressing these challenges and big
data analytics technology brings in its advantages, and applying big data analytics
in cybersecurity becomes critical and a new trend.
of big data that need to be updated frequently or even continually. Big data analyt-
ics is able to deal with them well by applying distributed storage and distributed
in-memory processing.
1.1.2.1 Distributed Storage
“Volume” is the first “V” of Gartner’s definition of big data. One key feature of big
data is that it usually relies on distributed storage systems because the data is
so massive (often at the petabyte or higher level) that it is impossible for a single
node to store or process it. Big data also requires the storage system to scale up with
future growth. Hyperscale computing environments, used by major big data com-
panies such as Google, Facebook, and Apple, satisfy big data’s storage requirements
by constructing from a vast number of commodity servers with direct-attached
storage (DAS).
Many big data practitioners build their hyberscale computing environments
using Hadoop [2] clusters. Initiated by Google, Apache Hadoop is an open-source
software framework for distributed storage and distributed processing of very large
data sets on computer clusters built from commodity hardware. There are two key
components in Hadoop:
◾◾ HDFS (Hadoop distributed file system): a distributed file system that stores
data across multiple nodes
◾◾ MapReduce: a programming model that processes data in parallel across
multiple nodes
Under MapReduce, queries are split and distributed across parallel nodes and
processed in parallel (the Map step). The results are then gathered and delivered (the
Reduce step). This approach takes advantage of data locality—nodes manipulating
the data they have access to—to allow the dataset to be processed faster and more
efficiently than it would be in conventional supercomputer architecture [3].
◾◾ Infrastructure
Infrastructure is the fundamental part of the big data technology. It stores,
processes, and sometimes analyzes data. As discussed earlier, big data infra-
structure is capable of handling both structured and unstructured data at
large volumes and fast speed. It supports a vast variety of data, and makes it
possible to run applications on systems with thousands of nodes, potentially
Cross-infrastructure/analytics
Open source
Last updated 3/23/2016 Matt Turck (@mattturck), Jim Hao (@jimrhao), and FirstMark Capital (@firstmarkcap)
◾◾ A basic data storage platform to support long-term log data retention and
batch processing jobs. There are a few offerings in the market that skip this
layer and use a single NoSQL database to support all the data retention,
investigation access, and analytics. However, considering all the available
open-source applications in the Hadoop ecosystem, a Hadoop-based plat-
form still gives a more economical, reliable, and flexible data solution for
larger data sets.
◾◾ A data access layer with fast query response performance to support inves-
tigation queries and drill-downs. Because the data access inside Hadoop
Services/apps
Data presentation
Integration
consumption
Data access
Data
Data storage
63
La hermosissima Balaja,
que llorosa en su aposento
las sinrazones del Rey
le pagavan sus cabellos
como tanto estruendo oyò
a un valcon salio corriendo,
y enmudecida le dixo,
dando vozes con silencio:
Vete en paz, que no vas solo,
y en mi ausencia ten consuelo,
que quien te echò de Xerez,
vno te echara de mi pecho:
El con la vista responde,
yo me voy, y no te dexo.
De las agravios de Rey
para tu firmeza a pelo,
Con esto passò la calle,
los ojos atras bolviendo
dos mil vezes: y de Andujar
tomò el camino derecho.
Frontefrida, Frontefrida,
Frontefrida, y con amor,
Do todas las avecicas
Van tomar consolacion, &c.
82 It commences thus:
La M madre te muestra,
La A te manda adorar, &c.
The following are the two first strophes, and the rhythmic
structure of the rest is not less beautiful.
Or:—
115 The following are the first and second strophes of this
song. Love is here a hell, in which the thoughts burn.
LA GLOSA DE PINAR.
Mote.
Our website is not just a platform for buying books, but a bridge
connecting readers to the timeless values of culture and wisdom. With
an elegant, user-friendly interface and an intelligent search system,
we are committed to providing a quick and convenient shopping
experience. Additionally, our special promotions and home delivery
services ensure that you save time and fully enjoy the joy of reading.
textbookfull.com