2016 IEEE/ACM 2nd International Workshop on BIG Data Software Engineering (BIGDSE)

One challenge in big data analytics is the lack of tools to manage the complex interactions among code, data and parameters, especially in the common situation where all these factors can change a lot. We present our preliminary experience with DataLab, a system we build to manage the big data workflow. DataLab improves big data analytical workflow in several novel ways. 1) DataLab manages the revision...

chapter

Committees and Reviewers

2016 IEEE/ACM 2nd International Workshop on Big Data Software Engineering (BIGDSE) > viii - ix

The conference offers a note of thanks and lists its reviewers.

chapter

Streaming Software Analytics

Georgios Gousios, Dominik Safaric, Joost Visser

2016 IEEE/ACM 2nd International Workshop on Big Data Software Engineering (BIGDSE) > 8 - 11

2016 IEEE/ACM 2nd International Workshop on BIG Data Software Engineering (BIGDSE)

In this paper we present a novel software analytics infrastructure supporting for a combination of three requirements to serve software practitioners in utilising data-driven decision making: (1) Real-time insight: streaming software analytics unify static historical and current event-stream data enabling for immediate, nearly real-time insight into software quality, processes and users; (2) Query...

chapter

Understanding Quality Requirements in the Context of Big Data Systems

Ibtehal Noorwali, Darlan Arruda, Nazim H. Madhavji

2016 IEEE/ACM 2nd International Workshop on Big Data Software Engineering (BIGDSE) > 76 - 79

2016 IEEE/ACM 2nd International Workshop on BIG Data Software Engineering (BIGDSE)

While the domain of big data is anticipated to affect many aspects of human endeavour, there are numerous challenges in building big data applications among which is how to address big data characteristics in quality requirements. In this paper, we propose a novel, unified, approach for specifying big data characteristics (e.g., velocity of data arrival) in quality requirements (i.e., those requirements...

chapter

The 'BigSE' Project: Lessons Learned from Validating Industrial Text Mining

Rahul Krishna, Zhe Yu, Amritanshu Agrawal, Manuel Dominguez, more

2016 IEEE/ACM 2nd International Workshop on Big Data Software Engineering (BIGDSE) > 65 - 71

2016 IEEE/ACM 2nd International Workshop on BIG Data Software Engineering (BIGDSE)

As businesses become increasingly reliant on big data analytics, it becomes increasingly important to {\em test} the choices made within the data miners. This paper reports lessons learned from the {\em BigSE Lab}, an industrial/university collaboration that augments industrial activity with low-cost testing of data miners (by graduate students). BigSE is an experiment in academic/ industrial collaboration...

chapter

Data Model Evolution Using Object-NoSQL Mappers: Folklore or State-of-the-Art?

Andreas Ringlstetter, Stefanie Scherzinger, Tegawende F. Bissyande

2016 IEEE/ACM 2nd International Workshop on Big Data Software Engineering (BIGDSE) > 33 - 36

2016 IEEE/ACM 2nd International Workshop on BIG Data Software Engineering (BIGDSE)

In big data software engineering, the schema flexibility of NoSQL document stores is a major selling point: When the document store itself does not actively manage a schema, the data model is maintained within the application. Just like object-relational mappers for relational databases, object-NoSQL mappers are part of professional software development with NoSQL document stores. Some mappers go...

chapter

[Title page i]

2016 IEEE/ACM 2nd International Workshop on Big Data Software Engineering (BIGDSE) > i

2016 IEEE/ACM 2nd International Workshop on BIG Data Software Engineering (BIGDSE)

Presents the title page of the proceedings record.

chapter

[Title page iii]

2016 IEEE/ACM 2nd International Workshop on Big Data Software Engineering (BIGDSE) > iii

2016 IEEE/ACM 2nd International Workshop on BIG Data Software Engineering (BIGDSE)

Presents the title page of the proceedings record.

chapter

Message from the Workshop Chairs

2016 IEEE/ACM 2nd International Workshop on Big Data Software Engineering (BIGDSE) > vii

2016 IEEE/ACM 2nd International Workshop on BIG Data Software Engineering (BIGDSE)

Presents the introductory welcome message from the conference proceedings. May include the conference officers' congratulations to all involved with the conference event and publication of the proceedings record.

chapter

Sponsors and supporters

2016 IEEE/ACM 2nd International Workshop on Big Data Software Engineering (BIGDSE) > x - xii

2016 IEEE/ACM 2nd International Workshop on BIG Data Software Engineering (BIGDSE)

The conference organizers greatly appreciate the support of the various corporate sponsors listed.

chapter

Decisions as a Service for Application Centric Real Time Analytics

Patrick Tendick, Audris Mockus

2016 IEEE/ACM 2nd International Workshop on Big Data Software Engineering (BIGDSE) > 1 - 7

2016 IEEE/ACM 2nd International Workshop on BIG Data Software Engineering (BIGDSE)

The need for application-level intelligence cannot be easily satisfied with existing architectures or methodologies that separate methods and tools for application developers and data scientists. We aim, therefore, to develop a framework (an architecture and a methodology) to make it possible to add intelligence capabilities to existing applications (decision-enablement) and to facilitate building...

chapter

Predicting and Fixing Vulnerabilities before They Occur: A Big Data Approach

Hong-Mei Chen, Rick Kazman, Ira Monarch, Ping Wang

2016 IEEE/ACM 2nd International Workshop on Big Data Software Engineering (BIGDSE) > 72 - 75

2016 IEEE/ACM 2nd International Workshop on BIG Data Software Engineering (BIGDSE)

The number and variety of cyber-attacks is rapidly increasing, and the rate of new software vulnerabilities is also rising dramatically. The cybersecurity community typically reacts to attacks after they occur. Being reactive is costly and can be fatal, where attacks threaten lives, important data, or mission success. Taking a proactive approach, we are: (I) identifying potential attacks before they...

chapter

Towards a Model-Driven Design Tool for Big Data Architectures

Michele Guerriero, Saeed Tajfar, Damian Andrew Tamburri, Elisabetta Di Nitto

2016 IEEE/ACM 2nd International Workshop on Big Data Software Engineering (BIGDSE) > 37 - 43

2016 IEEE/ACM 2nd International Workshop on BIG Data Software Engineering (BIGDSE)

Big Data technologies are rapidly becoming a key enabler for modern industries. However, the entry costs inherent to ``going Big" are considerable, ranging from learning curve, renting/buying infrastructure, etc. A key component of these costs is the time spent on learning about and designing with the many big data frameworks (e.g., Spark, Storm, HadoopMR, etc.) on the market. To reduce said...

chapter

Toward Big Data Value Engineering for Innovation

Hong-Mei Chen, Rick Kazman, Juan Garbajosa, Eloy Gonzalez

2016 IEEE/ACM 2nd International Workshop on Big Data Software Engineering (BIGDSE) > 44 - 50

2016 IEEE/ACM 2nd International Workshop on BIG Data Software Engineering (BIGDSE)

This article articulates the requirements for an effective big data value engineering method. It then presents a value discovery method, called Eco-ARCH (Eco-ARCHitecture), tightly integrated with the BDD (Big Data Design) method for addressing these requirements, filling a methodological void. Eco-ARCH promotes a fundamental shift in design thinking for big data system design – from "bounded...

chapter

Exploring a Framework for Identity and Attribute Linking across Heterogeneous Data Systems

Nathan Wilder, Jared M. Smith, Audris Mockus

2016 IEEE/ACM 2nd International Workshop on Big Data Software Engineering (BIGDSE) > 19 - 25

2016 IEEE/ACM 2nd International Workshop on BIG Data Software Engineering (BIGDSE)

Online-activity-generated digital traces provide opportunities for novel services and unique insights as demonstrated in, for example, research on mining software repositories. The inability to link these traces within and among systems, such as Twitter, GitHub, or Reddit, inhibit the advances in this area. Furthermore, no single approach to integrate data from these disparate sources is likely to...

chapter

A Big Data Framework for Cloud Monitoring

Saeed Zareian, Marios Fokaefs, Hamzeh Khazaei, Marin Litoiu, more

2016 IEEE/ACM 2nd International Workshop on Big Data Software Engineering (BIGDSE) > 58 - 64

2016 IEEE/ACM 2nd International Workshop on BIG Data Software Engineering (BIGDSE)

Elasticity is a key component of modern cloud environments and monitoring is an essential part of this process. Monitoring demonstrates several challenges including gathering metrics from a variety of layers (infrastructure, platform, application), the need for fast processing of this data to enable efficient elasticity and the proper management of this data in order to facilitate analysis of current...

chapter

A Reference Architecture for Big Data Systems in the National Security Domain

John Klein, Ross Buglak, David Blockow, Troy Wuttke, more

2016 IEEE/ACM 2nd International Workshop on Big Data Software Engineering (BIGDSE) > 51 - 57

2016 IEEE/ACM 2nd International Workshop on BIG Data Software Engineering (BIGDSE)

Acquirers, system builders, and other stakeholders of big data systems need to define requirements, develop and evaluate solutions, and integrate systems together. A reference architecture enables these software engineering activities by standardizing nomenclature, defining key solution elements and their relationships, collecting relevant solution patterns, and classifying existing technologies....

Publication date

Set your own date range

Content availability

Available (21)
None (1)

Keywords

BIG DATA (10)
SOFTWARE ENGINEERING (7)
SOFTWARE (6)
COMPUTER ARCHITECTURE (4)
DATA MODELS (4)
ENGINES (3)
BIG DATA APPLICATIONS (2)
DATA MINING (2)
DATABASES (2)
MEASUREMENT (2)
REAL-TIME SYSTEMS (2)
STAKEHOLDERS (2)
ADAPTATION MODELS (1)
APPLICATION DEVELOPER (1)
ARCHITECTURE ANALYSIS (1)
ARCHITECTURE LANDSCAPE (1)
BIG DATA ARCHITECTURE (1)
BIOINFORMATICS (1)
C LANGUAGES (1)
CLOUD APPLICATIONS (1)
CLOUD COMPUTING (1)
COLLABORATION (1)
COMPUTATIONAL MODELING (1)
COMPUTER HACKING (1)
CONCEPT CLUSTERING (1)
CONTEXT (1)
DATA ANALYTICS (1)
DATA MANAGEMENT (1)
DATA MIGRATION (1)
DATA MODEL EVOLUTION (1)
DATA PRIVACY (1)
DATA PROCESSING (1)
DATA SCIENCE (1)
DATA SCIENTIST (1)
DATA VISUALIZATION (1)
DATABASES HETEROGENEITY (1)
DECISION INJECTION (1)
DECISION MAKING (1)
DECISIONS AS A SERVICE (1)
DISTRIBUTED DATABASES (1)
DOMAIN SPECIFIC LANGUAGE (1)
DSL (1)
E-DISCOVERY (1)
ECOSYSTEM (1)
ENERGY INDUSTRY (1)
ENTITY EXTRACTION (1)
ENTITY IDENTIFICATION (1)
EVENT DRIVEN APPLICATION (1)
EVIDENCE BASED DESIGN (1)
FAULT TOLERANCE (1)
FAULT TOLERANT SYSTEMS (1)
IDENTITY LINKING (1)
INDEXES (1)
INDUSTRIES (1)
INNOVATION (1)
INSTRUMENTATION (1)
JAVA (1)
JOINING PROCESSES (1)
LOADING (1)
LOGGING (1)
METADATA (1)
MODEL-DRIVEN DEVELOPMENT (1)
MONITORING (1)
MONITORING SYSTEM (1)
NATIONAL SECURITY (1)
NEXT BEST ACTION (1)
NIST (1)
NOSQL DATABASES (1)
NOSQL DATASTORES (1)
OBJECT-NOSQL MAPPERS (1)
ONTOLOGIES (1)
PATTERN MATCHING (1)
PERFORMANCE ANALYSIS (1)
PROBLEM-SOLVING (1)
QUALITY REQUIREMENTS; BIG DATA; SPECIFICATION (1)
REFERENCE ARCHITECTURE (1)
RELIABILITY (1)
REQUIREMENTS ENGINEERING (1)
REVERSE ENGINEERING (1)
SCORING (1)
SEARCH-BASED EXPERIMENTATION (1)
SECURITY (1)
SERVERS (1)
SOFTWARE ANALYTICS (1)
SOFTWARE SECURITY (1)
SPARKS (1)
STANDARDS (1)
STORMS (1)
STREAMING (1)
TECHNOLOGICAL INNOVATION (1)
TESTING (1)
TEXT MINING (1)
THROUGHPUT (1)
TRANSIENT ANALYSIS (1)
TWITTER (1)
VALUE DISCOVERY (1)
VALUE ENGINEERING (1)
VERSION CONTROL (1)
more

INFONA - science communication portal

2016 IEEE/ACM 2nd International Workshop on BIG Data Software Engineering (BIGDSE)