25.01.2015 Views

Cost-Based Optimization of Integration Flows - Datenbanken ...

Cost-Based Optimization of Integration Flows - Datenbanken ...

Cost-Based Optimization of Integration Flows - Datenbanken ...

SHOW MORE
SHOW LESS
  • No tags were found...

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

1 Introduction<br />

I can’t change the direction <strong>of</strong> the wind,<br />

but I can adjust my sails to always reach my destination.<br />

— Jimmy Dean<br />

Advances in information technology combined with rising business requirements lead to<br />

a rapidly growing amount <strong>of</strong> digital information created and replicated worldwide. For<br />

example, recent studies [IDC08, IDC10] conducted by the market research organization<br />

IDC, report a yearly information increase by over 60% to 1.2 zettabyte (ZB) in 2010.<br />

Beside this huge amount <strong>of</strong> digital information, due to technical and organizational issues,<br />

the information is distributed across numerous heterogeneous systems, applications and<br />

small devices [IDC08, Haa07]. For these reasons, the integration <strong>of</strong> heterogeneous systems<br />

and applications becomes more and more crucial for an IT infrastructure. In another<br />

recent study [Gar09], the market research organization Gartner reveals that the revenue<br />

<strong>of</strong> the worldwide application infrastructure and middleware market increased by 6.9% from<br />

14.17 billion USD in 2007 to 15.15 billion USD in 2008. In conclusion, the integration <strong>of</strong><br />

heterogeneous systems is seen as one <strong>of</strong> the biggest and most cost-extensive challenges<br />

information technology faces today [BH08].<br />

Historically, the predominant integration approaches were materialized and virtually integrated<br />

systems [DD99], where both types provide a homogeneous global view over data <strong>of</strong><br />

several source systems. Nowadays, we observe emerging requirements <strong>of</strong> complex integration<br />

tasks that (1) stretch beyond simple read-only applications, (2) involve many types <strong>of</strong><br />

heterogeneous systems and applications, and (3) require fairly complex procedural aspects.<br />

To meet these requirements, typically, imperative integration flows are modeled and executed<br />

in order to exchange data between the heterogeneous systems and applications <strong>of</strong> an<br />

IT infrastructure [HAB + 05]. There are plenty <strong>of</strong> application examples for this type <strong>of</strong> imperative<br />

integration flows, including, for instance, enterprise information systems [BH08],<br />

health care management [CHX08, GGH00], energy data management [SAP03], financial<br />

messaging [MS09], telecommunications [ACG + 08], and context-aware mobile applications<br />

[CEB + 09]. For example, large health care management system infrastructures include<br />

up to 20-120 different applications and systems [Mic07] and rely on domain-specific standards<br />

for data exchange such as HL/7 (Health Level 7) [hl707] or DICOM (Digital Imaging<br />

and Communications in Medicine) [dic09]. The deployed integration flows are repeatedly<br />

executed by an integration platform. Examples <strong>of</strong> such platforms are ETL tools (Extraction<br />

Transformation Loading) [KC04], EAI servers (Enterprise Application <strong>Integration</strong>)<br />

[Lin99] or MOM systems (Message-Oriented Middleware) [HW04], which have converged<br />

more and more in the past [Sto02, HAB + 05] and this trend is expected to continue in the<br />

future as well [HAB + 05].<br />

From a business perspective, we classify all information systems <strong>of</strong> an enterprise according<br />

to the three levels <strong>of</strong> an information system pyramid [Sch97] as operational systems,<br />

dispositive systems, or strategical systems. In this context, typically, two major use cases<br />

1

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!