10.07.2015 Views

ALIO–EURO 2011 Proceedings - Faculdade de Engenharia da ...

ALIO–EURO 2011 Proceedings - Faculdade de Engenharia da ...

ALIO–EURO 2011 Proceedings - Faculdade de Engenharia da ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Published by: ALIO-EURO <strong>2011</strong>May 4-6 <strong>2011</strong>http://www.dcc.fc.up.pt/ALIO-EURO-<strong>2011</strong>/Sponsors:– Cámara Municipal do Porto– Fun<strong>da</strong>ção para o Desenvolvimento Social do Porto– Porto Ci<strong>da</strong><strong>de</strong> <strong>de</strong> Ciência– Universi<strong>da</strong><strong>de</strong> do Porto– Fun<strong>da</strong>ção para a Ciência e a TecnologiaInstitutional support:– Asociación Latino-Iberoamericana <strong>de</strong> Investigación Operativa– Association of European Operational Research Societies– Instituto <strong>de</strong> <strong>Engenharia</strong> <strong>de</strong> Sistemas e Computadores do Porto– <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> Ciências <strong>da</strong> Universi<strong>da</strong><strong>de</strong> do Porto– Associação Portuguesa <strong>de</strong> Investigação Operacional


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Welcome NoteDear Conference Participant,It is our great pleasure to welcome you to Porto and to the 7 th edition of the ALIO-EURO workshop in AppliedCombinatorial Optimization.Porto is a city full of tradition and contrasting mo<strong>de</strong>rnity. House of some of the most awar<strong>de</strong>d contemporaryarchitects in the world, here you can find mo<strong>de</strong>rn vibrating buildings si<strong>de</strong> by si<strong>de</strong> with walls that preservecenturies of History. You can make a toast (always with Port Wine) at the mo<strong>de</strong>rnist concert hall building ofCasa <strong>da</strong> Música (House of the Music) or at the old cellars in Vila Nova <strong>de</strong> Gaia, on the left bank of river Douro.You can explore the renowned contemporary art museum of Serralves and enjoy its won<strong>de</strong>rful gar<strong>de</strong>ns. A strollin the city park, towards the seasi<strong>de</strong> and the mouth of river Douro is also a must for those who like walking.Plenty of interesting activities that we expect will contribute for good moments of leisure after the workshop.In ALIO-EURO <strong>2011</strong> there will be presentations covering a wi<strong>de</strong> range of subjects – over 70 high quality presentationsand 4 keynote talks by distinguished researchers. We are very grateful to all authors for contributingto the success of the workshop. We hope that this selection will provi<strong>de</strong> each of you with opportunities to learnsomething new, to discuss and exchange research i<strong>de</strong>as with other colleagues and to start new collaborations.The high quality of the program is also due to the strong engagement of the Program Committee and ClusterOrganizers in a thorough reviewing process. To all of them we address our sincere acknowledgment.To conclu<strong>de</strong>, we are grateful to the Faculty of Sciences of the University of Porto for hosting the workshop andfor providing all the required facilities, and to all sponsors for the financial support provi<strong>de</strong>d.We wish you a pleasant and fruitful stay in Porto.The Organizing CommitteeALIO–EURO <strong>2011</strong> – i


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Local Organizing Committee:Ana Viana (chair), Instituto Politécnico do Porto / INESC PortoA, Miguel Gomes, <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> <strong>Engenharia</strong> <strong>da</strong> Universi<strong>da</strong><strong>de</strong> do Porto / INESC PortoJoão Pedro Pedroso, <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> Ciências <strong>da</strong> Universi<strong>da</strong><strong>de</strong> do Porto / INESC PortoMaria Teresa Costa, Instituto Politécnico do Porto / INESC PortoProgram Committee:Ana Viana (Portugal)Andrés Weintraub (Chile)A. Miguel Gomes (Portugal)Celso C. Ribeiro (Brazil)Chris Potts (UK)Hector Cancela (Uruguay)Horacio Yanasse (Brazil)Irene Loiseau (Argentina)J. Valério <strong>de</strong> Carvalho (Portugal)João Pedro Pedroso (Portugal)M. Grazia Speranza (Italy)Margari<strong>da</strong> Vaz Pato (Portugal)Maria Teresa Costa (Portugal)Maria Urquhart (Uruguay)Olivier Hudry (France)Paolo Toth (Italy)Rafael Martí (Spain)Ramon Alvarez-Val<strong>de</strong>s (Spain)Richard F. Hartl (Austria)Rolf Möhring (Germany)ALIO–EURO <strong>2011</strong> – ii


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>TABLE OF CONTENTSPlenary TalksMoehring R.Routing in Graphs with Applications to Logistics and Traffic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1Ronconi Debora P.Recent Developments in Optimization Methods for Scheduling Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2Constantino MiguelSpatial Forest Optimization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4Lodi AndreaOn Bilevel Programming and its Implications for Mixed Integer Linear Programming . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5Session 1A – Energy IDulce Costa, C. Henggeler Antunes, A. Gomes MartinsMulti-Objective Evolutionary Algorithms for Reactive Power Planning in Electrical Distribution Systems: A Comparative CaseStudy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6Ana Viana, Joao Pedro PedrosoA new MIP based approach for Unit Commitment in power production planning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9Jessica Pillon Torralba Fernan<strong>de</strong>s, Paulo <strong>de</strong> Barros CorreiaDispatch Hydroelectric Power Plant using Genetic Algorithm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13Session 1B – Multiobjective Evolutionary AlgorithmsNail El-Sourani, Markus BorschbachAlgebraic Group Theory driven Divi<strong>de</strong> and Evolve of multi-objective Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18Antonio L. Marquez, Consolacion Gil, Raul Banos, Antonio Fernan<strong>de</strong>zMulti-objective Evolutionary Course Timetabling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22R. Li, R. Etemaadi, M.T.M. Emmerich, M.R.V. ChaudronAutomated Design of Software Architectures for Embed<strong>de</strong>d Systems using Evolutionary Multiobjective Optimization . . . . . . . . . . . . . . 26Session 1C – Graph TheoryLilian Markenzon, Paulo R.C. Pereira, Christina F.E.M. WagaNew Characterizations for Subfamilies of Chor<strong>da</strong>l Graphs. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30Gustavo Silva Semaan, Jose Brito, Luiz Satoru OchiEfficient Algorithms for Regionalization: an Approach Based on Graph Partition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34ALIO–EURO <strong>2011</strong> – iii


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Cristina Requejo, Eulalia SantosLagrangean based algorithms for the Weight-Constrained Minimum Spanning Tree Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38Session 2A – Cutting and Packing ILuigi <strong>de</strong> Giovanni, Gionata Massi, Ferdinando Pezzella, Marc E. Pfetsch, Giovanni Rinaldi, Paolo VenturaA Heuristic and an Exact Method for Pattern Sequencing Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42Isabel Cristina Lopes, Jose Valerio <strong>de</strong> CarvalhoAn integer programming framework for sequencing cutting patterns based on interval graph completion . . . . . . . . . . . . . . . . . . . . . . . . . . . 47Session 2B – Metaheuristics FrameworksIgor Machado Coelho, Pablo Luiz Araujo Munhoz, Matheus Nohra Had<strong>da</strong>d, Vitor Nazario Coelho, Marcos <strong>de</strong> Melo <strong>da</strong> Silva,Marcone Jamilson Freitas Souza, Luiz Satoru OchiOPTFRAME: A Computational Framework for Combinatorial Optimization Problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51Dorabela Gamboa, Cesar RegoRAMP: An Overview of Recent Advances and Applications. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55Session 2C – Lot Sizing and SchedulingAgostinho Agra, Mahdi DoostmohammadiA Polyhedral Study of Mixed 0-1 Sets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57Wilco van <strong>de</strong>n Heuvel, H. Edwin Romeijn, Dolores Romero Morales, Albert P.M. WagelmansMulti-Objective Economic Lot-Sizing Mo<strong>de</strong>ls . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60Session 3A – Cutting and Packing IILeonardo Junqueira, Jose Fernando Oliveira, Maria Antonia Carravilla, Reinaldo MorabitoAn Optimization Mo<strong>de</strong>l for the Traveling Salesman Problem with Three-Dimensional Loading Constraints . . . . . . . . . . . . . . . . . . . . . . . . 64Marisa Oliveira, Eduar<strong>da</strong> Pinto Ferreira, A. Miguel GomesRect-TOPOS: A constructive heuristic for the rectilinear packing area minimization problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66Pedro Bras, Claudio Alves, Jose Valerio <strong>de</strong> CarvalhoLocal search methods for leather nesting problems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70Antonio Martinez Sykora, Ramon Alvarez-Val<strong>de</strong>s, Jose Manuel TamaritNesting Problems: mixed integer formulations and valid inequalities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73Session 3B – MatheuristicsMarco A. Boschetti, Vittorio Maniezzo, Matteo Roffilli, Antonio Jose Bolufe RohlerMatheuristics for Traffic Counter Location. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77Mauro Dell’Amico, Simone Falavigna, Manuel IoriA Matheuristic Algorithm for Auto-Carrier Transportation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81ALIO–EURO <strong>2011</strong> – iv


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Davi<strong>de</strong> Anghinolfi, Massimo PaolucciA new MIP Heuristic based on Randomized Neighborhood Search . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 85Stefanie KosuchTowards an Ant Colony Optimization algorithm for the Two-Stage Knapsack problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89Session 3C – Applications of Combinatorial Optimization IYang Zhang, Horst BaierOptimal Parts Allocation for Structural Systems via Improved Initial Solution Generation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93John Gunnar CarlssonPartitioning a service region among several vehicles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97Margari<strong>da</strong> Vaz Pato, Helenice <strong>de</strong> Oliveira FlorentinoA bi-objective approach for selection of sugarcane varieties in Brazilian companies . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102Jose Brito, Nelson Maculan, Luiz Satoru Ochi, Flavio Montenegro, Luciana BritoAn Imputation Algorithm Applied to the Nonresponse Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104Session 4A – Cutting and Packing IIIJ. Alejandro Zepe<strong>da</strong>, Victor Para<strong>da</strong>, Gustavo Gatica, Mauricio Sepulve<strong>da</strong>Automatic Generation of Algorithms for the Non Guillotine Cutting Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108Jannes Verstichel, Patrick De Causmaecker, Greet Van<strong>de</strong>n BergheEnhancements to the best fit heuristic for the orthogonal stock-cutting problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112Antonio Fernan<strong>de</strong>z, Consolacion Gil, Raul Banos, Antonio L. Marquez, M.G. Montoya, M. ParraBi-dimensional Bin-packing Problem: A Multiobjective Approach . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116Ernesto G. Birgin, Rafael D. Lobato, Reinaldo MorabitoA recursive partitioning approach for generating unconstrained two-dimensional non-guillotine cutting patterns . . . . . . . . . . . . . . . . . . . . 119Session 4B – Scheduling and Metaheuristics IFilipe Bran<strong>da</strong>o, Joao Pedro PedrosoA Complete Search Method For Relaxed Traveling Tournament Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 122Fulgencia Villa, Ramon Alvarez-Val<strong>de</strong>s, Jose Manuel TamaritA Hybrid Algorithm for Minimizing Earliness-Tardiness Penalties in Parallel Machines . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 125Esteban Peruyero, Angel A. Juan, Daniel RieraA hybrid algorithm combining heuristics with Monte Carlo simulation to solve the Stochastic Flow Shop Problem . . . . . . . . . . . . . . . . . . 129Angel A. Juan, Javier Faulin, Daniel Riera, Jose Caceres, Scott GrasmanA Simulation-based algorithm for solving the Vehicle Routing Problem with Stochastic Demands . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133Session 4C – Vehicle Routing ProblemTeresa Bianchi-Aguiar, Maria Antonia Carravilla, Jose Fernando OliveiraVehicle routing for mixed solid waste collection – comparing alternative hierarchical formulations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137ALIO–EURO <strong>2011</strong> – v


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Said Dabia, Stefan Ropke, Tom Van Woensel, Ton De KokBranch and Cut and Price for the Time Depen<strong>de</strong>nt Vehicle Routing Problem with Time Windows . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 141Sabir Ribas, Anand Subramanian, Igor Machado Coelho, Luiz Satoru Ochi, Marcone Jamilson Freitas SouzaAn algorithm based on Iterated Local Search and Set Partitioning for the Vehicle Routing Problem with Time Windows . . . . . . . . . . . . . 145Agostinho Agra, Marielle Christiansen, Alexandrino DelgadoA medium term short sea fuel oil distribution problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 149Session 5A – Energy IIMargari<strong>da</strong> Carvalho, Joao Pedro Pedroso, Joao SaraivaNash Equilibria in Electricity Markets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 153Teresa NogueiraApplication of Combinatorial Optimization in Natural Gas System Operation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 157Renan S. Maciel, Mauro <strong>de</strong> Rosa, Vladimiro Miran<strong>da</strong>, Antonio Padilha-FeltrinA Multi-objective EPSO for Distributed Energy Resources Planning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 159Session 5B – Mathematical ProgramingLaureano F. Escu<strong>de</strong>ro, M. Araceli Garin, Maria Merino, Gloria PerezOn using preprocessing: Cuts i<strong>de</strong>ntification and probing schemes in stochastic mixed 0-1 and combinatorial optimization . . . . . . . . . . . 163Laureano F. Escu<strong>de</strong>ro, M. Araceli Garin, Gloria Perez, A. UnzuetaScenario cluster lagrangean <strong>de</strong>composition in stochastic mixed integer programming . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 167Vincent Raymond, Francois Soumis, Ab<strong>de</strong>lmoutalib Metrane, Mehdi Towhidi, Jacques DesrosiersPositive Edge: A Pricing Criterion for the I<strong>de</strong>ntification of Non-<strong>de</strong>generate Simplex Pivots . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171Session 5C – HealthHumberto Rocha, Joana M. Dias, Brigi<strong>da</strong> C. Ferreira, Maria do Carmo LopesOn the transition from fluence map optimization to fluence map <strong>de</strong>livery in intensity modulated radiation therapy treatment planning . 173Sophie N. Parragh, Verena SchmidHybrid large neighborhood search for the dial-a-ri<strong>de</strong> problem. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 177Ines Marques, M. Eugenia Captivo, Margari<strong>da</strong> Vaz PatoAn integer programming approach for elective surgery scheduling in a Lisbon hospital . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 181Session 6A – Logistics IPedro Amorim, Hans-Otto Gunther, Bernardo Alma<strong>da</strong>-LoboTackling Freshness in Supply Chain Planning of Perishable Products . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 184Yajaira Cardona-Val<strong>de</strong>s, A<strong>da</strong> Alvarez, Joaquin PachecoApproaching a robust bi-objective supply chain <strong>de</strong>sign problem by a metaheuristic procedure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 188ALIO–EURO <strong>2011</strong> – vi


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Session 6B – Scheduling and Metaheuristics IINicolau Santos, Joao Pedro PedrosoA Tabu Search Approach for the Hybrid Flow Shop . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 192Jan RiezebosSequencing approaches in Synchronous Manufacturing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 195Session 6C – TelecomunicationsMichael Poss, Christian RaackAffine recourse for the robust network <strong>de</strong>sign problem: between static and dynamic routing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 198Adilson Elias Xavier, Claudio Martagao Gesteira, Henrique Pacca Loureiro LunaSolving a Hub Location Problem by the Hyperbolic Smoothing Approach . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 202Session 7A – Logistics IITania Rodrigues Pereira Ramos, Maria Isabel Gomes, Ana Paula Barbosa-PovoaA hybrid method to solve a multi-product, multi-<strong>de</strong>pot vehicle routing problem arising in a recyclable waste collection system . . . . . . . 206Sonia R. Cardoso, Ana Paula Barbosa-Povoa, Susana RelvasDesign and Planning of Supply Chains with Integrated Forward and Reverse Decisions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 210Xiaoyun Bing, Jacqueline Bloemhof, Jack van <strong>de</strong>r VorstReverse Logistics Network Design for Household Plastic Waste . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 212Juan Pablo Soto, Rosa Colome Perales, Marcus ThiellReverse Cross Docking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 215Session 7B – Timetabling and RosteringMarta Mesquita, Margari<strong>da</strong> Moz, Ana Paias, Margari<strong>da</strong> Vaz PatoComparing Roster Patterns within a Single Depot Vehicle-Crew-Roster Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 218Marta Rocha, Jose Fernando Oliveira, Maria Antonia CarravillaInsights on the exact resolution of the rostering problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 222Dario Lan<strong>da</strong>-Silva, Joe Henry ObitComparing Hybrid Constructive Heuristics for University Course Timetabling . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 224Session 7C – Applications of Combinatorial Optimization IIAgostinho Agra, Jorge Orestes Cer<strong>de</strong>ira, Cristina RequejoLower and upper bounds for large size instances of the optimal diversity management problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 228Luiza Amalia Pinto Cantao, Ricardo Coelho Silva, Akebo YamakamiContinous Ant Colony System Applied to Optimization Problems with Fuzzy Coefficients . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 231Teresa Neto, Miguel Constantino, Joao Pedro Pedroso, Isabel MartinsA tree search procedure for forest harvest scheduling problems addressing aspects of habitat availability . . . . . . . . . . . . . . . . . . . . . . . . . . . 235ALIO–EURO <strong>2011</strong> – vii


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Session 8A – Stochastic Local SearchJeremie Dubois-Lacoste, Manuel Lopez-Ibanez, Thomas StutzleAutomatic Configuration of TPLS+PLS Algorithms for Bi-objective Flow-Shop Scheduling Problems. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 239Luis Paquete, Jose Luis Santos, Daniel VazEfficient paths by local search . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 243Iryna Yevseyeva, Jorge Pinho <strong>de</strong> Sousa, Ana VianaSolving a Multiobjective Flowshop Scheduling Problem by GRASP with Path-relinking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 246Session 8B – Column Generation and MetaheuristicsMarkus Leitner, Mario Ruthmair, Gunther R. RaidlStabilized Column Generation for the Rooted Delay-Constrained Steiner Tree Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 250Martin Wolkerstorfer, Tomas NordstromHeuristics for Discrete Power Control – A Case-Study in Multi-Carrier DSL Networks. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 254Dorabella Santos, Amaro <strong>de</strong> Sousa, Filipe AlvelosA Hybrid Meta-Heuristic for the Network Load Balancing Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 258Session 8C – Approximation AlgorithmsAntonio Alonso Ayuso, Laureano F. Escu<strong>de</strong>ro, Francisco Javier Martin CampoMo<strong>de</strong>ling the collision avoi<strong>da</strong>nce for the ATM by a mixed 0–1 nonlinear approach . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 260Richard Dobson, Kathleen SteinhofelLow Energy Scheduling with Power Heterogeneous Multiprocessor Systems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 264Pablo Coll, Pablo Factorovich, Irene LoiseauA linear programming approach for a<strong>da</strong>ptive synchronization of traffic signals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 268List of Authors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 273ALIO–EURO <strong>2011</strong> – viii


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>8:45!"#$%&'(&$)*9:00+"%%$)*,-.!"##$%&'(%)'*(+,$%&'---9:30 9:3023"*$*#,+"%%$)*9:4545"*('6,7(58,9561A'260/=$%&+"%%$)*,-/.+/0)"1$%&'(%)'23'-+"%%$)*,-040/$+10'56"#$%&'+"%%$)*,1..#6+/(?#$+'@.+"%%$)*,1/!61"7%'80%9':'23+"%%$)*,10;


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Routing in Graphs with Applications to Logistics and TrafficRolf Möhring ∗∗ TU BerlinTraffic management and routing in logistic systems are optimization problem by nature. We want to utilize the available street or logisticnetwork in such a way that the total network “load” is minimized or the “throughput” is maximized. This lecture <strong>de</strong>als with the mathematicalaspects of these optimization problems from the viewpoint of network flow theory and scheduling. It leads to flow mo<strong>de</strong>ls in which–incontrast to static flows–the aspects of “time” and “congestion” play a crucial role.We illustrate these aspects on several applications:1. Traffic gui<strong>da</strong>nce in rush hour traffic (cooperation with ptv).2. Routing automated gui<strong>de</strong>d vehicles in container terminals (cooperation with HHLA).3. Ship Traffic Optimization for the Kiel Canal (cooperation with the German Fe<strong>de</strong>ral Water- ways and Shipping Administration).All these applications benefit from new insights into routing in graphs. In (1), it is a routing scheme that achieves traffic patterns thatare close to the system optimum but still respect certain fairness conditions, while in (2) it is a very fast real-time algorithm that avoidscollisions, <strong>de</strong>adlocks, and other conflicts already at route computation. Finally, (3) uses techniques from (2) and enhances them with specialpurpose scheduling algorithms.ALIO-EURO <strong>2011</strong> – 1


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Recent Developments in Optimization Methods for Scheduling ProblemsDebora P. Ronconi ∗∗ Department of Production Engineering, EP-USP, University of São PauloAv. Prof. Almei<strong>da</strong> Prado, 128, Ci<strong>da</strong><strong>de</strong> Universitária, 05508-900, São Paulo SP, Brazildronconi@usp.brIn this talk, the combinatorial optimization scheduling problem will be addressed. A few approaches of exact and heuristic nature <strong>de</strong>velopedfor different variants of scheduling problems will be <strong>de</strong>scribed to illustrate the vitality of the topic.Since the seminal paper by Johnson [4], scheduling problems have received significant attention, particularly in recent years with severalpublications each year. In general words, the scheduling problem consists of the allocation of resources to tasks over time, consi<strong>de</strong>ring thephysical restrictions of the process while optimizing one or more objectives. Resources can be machines in a workshop, processing units ina computing environment, runways at an airport, and so on; while tasks may be operations in a production process, landings at an airport, orexecutions of computer programs, just to name a few. A task may have a distinct due <strong>da</strong>te, priority or release <strong>da</strong>te. According to Baker [1],to classify the major scheduling mo<strong>de</strong>ls it is necessary to characterize the configuration of resources and the behavior of tasks. For instance,a mo<strong>de</strong>l may contain one resource type or several resource types. In addition, if the set of tasks available for scheduling does not changeover time, the system is called static, in contrast to cases in which new tasks arise over time, where the system is called dynamic. Generallyspeaking, the scheduling of jobs is a very complex problem due to its combinatorial nature and, amongst the combinatorial optimizationproblems, it can be classified as one of the most difficult problems. An overview of scheduling mo<strong>de</strong>ls can be found in [5].In most theoretical scheduling papers, simple measures of performance have been applied, such as, for example, the completion time ofthe last job on the last machine, known as makespan. In general, the consi<strong>de</strong>red criteria are regular, i.e. non<strong>de</strong>creasing with the completiontime. Among them, we can mention the total tardiness criterion, whose difficulty arises from the fact that tardiness is not a linear functionof completion time. On the other hand, scheduling problems involving not regular measures based on both earliness and tardiness costshave also been addressed in many recent studies. This type of problem became important with the advent of the just-in-time (JIT) concept,where early or tardy <strong>de</strong>liveries are highly discouraged. A practical example can be found in the chemical industry, where different productscan be ma<strong>de</strong> through the same process and must be mixed as close as possible to a given instant in time to prevent their <strong>de</strong>terioration.Comprehensive reviews can be found in [2] and [3].Due the good performance of optimization methods in several problems that appear in industrial settings, this talk will mainly focus on theapplication and <strong>de</strong>velopment of optimization methods for job-scheduling problems in different environments. Selected published papers,which comprise problems addressed by the speaker, will be <strong>de</strong>scribed.As the solution of practical mo<strong>de</strong>ls is now largely automated by the use of commercial software, we will initially discuss different mixedintegermo<strong>de</strong>ls that represent a useful scheduling environment: the flowshop problem with no storage constraints aiming to minimize thesum of earliness and tardiness of the jobs (see [8]). The formulation of combinatorial optimization problems such as mixed-integer mo<strong>de</strong>lsopens the possibility of applying different algorithms <strong>de</strong>veloped for general and specific problems. Since the pioneering work of RalphGomory in the late 1950s, integer programming is one of the fields in operational research that has ma<strong>de</strong> the most progress in the past fewyears. The most popular approaches are cutting planes and enumerations. Within the second approach, we can highlight the branch-andboun<strong>da</strong>lgorithm, which is basically a sophisticated way to perform an enumeration. With the purpose of illustrating the application ofthis technique to a scheduling problem, a lower bound which exploits properties of the flowshop problem with blocking will be presented(see [6, 7]). In this environment there are no buffers between successive machines, and, therefore, intermediate queues of jobs waiting inthe system for their next operations are not allowed. Some examples of blocking can be found in concrete block manufacturing, which doesnot allow stock in some stages of the manufacturing process.On the other hand, there are several combinatorial optimization problems that are difficult to solve through the use of methods that areguaranteed to provi<strong>de</strong> an optimal solution. In these cases, heuristic methods are typically used to quickly find solutions that are notnecessarily optimal solutions, but are good quality solutions anyway. Due the practical importance of objectives associated with due<strong>da</strong>tes, we will present heuristic approaches that focus on these performance measures. First, a constructive heuristic that explores specificcharacteristics of the flowshop problem with blocking will be presented [9]. In this case, performance is measured by the minimizationof the total tardiness of the jobs. Then a GRASP-based heuristic is proposed, coupled with a path relinking strategy to search for betteroutcomes. Next, the minimization of the mean absolute <strong>de</strong>viation from a common due <strong>da</strong>te in a two-machine flowshop scheduling problemwill be addressed [11].An online version of a single machine scheduling problem to minimize total tardiness will also be <strong>de</strong>scribed. In this problem, or<strong>de</strong>rs getto the system randomly. Jobs have to be scheduled without knowledge of what jobs will come afterwards. The processing times and thedue <strong>da</strong>tes become known when the or<strong>de</strong>r is placed. A customized approximate dynamic programming method will be presented for thisproblem [10]. This talk will also comment on new research initiatives un<strong>de</strong>r <strong>de</strong>velopment.ALIO-EURO <strong>2011</strong> – 2


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>References[1] K.R. Baker, Introduction to Sequencing and Scheduling, Addison-Wesley, John Wiley & Sons, New York, 1974.[2] K.R. Baker and G.D. Scud<strong>de</strong>r, Sequencing with earliness and tardiness penalties: A review, Operations Research 38, pp. 22–36, 1990.[3] V. Gordon, J.M. Proth and C. Chu, A survey of the state-of-art of common due <strong>da</strong>te assignment and scheduling research, EuropeanJournal of Operational Research 139, pp. 1–25, 2002.[4] S.M. Johnson, Scheduling in a two-machine flowshop for the minimization of the mean absolute <strong>de</strong>viation from a common due <strong>da</strong>te,Naval Research Logistics Quartely 1, pp. 61-67, 1954.[5] M. Pinedo, Scheduling: theory, algorithms, and systems, Prentice-Hall, New Jersey, 2008.[6] D.P. Ronconi, A Branch-and-Bound Algorithm to Minimize the Makespan in a Flowshop with Blocking, Annals of OperationsResearch 138, pp. 53-65, 2005.[7] D.P. Ronconi and V.A. Armentano, Lower Bounding Schemes for Flowshops with Blocking In-Process, Journal of the OperationalResearch Society 52, pp. 1289-1297, 2001.[8] D.P. Ronconi and E.G. Birgin, Mixed-integer programming mo<strong>de</strong>ls for flowshop scheduling problems minimizing the total earlinessand tardiness, in Just-in-Time Systems, Y.A. Ríos-Solís and R.Z. Ríos-Mercado (Eds.), Springer Series on Optimization and ItsApplications, P.M. Par<strong>da</strong>los and Ding-Zhu Du (Series eds.), <strong>2011</strong>, to appear.[9] D.P. Ronconi and L.S. Henriques, Some Heuristic Algorithms for Total Tardiness Minimization in a Flowshop with Blocking, Omega37, pp. 272-281, 2009.[10] D.P. Ronconi and W.B. Powell, Minimizing Total Tardiness in a Stochastic Single Machine Scheduling Problem using ApproximateDynamic Programming, Journal of Scheduling 13, pp. 597–607, 2010.[11] C.S. Sakuraba, D.P. Ronconi and F. Sourd, Scheduling in a two-machine flowshop for the minimization of the mean absolute <strong>de</strong>viationfrom a common due <strong>da</strong>te, Computers and Operations Research 36, pp. 60–72, 2009.ALIO-EURO <strong>2011</strong> – 3


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Spatial Forest OptimizationMiguel Constantino ∗∗ Centro <strong>de</strong> Investigação Operacional<strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> Ciências, Universi<strong>da</strong><strong>de</strong> <strong>de</strong> Lisboamiguel.constantino@fc.ul.ptSpatial Forest Optimization is concerned with the <strong>de</strong>sign of forest landscapes. Forest landscapes evolve along time un<strong>de</strong>r the action ofopposing forces. Vegetation growth is counterbalanced by natural hazards such as fire and pests, or through human intervention, such asharvesting. In managed forests usually the main objective is to maximize the value of timber harvested. However, other objectives can beconsi<strong>de</strong>red, such as soil preservation, aesthetic values, biodiversity and wildlife conservation. Landscapes can be intentionally modified inor<strong>de</strong>r to accomplish or help to achieve these goals. For mo<strong>de</strong>ling purposes, a forest landscape is a region in the plane, composed of a finitenumber of smaller management units. A finite horizon divi<strong>de</strong>d into periods may be consi<strong>de</strong>red. Main <strong>de</strong>cisions are, for each unit, either toharvest in some specific period or not harvesting at all. A set of contiguous units with similar characteristics in some time period is calle<strong>da</strong> patch of the forest. The aim of spatial forest optimization is to optimize an objective function while ensuring certain characteristics ofsome patches.In this talk we review a few combinatorial optimization problems that arise in the context of spatial forest optimization: One problem isthe so-called "harvest scheduling subject to maximum area restrictions"- large harvested patches are forbid<strong>de</strong>n, to prevent erosion and alsofor aesthetic reasons. Another one consists of selecting a "patch with a minimum required area." Such a patch may represent an old growthregion suitable for wildlife habitat. A related problem consists of selecting a (nearly) convex region in the landscape. We introduce asimplified version of this problem and show it can be solved in polynomial time.ALIO-EURO <strong>2011</strong> – 4


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>On Bilevel Programming and its Implications for Mixed Integer LinearProgrammingAndrea Lodi ∗∗ DEIS, Università di BolognaViale Risorgimento 2, 40136 Bologna, Italyandrea.lodi@unibo.itBilevel programming is a rich paradigm to express a variety of real-world applications including game theoretic and pricing ones. However,what we are interested in this talk is to discuss the bilevel nature of two of the most crucial ingredients of enumerative methods for solvingcombinatorial optimization problems, namely branching and cutting.Specifically, we discuss a new branching method for 0-1 programs called interdiction branching [3] that exploits the intrinsic bilevel natureof the problem of selecting a branching disjunction. The method is <strong>de</strong>signed to overcome the difficulties encountered in solving problemsfor which branching on variables is inherently weak. Unlike traditional methods, selection of the disjunction in interdiction branching takesinto account the best feasible solution found so far.On the cutting plane si<strong>de</strong>, we examine the nature of the so-called separation problem, which is that of generating a valid inequality violatedby a given real vector, usually arising as the solution to a relaxation of the original problem. We show that the problem of generating amaximally violated valid inequality often has a natural interpretation as a bilevel program [2]. In some cases, this bilevel program can beeasily reformulated as a single-level mathematical program, yielding a stan<strong>da</strong>rd mathematical programming formulation for the separationproblem. In other cases, no reformulation exists yielding surprisingly interesting examples of problems arising in the complexity hierarchiesintroduced by Jeroslow [1].References[1] R. Jeroslow, The polynomial hierarchy and a simple mo<strong>de</strong>l for competitive analysis, Mathematical Programming, 32:146–164, 1985.[2] A. Lodi, T.K. Ralphs, G. Woeginger, “Bilevel Programming and Maximally Violated Valid Inequalities", Technical Report OR/11/3,DEIS - Università di Bologna.[3] A. Lodi, T.K. Ralphs, F. Rossi, S. Smriglio, “Interdiction Branching”, Technical Report OR/09/10, DEIS - Università di Bologna.ALIO-EURO <strong>2011</strong> – 5


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Multi-Objective Evolutionary Algorithms for Reactive Power Planning inElectrical Distribution Systems: A Comparative Case StudyDulce Costa ∗ Carlos Henggeler Antunes † António Gomes Martins †∗ Department of Electrical Engineering, EST Setúbal, IPSCAMPUS do IPS 2910-761 Setúbal, Portugaldulce.costa@estsetubal.ips.pt† DEEC, University of CoimbraPolo II, 3030-290 Coimbra, Pólo II - Universi<strong>da</strong><strong>de</strong> <strong>de</strong> Coimbra, Portugal{ch, amartins}@<strong>de</strong>ec.uc.ptABSTRACTInstallation of capacitors in radial electrical distribution power systemsis a generalized practice used by the utilities mainly to reducepower losses, improve system stability, perform power factor correctionand get a better voltage profile. These benefits are associatedwith the ability of choosing the appropriate locations and capacityof the equipments to be installed. This problem has been extensivelyresearched over the past <strong>de</strong>ca<strong>de</strong>s. Nowa<strong>da</strong>ys more flexibleoptimization tools allow for the computation of solutions tomore realistic mo<strong>de</strong>ls. This exten<strong>de</strong>d abstract shows how Multi-Objective Evolutionary Algorithms (MOEAs) are a<strong>de</strong>quate toolsto tackle this problem and provi<strong>de</strong>s a comparative study betweensome distinct approaches. Some modifications are introduced intoan MOEA in or<strong>de</strong>r to tailor it to the characteristics of the multiobjectivemathematical mo<strong>de</strong>l.Keywords: Reactive power compensation, Quality of service, Multiobjectivemo<strong>de</strong>ls, Evolutionary algorithms1. INTRODUCTIONShunt capacitors installed in electrical distribution networks forreactive power compensation generate some positive effects, suchas increasing voltage level at the load point, improving voltageregulation when capacitor banks are properly switched, reducingactive and reactive power losses, improving system capacity by reducingcurrents, reducing the need of reinforcement by releasingsystem capacity. The importance of an a<strong>de</strong>quate reactive powerplanning is <strong>de</strong>finite, namely due to the growing utilization and <strong>de</strong>pen<strong>de</strong>ncyon electricity. The FERC report about the August 2003North American electrical blackout [1], conclu<strong>de</strong>d that poor voltageprofile and insufficient reactive planning were <strong>de</strong>cisive factorsto this inci<strong>de</strong>nt. In the mid-20th century these <strong>de</strong>vices were generallyinstalled at the head of electrical distribution systems. Severalmathematical mo<strong>de</strong>ls and algorithmic approaches have beenreported in the literature [2], and the Capacitor Subcommittee ofthe IEEE Transmission and Distribution Committee has publishedseveral bibliographies on this theme until 1980, [3, 4, 5, 6]. Theappearance of capacitors with smaller weight/capacity ratio enabled,from technical and economic perspectives, the allocationof compensation also along the fee<strong>de</strong>rs of distribution networks.Mainly in the 1990s new algorithms based on heuristic and metaheuristicsearch techniques started to be applied: specific heuristics[7, 8], Simulated Annealing [9, 10, 11], Tabu Search [12, 13],Genetic/Evolutionary Algorithms [14, 15, 16]. The problem of thereactive power planning can be stated as i<strong>de</strong>ntifying the best networklocations and the appropriate dimension of capacitors to beinstalled in or<strong>de</strong>r to achieve the network operator’s objectives subjectto technical, operational and budget constraints. Mathematicalmo<strong>de</strong>l for this problem are generally of combinatorial nature, involvingmultiple objective functions, real-valued and integer variables,and linear and non-linear relationships.2. MULTI-OBJECTIVE MATHEMATICAL MODELThe multi-objective mathematical mo<strong>de</strong>l has been formulated as anon-linear mixed integer problem consi<strong>de</strong>ring two objective functions:minimizing investment costs and minimizing active powerlosses. These objectives are conflicting and of distinct nature. Theconstraints comprise operational and quality restrictions: voltagelimits at each bus, impossibility to locate capacitor banks in someno<strong>de</strong>s, operational constrains due to the power flow in the systemand the need to supply the required load at each no<strong>de</strong>. The mainpurpose is to characterize a compensation scheme, which consistsof a set of capacitors banks to be located in selected network locations,in or<strong>de</strong>r to achieve a compromise between active powerlosses and investment costs while satisfying all constraints. A <strong>de</strong>tailed<strong>de</strong>scription of the mo<strong>de</strong>l objective functions, power flowequations (physical laws in electrical networks) and other constrainscan be found in [17].3. MULTI-OBJECTIVE EVOLUTIONARY ALGORITHMSEvolutionary Algorithms (EAs) have gained a growing importanceto tackle multi-objective mo<strong>de</strong>ls, particularly for hard combinatorialproblems, due to their capability of working with a populationof individuals (solutions). Since they <strong>de</strong>al with a population of solutionsand the aim is generally the characterization of a Pareto optimalfront, EAs endowed with techniques to maintain diversity ofsolutions present advantages with respect to the use of scalarizingfunctions as in traditional mathematical programming approaches.A Pareto optimal front can be i<strong>de</strong>ntified throughout the evolutionaryprocess, which hopefully converges to the true non-dominatedfront for the problem un<strong>de</strong>r study. It must be noticed that, in realworldproblems, this is, in general, a potential Pareto optimal front,classified as such because no other solutions dominating it could befound but no theoretical tools exist guaranteeing their true Paretooptimality. EAs can incorporate techniques aimed at guaranteeingthe diversity of the Pareto optimal front in or<strong>de</strong>r to display thetra<strong>de</strong>-offs between the conflicting objective functions in differentregions of the search space. These advantages of using EAs are notjust related with the computational effort required but also with thedifficulty of using mathematical programming algorithms in mostALIO-EURO <strong>2011</strong> – 6


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>high-dimensional combinatorial multi-objective problems.4. CASE STUDY AND RESULTSAn actual Portuguese electrical radial distribution network has beenused for a comparative case study. The network topology is displayedin 1. For more <strong>de</strong>tailed information on this network see[17]. This network is located in a rural area and has a particularcharacteristic: the voltage profile without compensation does notrespect the quality voltage limits, so the zero cost solution is notfeasible. Therefore, it is necessary to install capacitors to have feasiblesolutions with respect to the voltage profile constraint. Threewell known MOEA have been implemented: MOGA, SPEA andNSGA-II. Moreover, a local search scheme tailored for this problemhas been inclu<strong>de</strong>d in NSGA-II to make the most of the problemspecificities, namely regarding neighborhood exploration. Inthis local search scheme, a move leading to a neighbour solutionis <strong>de</strong>fined by changing the capacitor location in the network to aneighbour location, or the capacitor type corresponding to a capacityvalue. 2, 3, 5 and 4 display the set of initial solutions andthe Pareto Frontiers obtained with each algorithm. All MOEAconverge reasonably well to a set of dispersed non-dominated solutions.However, the front reached with the modified NSGA IItotally dominates the other fronts 6. This approach not only increasedthe number of solutions computed, but also improved themiddle front solutions and exten<strong>de</strong>d the Pareto front, achievingcompromise solutions with lower costs/higher losses, and highercosts/lower losses.Figure 3: Initial solutions and Pareto Frontier obtained with SPEA.Figure 4: Initial solutions and Pareto Frontier obtained with NSGAII.5. REFERENCESFigure 1: Portuguese radial electrical distribution network.Figure 2: Initial solutions and Pareto Frontier obtained withMOGA.[1] F. S. Report, “Principles for efficient and reliable reactivepower supply and consumption,” Docket No. AD05-1-000,Tech. Rep., 2005.[2] N. M. Neagle and D. R. Samson, “Loss reduction from capacitorsinstalled on primary fee<strong>de</strong>rs,” Transactions of theAmerican Institute of Electrical Engineers, Power Apparatusand Systems PAS, vol. Part III, no. PAS-75, pp. 950–959,1956.[3] I. C. Report, “Bibliography on power capacitors 1967-1970,”IEEE Transactions on Power Apparatus and Systems PAS,vol. PAS 91, no. 5, pp. 1750–1759, 1972.[4] ——, “Bibliography on power capacitors 1971-1974,” IEEETransactions on Power Apparatus and Systems PAS, vol. PAS97, no. 4, pp. 1124–1131, 1978.[5] ——, “Bibliography on power capacitors 1975-1980,” IEEETransactions on Power Apparatus and Systems PAS, vol. PAS102, no. 7, pp. 2331–2334, 1983.[6] I. V. M. W. G. Report, “Bibliography on reactive powerand voltage control,” IEEE Transactions on Power SystemsIEEETPS, vol. 2, no. 2, pp. 361–370, May 1987.[7] M. M. A. Salama and A. Y. Chikhani, “A simplified networkapproach to the var control problem for radial distributionsystems,” IEEE Transactions on Power Delivery IEEETPD,vol. 8, no. 3, pp. 1529–1535, 1993.[8] N. R. J. Shao and Y. Zhang, “A capacitor placement expertsystem,” International Journal of Engineering IntelligentSystems for Electrical Engineering and Communications,pp. 105–114, 1994.[9] Y.-L. C. C.-C. Liu, “Optimal multi-objective var planning usingan interactive satisfying method,” IEEE Transactions onPower Systems, vol. 10, no. 2, pp. 664–670, 1990.[10] H. Chiang, J. Wang, and O. Cockings, “Optimal capacitorplacements in distribution systems part i: A new formula-ALIO-EURO <strong>2011</strong> – 7


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Figure 5: Initial solutions and Pareto Frontier obtained with NSGAII with local search.Figure 6: Pareto Frontierstion and the overall problem,” IEEE Transactions on PowerDelivery, vol. 5, no. 2, pp. 634–642, 1990.[11] ——, “Optimal capacitor placements in distribution systemspart ii: Solution algorithms and numerical results,” IEEETransactions on Power Delivery, vol. 5, no. 2, pp. 643–649,1990.[12] Y.-C. H. H.-T. Y. C.-L. Huang, “Solving the capacitor placementproblem in a radial distribution system using tabusearch approach,” IEEE Transactions on Power Systems,vol. 11, no. 4, pp. 1868–1873, 1996.[13] A. G. M. Dulce F. Pires, C. Henggeler Antunes, “A multiobjectivemo<strong>de</strong>l for var planning in radial distribution networksbased on tabu search,” IEEE Transactions On PowerSystems, vol. 20, no. 2, pp. 1089–1094, May 2005.[14] K. Iba, “Reactive power optimization by genetic algorithm,”IEEE Transactions on Power Systems, vol. 9, no. 2, pp. 685–692, 1994.[15] G. Levitin, A. Kalyuhny, A. Shenkman, and M. Chertkov,“Optimal capacitor allocation in distribution systems using agenetic algorithm and a fast energy loss computation technique,”IEEE Transactions on Power Delivery, vol. 15, no. 2,pp. 623–628, 2000.[16] L. L. J.T. Ma, “Evolutionary programming approach to reactivepower planning,” IEE <strong>Proceedings</strong> - Generation Transmissionand Distribution, vol. 43, no. 4, pp. 365 – 370, July1996.[17] A. G. M. Dulce F. Pires, C. Henggeler Antunes, “An nsga-iiapproach with local search for a var planning multi-objectiveproblem,” Research Report 8/2009, INESC Coimbra, Tech.Rep., 2009.ALIO-EURO <strong>2011</strong> – 8


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>A new MIP based approach for Unit Commitment in power production planningAna Viana ∗ ‡ João Pedro Pedroso ∗ †∗ INESC PortoCampus <strong>da</strong> FEUP, Rua Dr. Roberto Frias 378, Porto, Portugalaviana@inescporto.pt‡ Polytechnic Institute of Engineering of PortoRua Dr. António Bernardino <strong>de</strong> Almei<strong>da</strong> 431, Porto, Portugalagv@isep.ipp.pt† <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> Ciêcias, Universi<strong>da</strong><strong>de</strong> do PortoRua do Campo Alegre, 4169-007 Porto, Portugaljpp@dcc.fc.up.ptABSTRACTThis paper presents a new iterative algorithm for optimising thermalunit commitment in power generation planning. The approach,based on a mixed-integer formulation of the problem, consi<strong>de</strong>rsa piecewise linear approximation of the fuel cost function that isdynamically up<strong>da</strong>ted to better reflect problem requirements, convergingto the optimal solution.After thorough computational tests in a broad set of instances, itshowed to be flexible, capable of easily incorporating differentproblem constraints, and to be able to solve large size problems.Keywords: Unit Commitment, Approximation Algorithms, Scheduling1. INTRODUCTIONThe Unit Commitment problem (UCP) is the problem of <strong>de</strong>cidingwhich power generator units must be committed/<strong>de</strong>committedover a planning horizon (lasting from 1 <strong>da</strong>y to 2 weeks, and generallysplit in periods of 1 hour), and the production levels at whichthey must be operating (Pre-Dispatch), so that a given objective isoptimised. The committed units must generally satisfy the forecastedsystem load and reserve requirements, subject to a large setof other system, technological and environmental constraints.This is a topic of major practical relevance because the effectivenessof the schedules obtained has a strong economical impact inany power generation company. Due to that and to its complexity,it has received consi<strong>de</strong>rable research attention and, after several<strong>de</strong>ca<strong>de</strong>s of intensive study, is still a rich and challenging topic ofresearch.Proposed optimisation techniques for Unit Commitment encompassvery different paradigms, ranging from exact approaches andLagrangian Relaxation to some rule of thumb or very elaborateheuristics and metaheuristics. The combinatorial nature of theproblem and its multi-period characteristics prevented exact approachesfrom being successfully used in practice: they resultedin very inefficient algorithms that were only capable of solvingsmall size instances of no practical interest. Heuristic techniques,as those based in Priority Lists, were also not very successful asthey ten<strong>de</strong>d to lead to low quality solutions. Concerning metaheuristics,they had a very promising behaviour when they firststarted being explored. The quality of the results was better thanthe ones achieved by well established techniques, and good solutionswere obtained very quickly.Some drawbacks can however be pointed out when metaheuristicsgo into play. One major drawback, if one consi<strong>de</strong>rs that the ultimategoal is to <strong>de</strong>sign techniques that can be accepted and usedby a company, is the <strong>de</strong>pen<strong>de</strong>nce of these techniques on parametertuning. Tuning the parameters is a time consuming and somehowcomplex procedure that requires <strong>de</strong>ep knowledge on the algorithmimplemented. Furthermore, it is vital for good algorithm performance.A second drawback has to do with the lack of informationthis techniques provi<strong>de</strong> in terms of solution quality (i.e. how farit is from the optimal solution). Some proposals have been ma<strong>de</strong>to soften the referred drawbacks; but this is still an open line ofresearch.Currently, the dramatic increase in efficiency of mixed-integer programming(MIP) solvers requests for a thorough exploitation oftheir capabilities. Some research has been directed towards the<strong>de</strong>finition of alternative, more efficient, mixed-integer linear programming(MILP) formulations of the problem e.g. [1, 2]. Extensivesurveys on different optimisation techniques and mo<strong>de</strong>llingissues are provi<strong>de</strong>d by e.g. [3, 4].This paper presents a new MILP approach to the UCP that furtherexplores this line of research. Instead of consi<strong>de</strong>ring a quadraticrepresentation of the fuel cost, we consi<strong>de</strong>r a piecewise linear approximationof that function and, in an iterative process up<strong>da</strong>te, itby including additional pieces. The function up<strong>da</strong>te is based in thesolutions obtained in the previous iterations.The approach was tested in a well known set of instances from theliterature and showed to be flexible, capable of easily incorporatingdifferent problem constraints, and of solving large size problems.2. PROBLEM DESCRIPTIONDifferent mo<strong>de</strong>lling alternatives, reflecting different problem issueshave been published: they consi<strong>de</strong>r fuel, multiarea and emissionconstraints (e.g. [5, 6, 7]) and, more recently, security constraints[8] and market related aspects [9].The <strong>de</strong>centralised management of production brought up new issuesto the area [10], in some markets the problem being nowreduced to single-unit optimisation. However, for several <strong>de</strong>centralisedmarkets the traditional problem is still very similar to thatof centralised markets [1, 2]. The main difference is the objectivefunction that, rather than minimising production costs, aims atmaximising total welfare. Therefore, the techniques that apply fora centralised management of the production, will also be effectiveALIO-EURO <strong>2011</strong> – 9


at solving many <strong>de</strong>centralised market production problems.In this paper we will consi<strong>de</strong>r the centralised UC mo<strong>de</strong>l. The objectiveis to minimise total production costs the rea<strong>de</strong>r overisaddressed givento planning[11].horizon. They are expressed as the sum of fuel costs (quadraticfunctions that <strong>de</strong>pend on the production level of each unit) andstart-up costs. Start-up costs are represented by constants that <strong>de</strong>pendon the last period the unit was operating; two constants are<strong>de</strong>fined: one for hot start-up costs, that is consi<strong>de</strong>red when theunit has been off for a number of periods smaller or equal to aotherwise).given value; and another for cold start-up costs, consi<strong>de</strong>red otherwise.The following constraints will be inclu<strong>de</strong>d in the ciPF(P it)=formulation:system power balance <strong>de</strong>mand, system reserve requirements,it 2unit initial conditions, unit minimum up and down times, generationlimits and ramp constraints. For a mathematical formulationthe rea<strong>de</strong>r is addressed to [11].3. MIP APPROACH AND COMPUTATIONAL RESULTSas can be seen in Figure 1.The approach consi<strong>de</strong>rs a piecewise linear approximation of thequadratic fuel cost function (see Equation (1)). P it are <strong>de</strong>cisionvariables that represent the production level of unit i in period t;a i , b i and c i are fuel cost parameters for unit i (measured in $/h,$/MWh and $/MW 2 h, respectively). There are binary variablesy it that indicate the state of unit i in period t (0 if unit is off, 1otherwise).{ci PF(P it ) = it 2 + b iP it + a i if y it = 10 otherwiseThe main contribution of this paper concerns a linearisation of thiscost function. As it is convex, if we find a straight line tangent toit, and constrain the cost to be greater than the value of the straightline, we have a lower approximation of the cost. The process <strong>de</strong>visedhere is to dynamically find straight lines, at points whosecost is being un<strong>de</strong>restimated, and add them to a set; we then imposethat the cost of a any production level p must be greater thanthe maximum of those straight lines, evaluated at p.For the sake of clarity, let us remove the indices i,t i<strong>de</strong>ntifying thegenerator. For any generator and any period, we start by approximatingits cost by means of two straight lines: one going through(P min ,F(P min )), and another going through (P max ,F(P max )), as canbe seen in Figure 1.After solving the problem with this approximation, we obtain aproduction level for this unit of, say, p. The operating cost at thispoint will be un<strong>de</strong>restimated by the value of the highest of thestraight lines at p; in Figure 1, the value F. In or<strong>de</strong>r to exclu<strong>de</strong> thispoint from the feasible region, we add another straight line to ourset; the line tangent to the quadratic function, evaluated at p, asrepresented in blue in Figure 2. As we add more and more straightlines, we are converging to an exact approximation of the true cost– Workshop on Applied function, Combinatorial asOptimization, can be seen Porto, Portugal, in Figure May 4 -26, for <strong>2011</strong> another possible value p ′ .ed aspects [9].gement of production brought up newome markets the problem being nowisation. However, for several <strong>de</strong>cennalproblem is still very similar to that]. The main difference is the objecnminimising production costs, aims atherefore, the techniques that apply forf the production, will also be effectiveed market production problems.nsi<strong>de</strong>r the centralised UC mo<strong>de</strong>l. Theal production costs over a given planesse<strong>da</strong>s the sum of fuel costs (quadratice production level of each unit) ands are represented by constants that <strong>de</strong>unitwas operating; two constants arep costs, that is consi<strong>de</strong>red when theber of periods smaller or equal to ar cold start-up costs, consi<strong>de</strong>red otheraintswill be inclu<strong>de</strong>d in the formula<strong>de</strong>mand,system reserve requirements,minimum up and down times, generaaints.For a mathematical formulation1].CostFPminpPmaxPowerFigure 1: Initial approximation of the cost function by two straightlines, going through the minimum and maximum operating powerof the unit. If the current production level for this unit is p, its cost(in this iteration) will be approximated by FFigure 1: Initial approximation of the cost function by two straightlines, going through the minimum and maximum operating powerof the unit. If the current production level for this unit is p, its cost(in this iteration) will be approximated by FCostmaximising total welfare. Therefore, the techniques that apply fora centralised management of the production, will also be effectiveat solving many <strong>de</strong>centralised market production problems.In this paper we will consi<strong>de</strong>r the centralised UC mo<strong>de</strong>l. Theobjective is to minimise total production costs over a given planninghorizon. They are expressed as the sum of fuel costs (quadraticfunctions that <strong>de</strong>pend on the production level of each unit) andProc. of the VII ALIO–EURO – Workshop start-up on Applied costs. Start-up Combinatorial costs are represented by Optimization, constants that <strong>de</strong>-Porto, Portugal, May 4–6, <strong>2011</strong>pend on the last period the unit was operating; two constants are<strong>de</strong>fined: one for hot start-up costs, that is consi<strong>de</strong>red when theunit has been off for a number of periods smaller or equal to agiven value; and another for cold start-up costs, consi<strong>de</strong>red otherwise.The following constraints will be inclu<strong>de</strong>d in the formulation:system power balance <strong>de</strong>mand, system reserve requirements,unit initial conditions, unit minimum up and down times, generationlimits and ramp constraints. For a mathematical formulation3. MIP APPROACH AND COMPUTATIONAL RESULTSThe approach consi<strong>de</strong>rs a piecewise linear approximation of thequadratic fuel cost function (see Equation (1)). P it are <strong>de</strong>cisionvariables that represent the production level of unit i in period t;a i,b i and c i are fuel cost parameters for unit i (measured in $/h,$/MWh and $/MW 2 h, respectively). There are binary variablesy it that indicate the state of unit i in period t (0 if unit is off, 1+ biPit + ai if yit = 1previous(1)iteration.0 otherwise3.1. Algorithm <strong>de</strong>scriptionThe main contribution of this paper concerns a linearisationof this cost function. As it is convex, if we find a straight linetangent to it, and constrain the cost to be greater than the valueof the straight line, we have a lower approximation of the cost.The process <strong>de</strong>vised here is to dynamically find straight lines, atpoints whose cost is being un<strong>de</strong>restimated, and add them to a set;we then impose that the cost of a any production level p must begreater than the maximum of those straight lines, evaluated at p.For the sake of clarity, let us remove the indices i,t i<strong>de</strong>ntifyingthe generator. For any generator and any period, we start byapproximating its cost by means of two straight lines: one goingthrough (P min,F(P min)), and another going through (P max,F(P max)),After solving the problem with this approximation, we obtaina production level for this unit of, say, p. The operating cost atthis point will be un<strong>de</strong>restimated by the value of the highest of thestraight lines at p; in Figure 1, the value F. In or<strong>de</strong>r to exclu<strong>de</strong> thispoint from the feasible region, we add another straight line to ourset; the line tangent to the quadratic function, evaluated at p, asrepresented in blue in Figure 2. As we add more and more straightlines, we are converging to an exact approximation of the true costfunction, as can be seen in Figure 2 for another possible value p .(1)FPminpPmaxPowerFigure 1: Initial approximation of the cost function by two straightlines, going through the minimum and maximum operating powerof the unit. If the current production level for this unit is p, its cost(in this iteration) will be approximated by FCostF'FPminpp'PmaxPowerFigure 2: Approximation of the cost function by the maximum ofthree straight lines, after obtaining production at level p on theprevious iteration.Figure 2: Approximation of the cost function by the maximum ofthree straight lines, after obtaining production at level p on theInitially, for each unit, the corresponding quadratic fuel cost functionis approximated by two linear functions. Thereafter, morestraight lines are iteratively ad<strong>de</strong>d into a set, until having one iterationwith all production levels being correctly evaluated, up to anacceptable error.Let N be a set of integers i<strong>de</strong>ntifying the power at which newtangents to the true cost are ad<strong>de</strong>d; initially P = {P min,P max}.At a given iteration, if the production level obtained in the MILPsolution was p , we add this point to P, except if there is a p ∈P : |p − p| < ε.In the MILP solved at each iteration, we add the constraints(making sure that they are only observed if the corresponding unitis switched on at the period consi<strong>de</strong>red)3.1. Algorithm <strong>de</strong>scriptionInitially, for each unit, the corresponding quadratic fuel cost functionis approximated by two linear functions. Thereafter, morestraight lines are iteratively ad<strong>de</strong>d into a set, until having one iterationwith all production levels being correctly evaluated, up to anacceptable error.F ≥ α in + β in(p − p n) for n = 1,...,|P|,Let N be a set of integers i<strong>de</strong>ntifying the power at which newwhere p and F instantiated to the actual producing levels Ptangents to theitand costs true F it of cost a givenare unit, at ad<strong>de</strong>d; a given period. initially For a givenP unit, = the{P min ,P max }.At a given iteration,constants of theifstraightthe productionlines are obtainedlevel by:obtained in the MILPsolution was p ′ , we add this α in = point c i p 2 n + b i pto n + P, a i except if there is a p ∈β in = 2c i p n + bP : |p ′ − p| < ε.iIn the MILP solved at each iteration, we add the constraints (makingsure ALIO/EURO-2 that they are only observed if the corresponding unit isswitched on at the period consi<strong>de</strong>red)F ≥ α in + β in (p − p n ) for n = 1,...,|P|,where p and F are instantiated to the actual producing levels P itand costs F it of a given unit, at a given period. For a given unit, theconstants of the straight lines are obtained by:α in = c i p 2 n + b i p n + a iβ in = 2c i p n + b iIn our implementation, we have set ε = 1; this allows an excellentapproximation of the quadratic function in all the instances used(actually, we could observe no difference at all).3.2. Computational resultsThe algorithm was tested in two sets of problems: one withoutramp constraints but that has for long been a reference when comparingUC algorithms [12]; another where ramp constraints areinclu<strong>de</strong>d. CPU times were obtained with CPLEX 12.1, on a computerwith a Quad-Core Intel Xeon processor at 2.66 GHz, runningMac OS X 10.6.6; only one core was assigned to this experiment.Tables 1 and 2 present the results obtained with the algorithm proposedin this paper for different sets of instances. Problems P1 toP6, in Table 1, do not inclu<strong>de</strong> ramp constraints. Those constraintsare consi<strong>de</strong>red in problems R1 to R6 (Table 2). Problems R1 toR6, resulting from problems P1 to P6, set ramp up and down maximumvalues to the minimum production level of each unit. Allproblems consi<strong>de</strong>r a 24h planning horizon and the number of unitsranges from 10 to 100.Table 3 presents results reported in the literature for instances P1to P6. Although the objective function value reported in this paper(565 828) for the 10 unit problem using the approximation algorithmis different from the one reported in other papers (565 825),the actual solution is the same. Small differences in values arejustified by possible rounding of values by other authors.D COMPUTATIONAL RESULTSF'FALIO-EURO <strong>2011</strong> – 10iecewise linear approximation of the(see Equation (1)). P it are <strong>de</strong>cisionproduction level of unit i in period t;Pminpp'PmaxPower


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>In Tables 1 and 2, column Quadr provi<strong>de</strong>s the optimal result forthe base problem and column Lin the result obtained by the approximation.Columns CPU L and CPU Q refer to the time spent (inseconds) to solve the quadratic problem and to reach convergencefor the linear problem, respectively.Prob. Size Lin CPU L Quad CPU QP1 10 565 828 0.33 565 828 1.95P2 20 1 126 000 7.46 1 126 000 241.P3 40 2 248 280 134. 2 248 280 22716.P4 60 3 368 950 2639.P5 80 4 492 170 192966.P6 100 5 612 690 157742.Table 1: Results for problems P1 to P6. Attempts to solve theproblem with the quadratic formulation were not successful forinstances with more than 50 units.As far as the authors know, no optimal results had ever been establishedfor problems P1 to P6, even for the smallest ones. Wenow show that for problems up to 40 units optimal results can beobtained by highly efficient MIP solvers. Furthermore, the effectivenessand efficiency of the approach proposed in this paper arereflected in the values of columns Lin and CPU L , respectively. Forproblems up to 40 units the iterative approach is able to reach theoptimal solution with dramatical cuts in CPU times, when comparedto direct solution with the quadratic solver of CPLEX. Forproblems of bigger size, good lower bounds on the optimal resultare also reachable as can be conclu<strong>de</strong>d by comparing those valueswith the best published values for the quadratic problem (seeTable 3).Similar conclusions may be taken for the ramp problem. Thequadratic solver of CPLEX was capable of reaching optimal solutionsfor instances of up to 20 units. Optimal values for the sameset of problems were also reached by the approximation algorithm,that was capable of solving instances of up to 80 units.Prob. Size Lin CPU L Quad CPU QR1 10 573 570 0.94 573 570 2.00R2 20 1 144 450 258. 1 144 450 147.17R3 40 2 284 670 12084.R4 60 3 424 310 1830.R5 80 4 565 420 41907.R6 100Table 2: Results for problems R1 to R6. Attempts to solve theproblem with the quadratic formulation were not successful for instanceswith more than 20 units. With the linearisation algorithm,limiting CPU to 200000 seconds, allowed solution of instanceswith up to 80 units.4. CONCLUSIONS AND FURTHER DEVELOPMENTSThe main contribution of this paper is a method for approximatingthe quadratic cost of electricity generating units, with an iterativemethod that converges to the exact solution.Computational analysis shows that for problems without ramps themethod is capable of reaching the quadratic optimal result wheneverit is known, within much less computational time. For largerinstances, where the quadratic problem optimal is not known, themethod also provi<strong>de</strong>s high quality lower bounds for the results.The paper also establishes optimal results for small size instancesshowing that currently, state-of-the-art MIP solvers can solve tooptimality problems that were not solvable before.Prob. Size LR [12] GA [12] LR–MA [13]P1 10 565 825 565 825 565 827P2 20 1 130 660 1 126 243 1 127 254P3 40 2 258 503 2 251 911 2 249 589P4 60 3 394 066 3 376 625 3 370 595P5 80 4 526 022 4 504 933 4 494 214P6 100 5 657 277 5 627 437 5 616 314ICGA [14] GRASP [11] CON [15]P1 10 566 404 565 825 565 825P2 20 1 127 244 1 126 805 1 126 070P3 40 2 254 123 2 255 416 2 248 490P4 60 3 378 108 3 383 184 3 370 530P5 80 4 498 943 4 524 207 4 494 140P6 100 5 630 838 5 668 870 5 615 410Table 3: Previous results for problems P1 to P6.Similar conclusions can be taken when ramp constraints are mo<strong>de</strong>lled.The method is also capable of reaching quadratic optimal results(now with extra computational time). Furthermore, for problemswith more than 20 units where quadratic optimal solutionswere not obtained, the approximate method was still effective.As future work the authors plan to inclu<strong>de</strong> additional features inthe algorithm to make it more efficient for very large size problems.5. ACKNOWLEDGEMENTSFinancial support for this work was provi<strong>de</strong>d by the PortugueseFoun<strong>da</strong>tion for Science and Technology (un<strong>de</strong>r Project PTDC/EGE-GES/099120/2008) through the “Programa Operacional TemáticoFactores <strong>de</strong> Competitivi<strong>da</strong><strong>de</strong> (COMPETE)” of the “Quadro Comunitário<strong>de</strong> Apoio III”, partially fun<strong>de</strong>d by FEDER.6. REFERENCES[1] M. Carrio and J. Arroyo, “A computationally efficient mixedintegerlinear formulation for the thermal unit commitmentproblem,” IEEE Transactions in Power Systems, vol. 21,no. 3, pp. 1371–1378, 2006.[2] A. Frangioni, C. Gentile, and F. Lacalandra, “Tighter approximatedmilp formulations for unit commitment problems,”Power Systems, IEEE Transactions on, vol. 24, no. 1, pp.105 –113, Feb. 2009.[3] N. Padhy, “Unit commitment – a bibliographical survey,”IEEE Transactions in Power Systems, vol. 19, no. 2, pp.1196–1205, 2004.[4] H. Yamin, “Review on methods of generation scheduling inelectric power systems,” Electric Power Systems Research,vol. 69, pp. 227–248, 2004.[5] F. Lee, “A fuel constrained unit commitment method,” IEEETransactions on Power Systems, vol. 4, pp. 1208–1218, 1989.[6] Z. Ouyang and S. Shahi<strong>de</strong>hpour, “Heuristic multi-area unitcommitment with economic dispatch,” IEE <strong>Proceedings</strong> – C,vol. 138, pp. 242–252, 1991.[7] D. Srinivasan and A. Tettamanzi, “An evolutionary algorithmfor evaluation of emission compliance options in view of theclean air act amendments,” IEEE Transactions on Power Systems,vol. 12, no. 1, pp. 336–341, 1997.[8] Y. Fu and M. Shahi<strong>de</strong>hpour, “Fast scuc for large-scale powersystems,” Power Systems, IEEE Transactions on, vol. 22,no. 4, pp. 2144 –2151, Nov. 2007.ALIO-EURO <strong>2011</strong> – 11


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>[9] J. Xu and R. Christie, “Decentralised unit commitmentin competitive energy markets,” in The Next Generationof Electric Power Unit Commitment Mo<strong>de</strong>ls, B. Hobbs,M. Rothkopf, R. O’Neill, and H. Chao, Eds. Kluwer Aca<strong>de</strong>micPublishers, 2001, pp. 293–315.[10] B. Hobbs, M. Rothkopf, R. O’Neill, and H. Chao, Eds., TheNext Generation of Electric Power Unit Commitment Mo<strong>de</strong>ls.Kluwer Aca<strong>de</strong>mic Publishers, 2001.[11] A. Viana, J. Sousa, and M. Matos, “Using GRASP to solvethe unit commitment problem,” Annals of Operations Research,vol. 120, no. 1, pp. 117–132, 2003.[12] S. Kazarlis, A. Bakirtzis, and V. Petridis, “A Genetic Algorithmsolution to the unit commitment problem,” IEEETransactions on Power Systems, vol. 11, pp. 83–92, 1996.[13] J. Valenzuela and A. Smith, “A see<strong>de</strong>d memetic algorithmfor large unit commitment problems,” Journal of Heuristics,vol. 8, no. 2, pp. 173–195, 2002.[14] I. G. Damousis, A. Bakirtzis, and P. Dokopoulos, “A solutionto the unit commitment problem using integer-co<strong>de</strong>d geneticalgorithm,” IEEE Transactions on Power Systems, vol. 19,pp. 1165–1172, 2004.[15] A. Viana, J. Sousa, and M. Matos, “Fast solutions for UCproblems by a new metaheuristic approach,” Electric PowerSystems Research, vol. 78, pp. 1385–1395, 20087.ALIO-EURO <strong>2011</strong> – 12


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Dispatch Hydroelectric Power Plant using Genetic AlgorithmJessica Pillon Torralba Fernan<strong>de</strong>s ∗Paulo <strong>de</strong> Barros Correia ∗∗ Department of Energy, Faculty of Mechanical Engineering, University of Campinas - UNICAMPCampinas, Brazil{pillon, pcorreia}@fem.unicamp.brABSTRACTThis paper presents an optimization mo<strong>de</strong>l for <strong>da</strong>ily operation ofMiddle Sao Francisco River hydroelectric system in Brazil. Thestudy consi<strong>de</strong>rs eight hydroelectric power plants – Sobradinho,Luiz Gonzaga, Apolonio Sales, Paulo Afonso I, II, III, IV e Xingo– witch belongs to the Sao Francisco Hydroelectric Company. Itsobjective is to maximize the hydroelectric power plant efficiencyand, simultaneously, to minimize the number of startups and shutdownsof generating units. The technique of resolution is ma<strong>de</strong>in two steps: Step 1 <strong>de</strong>termines the load allocated in each hydroelectricpower plant at each per hour and Step 2 <strong>de</strong>fines thenumber of generating units in operation and the load of particularpower plant. The mathematical formulation is non-linear mixedinteger programs and solved with a Genetic Algorithm (GA) approach,and Linear Programming . This mo<strong>de</strong>l was implementedwith two computation programs, one a commercial optimizationsolver, and a in house GA solver co<strong>de</strong>d with a programming languageof four generation. One of programs was used as interface,while the fourth generation, the optimization mo<strong>de</strong>l was implemented.Keywords: Linear and non-linear optimization, Multiobjectiveoptimization, Hydroeletric system, Generating units, Genetic algorithm1. INTRODUCTIONthe operation of a hydroelectric power plant. The mo<strong>de</strong>l consists oftwo algorithms based on GA. The first algorithm is used to allocatethe generating units and aims to maximize the efficiency of powerplant at each time interval. The second step aims to maximize efficiencyand minimize the number of startups and shutdowns ofgenerating units.The dispatch mo<strong>de</strong>l proposed by [3], and [4], was divi<strong>de</strong>d into twosubproblems called Dispatch of Units (DU) and Dispatch Generation(DG). DG was solved via Lagrangean Relaxation and DUwas used with Genetic Algorithms. This methodology was appliedto actual case study of the hydroelectric power plants systemof Paranapanema in Brazil.2. PHYSICAL ASPECTSIt is important that the physical aspects of generating units must bemore <strong>de</strong>tailed in the dispatch, such as operational restriction andthe operating characteristics (for example their efficiencies), wherecosts and goals are more important.• Unit efficienciesGeneration unit efficiency <strong>de</strong>pends on three variables: waterhead of the plant, water discharge and eletric power ofthe unit. The hill is a three-dimensional curve that plots efficiencyas a fuction of the water head of the plant and theeletric power of unit, as shown in Figure 1.Several objectives are adopted for the dispatch mo<strong>de</strong>ls of generatingunits in hydroelectric power plants. Generally, the problemof maximizing the efficiency of the Brazilian hydroelectric plantshas as the main objective a mo<strong>de</strong>l for the Optimal Load Dispatch(DOC). The DOC resolves the load allocation problem of the hydroelectricplants and it can be implemented as an EvolutionaryComputation problem, specifically with Genetic Algorithm. It alsoallows calculating the global efficiency of the plants when the operatingconditions, the hills curves and operatives restrictions areknown.According to [1], the efficiency of generating units is the main factorinfluencing the performance of generation of electricity in a hydroelectricpower plant . The operation planning of generation systemscovers the long, medium and short term. This article focuseson the short-term operation. The short-term programming requiresa more <strong>de</strong>tailed mathematical representation of the operatives restrictionsand it is <strong>de</strong>termined the curve of a generation plant, andthen, the units are chosen to be dispatched. Thus, this paper proposesan optimization mo<strong>de</strong>l of the Sao Francisco’s hydroelectricplants <strong>da</strong>ily operation. Its objective is to maximize the plant’s efficiencyand minimize the number of startups and shutdowns ofthe generating units simultaneously. The literature presents a significantnumber of works that relate the problem of dispatch withdifferent approaches that vary according to the applicability of thesame.[2] proposed a mo<strong>de</strong>l of multiobjective optimal dispatch forFigure 1: Hill curve of a real hydroelectric power plant.• DemandThe load of the plant is <strong>de</strong>termined by long- and mid-termplanning. A short-term scheduling mo<strong>de</strong>l estimates the plant’s<strong>da</strong>ily load curve. The Figure 2 shows a typical load curveof one <strong>da</strong>y. In this case, the <strong>de</strong>mand to be met by powerplants of Middle Sao Francisco river.• Startups and Shutdowns of generating unitsIn some studies the costs of startups and shutdowns of thegenerating units have a great importance, since it <strong>de</strong>creasesALIO-EURO <strong>2011</strong> – 13


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>they can be applied to problems with discontinuities, which arevery common in dispatch problems.4. PROBLEM DESCRIPTION4.1. The Sao Francisco riverFigure 2: Typical <strong>da</strong>ily load curve.the life of units and increases the maintenance of windingsand mechanical equipament them.A study presented by [5] showed how startups affect thecost of short term hydro operation and how these costs affectshort term scheduling strategies of power producingcompanies in Swe<strong>de</strong>n. Overall, the study points to an approximatevalue of 3US$/MW.• Plant Production FactorPower output in a hydroelectric plant per unit turbine flow.It varies according to plant gross head, and is expressed inMW/m3/s. For purposes of illustration, the Figure 3 showsthe productivity of a specific plant from Brazil.The Sao Francisco is a river in Brazil. With a length of 3200kilometres, the Sao Francisco originates in the Canastra mountainrange in the central-western part of the state of Minas Gerais andtraverses the states of Minas Gerais (MG), Bahia (BA), Pernambuco(PE), Sergipe (SE) and Alagoas (AL).Casca<strong>de</strong> Middle Sao Francisco River is formed by uses of theHPPs Sobradinho, Luiz Gonzaga, Apolônio Sales (Moxotó), PauloAfonso I, II, III, IV and Xingó. These HPPs are the core of thesystem producing electric power from the Northeast, CompanhiaHidro Eletrica do Sao Francisco (CHESF). The Figure 4 showsthe location of the Middle Sao Francisco in Brazil, along with theHPPs.Figure 3: Plant Production Factor.3. GENETIC ALGORITHMMath and computational techniques have been <strong>de</strong>veloped for <strong>de</strong>ca<strong>de</strong>swith the principles of Darwin’s evolution theory, <strong>de</strong>fining whatis known as Evolutionary Computation. Insi<strong>de</strong> its branches, GeneticAlgorithms (GA) are the most used [6]. GA were <strong>de</strong>velopedby Holland [7], who analyzed the phenomena of the process of naturalselection of species and the genetic selection of races. Eachindividual in the GA is a coding of a possible solution of a problem.This encoding can be binary or real.The first step towards its implementation is the generation of aninitial population, that for most problems is randomly generated.However, <strong>de</strong>pending on the application forms, the individuals canbe selected heuristically to compose a more favorable population[8]. GA use some genetic operators like crossover and mutation,and these operators are applied to generate new solutions insi<strong>de</strong> afeasible set of solutions.Also, the operators are randomized to provi<strong>de</strong> diversities in theoverall population seeking global optimal solutions. The advantageof GA is that its use does need differentiable functions, soFigure 4: System of the Middle Sao Francisco with him HPPs locatedin Brazil.The Figure 5 illustrates the HPPs Casca<strong>de</strong> Middle Sao Francisco.Figure 5: Casca<strong>de</strong> Middle Sao Francisco river in Brazil.4.2. Mathematical FormulationThe problem presented is solved in two steps, as follows Diagram6.The dispatch is <strong>de</strong>scribed by Equations 1 to 9ALIO-EURO <strong>2011</strong> – 14


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>USBULGUSQUXGikjQ ip i, jxi0UHEρ iη iGM iy i,k, jd jM ik i, jHPP SobradinhoHPP Luiz Gonzaga (Itaparica)HPP Paulo Afonso IVHPP XingóPower plant in<strong>de</strong>xGenerating unit in<strong>de</strong>xTime period in<strong>de</strong>xAvarage flow of that the HPP i must keepduring the <strong>da</strong>yPower generated by the HPP i in period jReservoir level of the HPP i in the lastperiod of the previous <strong>da</strong>ySet of power plants UHE = {USB,ULG,USQ,UAS}Plant Production Factor function of the HPP iEfficiency function of the power plant iGeneration of HPP UPA e UASSet of UG of the power plant iIndicates if the UG k of the power plant iin period j is dispatchedDemand of the four power plants UHE in period jSet of UG of the power plant iIndicates if the UG k of the power plant iin period j is dispatchedp mini, j (k i, j ) Minimum power to k i, j UGp maxi, j (k i, j ) Maximum power to k i, jG Generation of HPP UPA and UASTable 1: Variables used in the mathematical formulation.Figure 6: Diagram of the proposed problem.24Max ∑ ∑ ∑ η i (p i,k, j )y i,k, j (1)j=1 i∈UHE k∈M i24 ∣ ∣Min ∑ ∑ ∑ ∣yi,k, j − y i,k, j−1 (2)j=2 i∈UHE k∈M is.a.∑∑p i,k, j = d j − G (3)i∈UHE k∈M i24p USB, j∑ ∑j=1 k∈M iρ USB (xUSB 0 , p USB,k, j) = 24Q USB (4)24p ULG, j∑ ∑j=1 k∈M iρ ULG (xULG 0 , p ULG,k, j) = 24Q ULG (5)24p USQ, j∑ ∑j=1 k∈M iρ USQ (xUSQ 0 , p USQ,k, j) = 24Q USQ (6)24p UXG, j∑ ∑j=1 k∈M iρ UXG (xUXG 0 , p UXG,k, j) = 24Q UXG (7)p mini,k, j y i,k, j ≤ p i,k, j ≤ p maxi,k, j y i,k, j (8)y i,k, j ∈ {0,1} (9)for i ∈ UHE = {USB,ULG,USQ,UXG}, k = {1,...,n} and j =1,...,24, whereThis problem has a multiobjective character because its objectivefunctions 1 and 2 seek to maximize productivity and minimize thenumber of startups and shutdowns, respectively.Equations 4 to 7 represent the <strong>da</strong>ily average for each mill. Thevariable k i, j indicates whether unit i is dispatched (k i, j = 1) or notdispatched (k i, j = 0).5. METHODOLGYThe problem above is solved in two steps, as Figure 7. The Step 1Figure 7: Illustration of the problem.<strong>de</strong>termines how much each power plant must generate at each timeinterval. It provi<strong>de</strong>s an initial solution which takes into accountthe service and vi<strong>de</strong>o-streaming market averages per hydroelectricpower plant.From this initial solution, the Step 2 <strong>de</strong>termines the number ofunits in operation and the load of a particular plant. This last stepis divi<strong>de</strong>d into two phases which are solved iteratively until convergence.ALIO-EURO <strong>2011</strong> – 15


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>5.1. Step 1This Step 1 solves a simplified problem given below, which doesnot <strong>de</strong>ci<strong>de</strong> on the number of machines in operation.24Min ∑ ∑j=1 i∈UHEs.a.p i, jρ i(10)∑ p i, j = d j − G (11)i∈UHE24 p i, j∑ = 24Q USB (12)j=1 ρ USB24 p ULG, j∑ = 24Q ULG (13)j=1 ρ ULG24 p USQ, j∑ = 24Q USQ (14)j=1 ρ USQ24 p UXG, j∑ = 24Q UXG (15)j=1 ρ UXGp mini, j (1) ≤ p i, j ≤ p maxi, j (n i, j ) (16)for i ∈ UHE = {USB,ULG,USQ,UXG} and j = 1,...,24.5.2. Step 2s.a.∑k∈M ip k, j = d j − G (24)24 n p j∑ ∑j=1 k=1 ρ(x 0 = 24Q (25), p k, j )p mink, j y k, j ≤ p k, j ≤ p maxk, j y k, j (26)y k, j ∈ {0,1} (27)for i ∈ UHE = {USB,ULG,USQ,UXG}, k ∈ M i and j = 1,...,24.To the Step 2, was one chosen HPPs Sobradinho and Paulo AfonsoIV to be the study of case.6. RESULTSIt was consi<strong>de</strong>red a <strong>da</strong>ily horizon with a half-hour discretizationcontaining all the HPPs in casca<strong>de</strong>, according to the schedule <strong>da</strong>taheld on September 10, 2007. The <strong>da</strong>ily load curve to be atten<strong>de</strong>dby the casca<strong>de</strong>, the initial state of the reservoirs and expected inflowsfor each <strong>da</strong>y, were the available <strong>da</strong>ta provi<strong>de</strong>d by CHESF.The Step 1 produced a graph that shows the result in terms of generatingfor each HPPs of casca<strong>de</strong>, shown in Figure 8. Basically, allthe HPPs followed the curve of charge and its ranged according toher keeping the levels of its reservoirs within the allowed limit.Due to its mixed character, the problem in this step is <strong>de</strong>composedinto two phases, iteratively solved until convergence. Both phasesare resolved by the GA techniques.5.2.1. Phase 1The dispatch problem formulation in this phase is <strong>de</strong>scribed by thefollowing objective function and constraints, with time j and HPPi are fixed.∑∑Max η i (p i,k, j )y i,k, j (17)i∈UHE k∈M is.a.∑∑p i,k, j = d j − G (18)i∈UHE k∈M ip i, j∑k∈M iρ i (xi 0, p i,k, j) = 24Q i (19)p mini,k, j y i,k, j ≤ p i,k, j ≤ p maxi,k, j y i,k, j (20)y i,k, j ∈ {0,1} (21)for i ∈ UHE = {USB,ULG,USQ,UXG}, k ∈ M i e t = 1,...,24.5.2.2. Phase 2The dispatch problem formulation in the second Phase is <strong>de</strong>scribedby the following objective function and constraints, with HPP ifixed.24 nMax ∑ ∑ η i (p k, j )y k, j (22)j=1 k=124 n∣Min ∑ ∑∣ yk, j − y k, j−1 (23)j=2 k=1Figure 8: Generation of casca<strong>de</strong> and HPPs.In Step 2 obtained the graphics of generation and centrifugation forHPPs Sobradinho and Paulo Afonso IV, also indicating the limitsof maximum and minimum generation, shown in Figures 9 and 10.7. CONCLUSIONSThis paper approached the dispatch problem by a mathematicalmo<strong>de</strong>l that maximizes the energy efficiency of power plant takinginto account the operational restrictions translated in terms ofreservoir levels, the swallowing of the turbines, the goal of generationand vi<strong>de</strong>o-streaming of the HPP.The genetic algorithm is a powerful optimization tool that has beenused very often in solving similar problems in the proposed work.The efficiency of its use in simulation of this work showed an appropriatediscovery of dispatch. The result achieved with its usewas a great diversity of solutions with startups and shutdowns differentthat the best solution will be found <strong>de</strong>pending on the priorityof the problem.The applicability of this mo<strong>de</strong>l can be used for optimization ofother HPPs in casca<strong>de</strong>.ALIO-EURO <strong>2011</strong> – 16


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>8. REFERENCESFigure 9: Generation and Centrifugation for Sobradinho.[1] C. T. Salmazo, “Mo<strong>de</strong>lo <strong>de</strong> otimizacao eletro-energergico <strong>de</strong>curto prazo (pre-<strong>de</strong>spacho) aplicado ao sistema copel,” Master’sthesis, <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> <strong>Engenharia</strong> Eletrica e <strong>de</strong> Computacao,Universi<strong>da</strong><strong>de</strong> Estadual <strong>de</strong> Campinas, 1997.[2] G. Conalgo and P. Barros, “Multiobjective dispatch of hydrogeneratingunits using a two-step genetic algorithm method,”IEEE Congress on Evolutionary Computation, pp. 2554 –2560, 2009.[3] E. F. D. Santos, “Um mo<strong>de</strong>lo <strong>de</strong> pre-<strong>de</strong>spacho em usinashidreletricas usando algoritmos geneticos,” Master’s thesis,<strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> <strong>Engenharia</strong> Eletrica e Computacao, Universi<strong>da</strong><strong>de</strong>Estadual <strong>de</strong> Campinas, 2001.[4] A. S. A. Encina, “Despacho otimo <strong>de</strong> uni<strong>da</strong><strong>de</strong>s geradoras emsistemas hidreletricos via heuristica basea<strong>da</strong> em relaxacao lagrangeanae programacao dinamica,” Ph.D. dissertation, <strong>Facul<strong>da</strong><strong>de</strong></strong><strong>de</strong> <strong>Engenharia</strong> Eletrica e <strong>de</strong> Computacao, Universi<strong>da</strong><strong>de</strong>Estadual <strong>de</strong> Campinas, 2006.[5] O. Nilsson and D. Sjelvgren, “Hydro unit start-up costs andtheir impact on the short term scheduling strategies of swedishpower producers,” IEEE Transactions on Power Systems,vol. 12, pp. 38 – 43, 1997.[6] Z. Michalewicz, Genetic Algorithms + Data Structures = EvolutionPrograms, 3, Ed. Sringer, 1996.[7] H. J. Holland, A<strong>da</strong>ptation in Natural and Artificial Systems.The University of Michigan Press, 1975.[8] E. G. M. Lacer<strong>da</strong> and A. C. P. L. Carvalho, Introducao aosAlgoritmos Geneticos. Universi<strong>da</strong><strong>de</strong> Fe<strong>de</strong>ral do Rio Gran<strong>de</strong>do Sul, 1999.Figure 10: Generation and Centrifugation for Paulo Afonso IV.ALIO-EURO <strong>2011</strong> – 17


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Algebraic Group Theory driven Divi<strong>de</strong> and Evolve of multi-objective ProblemsNail El-Sourani ∗Markus Borschbach ∗∗ Chair of Optimized Systems, University of Applied SciencesFHDW, Haupstrasse 2, D-51465 Bergisch Gladbach{nail.el-sourani, markus.borschbach}@fhdw.<strong>de</strong>ABSTRACTMost real world problems remain as a multi-objective solutionspace. To overcome the well known computational complexity ofsuch problems, the divi<strong>de</strong> and evolve is a feasible solution, if thesub-problems remain solvable. This paper envisions a road-map,when and how to apply algebraic group theory structures into amulti stage evolutionary approach. It solves certain combinationsof objectives from group stage to group stage in a nested groupstructure, until the reference problem at hand even reaches the distinctsolution of the problem. Further, the quality of the solution,i.e. the overall number of steps to reach the solution results in alow number of steps (albeit not the lowest possible). Performanceand integrity of this approach are consequently verified.Keywords: Group theory, Divi<strong>de</strong> and evolve, Evolution strategy,Discrete optimization1. INTRODUCTIONThe universe of combinatorial optimization problems is a quite diversespace of problems. Evolutionary solutions for so far infeasiblecomplexity spaces provi<strong>de</strong> an opportunity if an algebraic grouptheory based structure can be i<strong>de</strong>ntified. The Rubik’s Cube is introduce<strong>da</strong>s a reference and benchmark problem to fulfill an integrityand performance profile of a consequently applied algebraic grouptheory driven divi<strong>de</strong> and evolve approach. The main task is to fin<strong>da</strong> structure of subgroups which, when transformed for applicationas fitness function(s) in an evolutionary approach, enable an overallmulti-objective optimization problem - previously non-solvableor only with high computational cost - to be solved in reasonabletime. The problem at hand, introduced and formalized in this paper,is multi-objective in the sense that a scrambled Cube has to besolved (first objective) using a preferably small number of moves(second objective).On a general level, a group-theoretic structure has to be found,which divi<strong>de</strong>s the infeasible problem domain into solvable tasks,represented by algebraic groups. The phase-transition of solutionsfrom one group to the following one is realized by specific fitnessfunctions for each group-transition. Each transition itself solves apartly multi-objective subproblem with varying, subgroup-inducedprime objectives. Making use of the nested group structure guaranteesa steady improvement of individuals and promotes a stablepopulation towards the end of each evolution phase. Each groupinduces a combination of constraints which remain fulfilled andsubsequently add up until the final group-transition.Large population sizes and the presented evolutionary phase-transitionmechanic increase individual diversity to ensure efficient transitionsfrom group to group and finally the overall unique solution.This remains different from the general combinatorial optimizationtask which, in general, <strong>de</strong>fines an equal number of solutions.In the reference problem however, the sequences of moves foundfor group-transitions remain non-<strong>de</strong>terministic and therefore different.The overall solution is a single unique point in the searchspace. By <strong>de</strong>riving a statistical analysis of the search space, a simulationonset based on an integrity verification is provi<strong>de</strong>d. Accordingly,all computationally feasible states up to a certain complexityhave been generated. The presented approach has been approvedupon this onset and further a random selection of more complexpoints of the search space to ensure a solution from every pointof the search space (including the known most complex). In thecase of this reference problem, each solution in the search space isevaluated by the exact and shortest solution known so far.2. DIVIDE AND CONQUER THE RUBIK’S CUBE2.1. Structure and NotationThe classic 3 3 Rubik’s Cube is wi<strong>de</strong>ly known and the one subjectto this paper. It consists of 26 pieces: 8 corner pieces, 12 edgepieces and 6 center pieces, distributed equally on the six si<strong>de</strong>sof the Cube. Each si<strong>de</strong> of the Cube will be called face, each 2-dimensional square on a face will be referred to as facelet.FFigure 1: Classic 3 3 Rubik’s Cube, effect of CW turn of front face.Corners, edges and centers are all cubies - representing the physicalobject. A corner shows 3 facelets, an edge 2 and a center1. Each si<strong>de</strong> of the Rubik’s Cube can be rotated clockwise (CW)and counterclockwise (CCW). Every such single move changesthe position of 4 edges and 4 corners - note that the center faceleton every of the Cube’s faces always stays in the same position (seeFigure 1). Thus, the color of a solved face is always <strong>de</strong>termined byits center color. For each edge and corner it is of great importanceto distinguish between position and orientation: i.e. an edge canbe in its right position (<strong>de</strong>fined by the two adjacent center colors)but in the wrong orientation (flipped).There are several known notations [11] for applying single moveson the Rubik’s Cube. We will use F,R,U,B,L,D to <strong>de</strong>note a clockwisequarter-turn of the front, right, up, back, left, down faceand Fi,Ri,Ui,Bi,Li,Di for a counterclockwise quarter-turn. Everysuch turn is a single move. In Cube related research half-turns(F2,R2,U2,B2,L2,D2) are also counted as single move, we willdo so as well. This notation is <strong>de</strong>pen<strong>de</strong>nt on the users viewpoint tothe cube rather than the center facelets’ colors.ALIO-EURO <strong>2011</strong> – 18


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>2.2. Algebraic CharacteristicsA group G is a set together with multiplication and i<strong>de</strong>ntity e (eg =g), inverse (gg −1 = g −1 g = e) and an associative law. A subgroupH < G is a subset H that is closed un<strong>de</strong>r group operations. S ⊆ G,written G =< S > is a generator of G if any element of G can bewritten as a product of elements of S and their inverses. The or<strong>de</strong>rof the group is the number of elements in it, |G|.All possible states of a Rubik’s Cube are <strong>de</strong>scribed by the groupgenerated by its applicable moves G C =< F,R,U,B,L,D >, alsocalled the Cube Group (|G C | = 4.3 · 10 19 ). All configurations ofthe Rubik’s Cube can be reached by using combinations of singlemoves in this group, thus the single moves generate G C . Further,there is always a neutral element, i.e. F · FFFF = FFFFF = Fand F 4 = 1 (also showing the or<strong>de</strong>r of each generator in G C is 4)and an inverse: Fi · F = 1 and Fi = FFFGiven a group G and a subgroup H < G, a coset of H is the setHg = hg : h ∈ H; thus, H < G partitions G into cosets. The set ofall cosets is written HG.Let H =< L,R,F,B,U2,D2 > be a subgroup of G C , representinga Cube where only the edge positions matter, as no edge orientationscan be altered. Thus, HG C <strong>de</strong>picts the left coset spacewhich contains all possibly attainable states when only flippingedge cubies (changing an edges orientation). For exten<strong>de</strong>d explanationrefer to [6], [3].2.3. Related WorkSolving the Rubik’s Cube is a challenging task. Both the size ofthe solution space induced by the number of attainable states andmultiple <strong>de</strong>sirable si<strong>de</strong>-objectives next to restoring the Cube (favorablyin the smallest possible number of moves and lowest calculationcomplexity) make this an interesting optimization problem.Although invented in 1974, the number of moves required tosolve any state of Rubik’s Cube (the so-called God’s Number) hasjust recently been found to be 20 [12].Various algorithms were <strong>de</strong>vised to <strong>de</strong>crease the upper bound. However,all those approaches are strictly exact methods and the mostrecent ones rely on terabytes of pre-calculated lookup-tables. Thisis reflected in the research road-map of lowest upper bounds byRokicki [12] to finally prove it to be 20. This number was attainedby applying the same method he had used earlier for pushing theupper bound to 26, 25 and then 23 moves - using the very same algorithmonly on more powerful hardware and a longer calculationtime [11], [12].Evolutionary Algorithms have been successfully applied in a varietyof fields, especially highly complex optimization problems [2],[9], [14]. Oftentimes, superior solutions - as compared to classicalalgorithms have been achieved - notably in multi-objective cases(for example multi-constraint knapsack problems [5]). This givesrise to the i<strong>de</strong>a of applying Evolutionary Algorithms to the Rubik’sCube problem. All relevant approaches are based on dividingthe solution space of the Rubik’s Cube into mathematical groups,starting with Thistlethwaite using 4 [13], then Reid combining twoof Thistlethwaite’s groups resulting in total of 3 [10] and finallyKociemba’s [8] and Rokicki’s approach using 2 subgroups. Thismakes the group theoretic approach a reasonable starting point for<strong>de</strong>signing Evolutionary Algorithms. It is of particular interest to usto <strong>de</strong>termine how such an EA can solve the Cube without relyingon extensive lookup-tables. Only a few evolutionary approaches<strong>de</strong>dicated to solve the Rubik’s Cube exist. In 1994 Herdy <strong>de</strong>vise<strong>da</strong> method which successfully solves the Cube [7] using pre-<strong>de</strong>finedsequences as mutation operators that only alter few cubies, resultingin very long solutions. Another approach by Castella could notbe verified due to a lack of documentation. Recently Borschbachand Grelle [1] <strong>de</strong>vised a 3-stage Genetic Algorithm based on acommon human “SpeedCubing” [11] method, first transformingthe Cube into a 2x2x3 solved state, then into a subgroup whereit can be completed using only two adjacent faces (two-generatorgroup).2.4. Rubik’s Cube as an IndividualThe Rubik’s Cube is represented using 6 2D matrices containingvalues from 1 to 6, each representing one color. Every quarter- andhalf-turn can be applied to this representation, yielding a total of18 different single moves while still leaving the Cube’s integrityintact. Thus, mutation is easily realized by not modifying a singlefacelet’s color but applying a sequence of moves to the Cube.This guarantees that the Cube’s integrity stays intact at all timesand makes a separate integrity test superfluous. Every individualremembers the mutations it has un<strong>de</strong>rgone, i.e. a list of moves thathave been applied. To keep this list as small as possible, redun<strong>da</strong>ntmoves are automatically removed. For example an individual thathas been mutated with F and is then mutated with FRRiB will onlyremember the optimized sequence F · FRRiB = F2B, preventingredun<strong>da</strong>ncy. Essentially, this is realized via a while-loop, eliminatingredun<strong>da</strong>nt moves in each pass until no further optimizationscan be ma<strong>de</strong>: e.g. F2BBiR2R2F is optimized to Fi by first removingBBi, then removing R2R2 and finally transforming F2F intoFi.3. FITNESS FUNCTION BASED ON ALGEBRAICGROUPS3.1. Divi<strong>de</strong> and ConquerTranslating the classic Thistlethwaite Algorithm [13] into an appropriateFitness Function for an Evolutionary Algorithm essentiallyforces the <strong>de</strong>sign of four distinct subfunctions. As each subgroupof G 0 has different constraints, custom methods to satisfythese constraints are proposed. The groups provi<strong>de</strong>d by Thistlethwate[13] are: G 0 =< F,R,U,B,L,D >, G 1 =< F,U,B,D,R2,L2 >, G 2 =< U,D,R2,L2,F2,B2 >, G 3 =< F2,R2,U2,B2,L2,D2 >, G 4 = I.Obviously, G 0 = G C . The functional principle of Thistlethwaite’sAlgorithm is to put the Cube into a state where it can be solved byonly using moves from G i which again has to be achieved by onlyusing moves from G i−1 for i = 1,...4, thus named nested groups.This provi<strong>de</strong>s the basis of the presented divi<strong>de</strong> and conquer ESapproach.As we use randomly generated mutation sequences (albeit<strong>de</strong>pen<strong>de</strong>nt of the current fitness phase/group in the final version),first attempts while working in the whole of the group G Cwould consistently fail to solve due to the very high or<strong>de</strong>r of |G C |- and thus the solution space.The divi<strong>de</strong> and conquer ES-approach however evolves a transitionsequence for an individual in the current coset space G i+1 G ito the next one (i = i + 1). These coset spaces, each <strong>de</strong>scribinga reduced form of the 3 3 Rubik’s Cube puzzle, induce differentkinds of constraints. This directly results in the total number ofattainable states being reduced by using only moves from somesubgroup G i+1 . The exact or<strong>de</strong>rs for each group are calculated exemplaryfor G 1 G 0 (complete calculations are found in [3], [4]):The first coset space G 1 G 0 contains all Cube states, where theedge orientation does not matter. This is due to the impossibilityof flipping edge cubies when only using moves from G 1 . As thereare 2 11 possible edge orientations,|G 1 G 0 | = 2 11 = 2048 (1)ALIO-EURO <strong>2011</strong> – 19


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>the or<strong>de</strong>r of |G 1 | is|G 1 | ≡ |G 0||G 1 G 0 | = 2.11 · 1016 . (2)3.2. Algebraic group-based Fitness CalculationG 0 → G 1To reach G 1 from any scrambled Cube, we have to orient all edgepieces right while ignoring their position. The fitness functionfor this phase simply increases the variable phase 0 by 2 for eachwrong oriented edge. Furthermore, we add the number of movesthat have already been applied to the particular individual in or<strong>de</strong>rto promote shorter solutions, yielding a multi-objective optimizationproblem. Finally, we adjust the weight between w (number ofwrong oriented edges) and c (number of moves applied to currentCube individual). This will be done similarly in all subsequentphases.phase 0 = 5 · (2w) + c (3)With a total of 12 edges which can all have the wrong orientationthis gives max{2w} = 24. The Cube has been successfully put intoG 1 when phase 0 = c. Reaching G 1 is fairly easy to accomplish,thus making the weight-factor 5 a good choice.G 1 → G 2In or<strong>de</strong>r to fulfill G 2 the 8 corners have to be oriented correctly.Edges that belong in the middle layer get transferred there. Testswith the Thistlethwaite ES showed it somewhat problematic to dothis in one step. Oftentimes, the algorithm would get stuck in localoptima. To solve this, the process of transferring a Cube from G 1to G 2 has been divi<strong>de</strong>d into two parts. First, edges that belonginto the middle layer are transferred there. Second, the corners areoriented the right way. The first part is fairly easy and the fitnessfunction is similar to that from phase 0 except for w (number ofwrong positioned edges), i.e. edges that should be in the middlelayer but are not.phase 1 = 5 · (2w) + c (4)In the second part, for each wrong positioned corner, 4 penaltypoints are assigned as they are more complex to correct than edges.Obviously, in or<strong>de</strong>r to put the Cube from G 1 to G 2 both phases<strong>de</strong>scribed here have to be fulfilled, which yields:phase 2 = 10 · (4v) + phase 1 (5)where v represents the number of wrong oriented corners. Theweighing factor is increased from 5 to 10 to promote a successfultransformation into G 2 over a short sequence of moves.G 2 → G 3We now have to put the remaining 8 edges in their correct orbit.The same is done for the 8 corners which also need to be alignedthe right way. Thus, the colors of two adjacent corners in onecircuit have to match on two faces. In G 3 the Cube will only haveopposite colors on each face. Let x (number of wrong coloredfacelets) and y (number of wrong aligned corners), thenphase 3 = 5 · (x + 2 · y) + c . (6)G 3 → G 4 (solved)The Cube can now be solved by only using half-turns. For thefitness function we simply count wrong colored facelets. Let z bethe number of wrong colored facelets, thenphase 4 = 5 · z + c . (7)To summarize, 5 different fitness functions are nee<strong>de</strong>d for the ThistlethwaiteES. phase i is solved if phase i = c, i = 0,...,4 and withthe properties of nested groups we can conclu<strong>de</strong>, given the above,a solved Cube implies:4∑ phase i = c . (8)0Fulfilling the above equation satisfies the constraints induced bythe groups G 0 ,...,G 4 , with the final fitness value c <strong>de</strong>scribingthe final solution sequence length. The weight factors chosen arebased on consecutive testing throughout <strong>de</strong>velopment. The ratio<strong>de</strong>pends on the size of the nested groups. Finding optimal weightspresents a separate optimization problem and may be subject tofuture work.4. REMARKS ON SELECTION, GROUPS AND DIVIDEAND CONQUERIn the specific case of the Rubik’s Cube, the unsolvable completesolution space of |G C | = 4.3 · 10 19 using non-restricted, randomlygenerated mutation sequences consisting of single moves, spawnedthe i<strong>de</strong>a of dividing the problem into smaller subproblems. Thei<strong>de</strong>a itself however is not exclusive to this application.The general problem in this type of situation is to find a consistentdivi<strong>de</strong> and conquer strategy, equivalent to the original problem.However, oftentimes many problems already provi<strong>de</strong> such inform of classical, non-ES algorithms. With this work we intend toshow how such existing divi<strong>de</strong> and conquer concepts can be use<strong>da</strong>nd transformed into heuristics suitable for a<strong>da</strong>ption into fitnessfunctions to enable quick and efficient <strong>de</strong>ployment of divi<strong>de</strong> andconquer EAs. Next, it is necessary to provi<strong>de</strong> suitable mutation operatorsand selection methods. Mutation operators in our case arestill randomly generated only adhering to the single moves provi<strong>de</strong>dby the current subgroup, which again <strong>de</strong>pends on the currentfitness phase. However, this only needs a minor tweak from theoriginal i<strong>de</strong>a, removing some entries from the list of single movesthat can be randomly chosen from.Finding an appropriate selection function for efficient EA <strong>de</strong>signin large solution spaces is a far more challenging and, at times, creativeprocess. Even more so when building a divi<strong>de</strong> and conquerEA where essentially each phase proves to be a single, classicalES-loop and the input (starting population) of the current loop is tobe the solution provi<strong>de</strong>d by the previous one. A first version of ourRubik’s Cube ES for example would evolve until one individualfulfilling the current fitness phase had been found to form the startingpopulation of the subsequent phase by duplication. However,in problems where there exist more than one solution, typicallymulti-dimensional solutions in multi-objective optimization, mostoften one of these dimensions outweighs the others in importance.In the present two-dimensional Rubik’s Cube example objectivedimensions are distance_to_phase_solve (variables v,w,x,y,z inequations (3) - (7)) and current_sequence_length (variable c inequations (3),(4),(6),(7),(8)) - where distance_to_phase_solve isthe primary, to be fulfilled un<strong>de</strong>r all circumstances.This property can be exploited in scenarios where the alreadysmaller solution spaces acquired by divi<strong>de</strong> and conquer are stilllarge. Key is to provi<strong>de</strong> subsequent ES-loops with a high diversityof individuals which fulfill at least the prime objective (e.g.distance_to_phase_solve but may - or even should - differ in theother (e.g. current_sequence_length). Even if some individualswith non-optimal, even relatively bad secon<strong>da</strong>ry objective values,form part of the starting population for the subsequent ES loop- the gain in diversity provi<strong>de</strong>s new search paths in the solutionspace and ultimately increases overall ES efficiency. Using atypicallylarge µ and λ further helps to increase diversity.In our exemplary ES for solving the Rubik’s Cube these mechanicshave been applied as follows. After some solution to a phase hasALIO-EURO <strong>2011</strong> – 20


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>been found, the ES does not immediately start calculation of thenext group-transition (which would take only this one individual asbasis for further calculation) but continues evolution until at leastµ different individuals have been found to form the start populationfor the next phase. To further increase diversity we used large(µ,λ) = (1000,50000).5. BENCHMARKS AND CONCLUSIONSTo provi<strong>de</strong> a brief performance overview 100 random scramblesof minimum length 10 and maximum length 50 were generate<strong>da</strong>nd and solved in 5 repetitions. Solution lengths and calculationtime are of particular interest to us. The test was conductedwith the TWES using (µ,λ) = (1000,50000), weighing factors(5,5,5,5,5), mutation lengths (5,5,13,15,17) and maximum generationsbefore reset (250).avg. Run 1 Run 2 Run 3 Run 4 Run 5Generations 95.72 100.63 92.71 99.66 92.22Moves 50.67 50.32 50.87 50.23 49.46Time(s) 321.78 381.68 393.99 312.98 287.93Table 1: Solutions of 100 random scrambles, 5 repetitions,Thistlethwaite ES.As seen in Table 1, the solution sequences hit an average of about50 single moves, further <strong>de</strong>monstrating a consistent performancethroughout the repetitions. Most scrambles are solved in 35-45moves, outliers are responsible for the higher average count. Extensiveadditional benchmarks can be found in [3].The benchmarks are promising, yielding comparable results to theclassic TWA. Outliers calculated by TWES provi<strong>de</strong> both significantlyshorter and longer solutions. This is most probably dueto inter-group <strong>de</strong>pen<strong>de</strong>ncies and future focus lies on increasingour TWES’ ten<strong>de</strong>ncy to such shorter results. Instead of obtainingstatic solutions dictated by the lookup-table used in the classicTWA, the dynamic evolution process enables those shorter solutionsequences not previously possible.Regarding the Rubik’s Cube optimization problem, our evolutionaryapproach is evi<strong>de</strong>ntly competitive with the exact method ita<strong>de</strong>pts. As this was the first such attempt - based on the first grouptheoretic exact approach using lookup-tables (Thistlethwaite) - futurework promises further improvement. This algorithm onlysolves the classic 3 3 Rubik’s Cube, just as the exact method it isbased on, does. However, our modular EA can also be used tosolve higher dimensional Rubik’s Cubes by appropriately substitutingthe current fitness functions.The next <strong>de</strong>velopmental step will a<strong>de</strong>pt approaches that reducethe number of subgroups to 3 and then 2, potentially yielding furtherimprovement in solution sequence length. Conveniently, ourimplementation already provi<strong>de</strong>s such possibilities for extensions,enabling quick testing of different subgroup combinations6. REFERENCES[1] M. Borschbach, C. Grelle, S. Hauke, “Divi<strong>de</strong> and EvolveDriven by Human Strategies. Simulated Evolution andLearning (SEAL),” pp. 369-373. LNCS 6457, Springer(2010)[2] W. Boyzejko, M. Wo<strong>de</strong>cki, “A Hybrid Evolutionary Algorithmfor some Discrete Optimization Problems,” In: <strong>Proceedings</strong>of the 5th International Conference on IntelligentSystems Design and Applications, pp. 326–331. IEEE ComputerSociety, Washington (2005)[3] N. El-Sourani, “Design and Benchmark of different EvolutionaryApproaches to Solve the Rubik’s Cube as a DiscreteOptimization Problem,” Diploma Thesis, WWU Muenster,Germany (2009)[4] N. El-Sourani, S. Hauke, M. Borschbach, “An EvolutionaryApproach for Solving the Rubik’s Cube Incorporating ExactMethods. Applications of Evolutionary Computations.” pp.80-90. LNCS 6024, Springer (2010)[5] K. Florios, G. Mavrotas, D. Diakoulaki, “Solving multiobjective,Multiconstraint Knapsack Problems Using MathematicalProgramming and Evolutionary Algorithms,” EuropeanJournal of Operational Research 203, 14–21 (2009)[6] A. Frey, D. Singmaster, “Handbook of Cubic Math.” Enslow,Hillsi<strong>de</strong> (1982)[7] M. Herdy, G. Patone, ‘Evolution Strategy in Action,” 10 ES-Demonstrations. Technical Report, International Conferenceon Evolutionary Computation (1994)[8] H, Kociemba, “Cube Explorer,” http://kociemba.org/Cube.htm[9] H. Muehlenbein, T. Mahnig, “FDA - A Scalable EvolutionaryAlgorithm for the Optimization of Additively DecomposedFunctions,” Evol. Comput. 7, 353–376 (1999)[10] M. Reid, “Cube Lovers Mailing List,” http://www.math.rwth-aachen.<strong>de</strong>/~Martin.Schoenert/Cube-Lovers/In<strong>de</strong>x_by_Author.html[11] T. Rokicki, “Twenty-Five Moves Suffice for Rubik’s Cube,”http://Cubezzz.homelinux.org/drupal/?q=no<strong>de</strong>/view/121[12] T. Rokicki, http://cube20.org[13] M.B. Thistlethwaite, “The 45-52 Move Strategy,” LondonCL VIII (1981)[14] E. Zitzler, “Evolutionary Algorithms for multi-objective Optimization:Methods and Applications,” Penn State (1999)ALIO-EURO <strong>2011</strong> – 21


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Multi-objective Evolutionary Course TimetablingA. L. Márquez ∗ C. Gil ∗ R. Baños ∗ A. Fernán<strong>de</strong>z ∗∗ University of AlmeríaCarretera <strong>de</strong> Sacramento S/N, La Caña<strong>da</strong> <strong>de</strong> San Urbano, 04120 Almería{almarquez, cgilm, rbanos, af<strong>de</strong>zmolina}@ual.esABSTRACTMulti-Objective Evolutionary Algorithms (MOEAs) are highly flexibleprocedures capable of producing a set of optimal compromisesolutions called Pareto Front. These solutions represent the bestvalues that can be obtained for each objective without reducingthe optimality of the other objectives of the solution. Taking thisinto account, timetabling problems that are usually <strong>de</strong>alt with aweighted sum of penalization functions can be consi<strong>de</strong>red a multiobjectiveproblem. This paper presents a study of the use of differentMOEAs to solve several instances of a particular type oftimetabling problems called Course TimeTabling (CTT).Keywords: Multi-objective, Timetabling, MOEA1. INTRODUCTIONCourse Timetabling problems consist of the weekly planning oflectures for a set of courses. There are many formulations for thisproblem, which differ greatly, especially when they consi<strong>de</strong>r howto <strong>de</strong>al with the hard and soft constraints imposed by the problem<strong>de</strong>finition. The hard constraints must be completely satisfied,while the soft constraints are consi<strong>de</strong>red penalizations that have tobe optimized. Among the techniques used to solve this problemare Evolutionary Algorithms [1, 2], or meta-heuristics [3] such asprocedures based on Tabu Search [4] or Simulated Annealing [5].A more complete study on different timetabling problems can befound in [6], discussing several kinds of timetabling problems anddifferent methods that could be used to solve them.A timetable is a set of encounters organized in time. An encounteris a combination of resources (rooms, people or equipment), someof which can be specified by the problem while others must beorganized as part of the solution. It has long been known thattimetabling is an NP-complete problem [7], which means that thereis no known method to solve it in a reasonable amount of time.It is usually consi<strong>de</strong>red that the solution to be found (whether withan evolutionary algorithm, tabu search, simulated annealing, orany other technique) is a weighted sum of the values of the problemobjectives (the soft constraints), effectively turning the probleminto a single-objective one. On the other hand, a Pareto Frontbasedmultiobjective approximation [8] can also be used whenconsi<strong>de</strong>ring many weighted sums as several different objectivesto optimize, or even <strong>de</strong>fining as many objectives as there are constraints.The remain<strong>de</strong>r of this paper is organized as follows: Section 2shows the main concepts behind multi-objective optimization, whilesection 3 briefly explains the basics of several MOEAs. In section4 the problem of course timetabling is <strong>de</strong>scribed, along withthe main restrictions that apply to a particular instance. Finally,sections 5 and 6 explain the experimental results and conclusionsrespectively.2. CONCEPTS IN MULTI-OBJECTIVE OPTIMIZATIONThe use of Multi-Objective Optimization as a tool to solve Multi-Objective Problems (MOP) implies explaining some key conceptsthat are of invaluable importance. Without them it would be inaccurateto <strong>de</strong>scribe what a good approximation to the Pareto Frontis in terms of criteria such as closeness to the Pareto set, diversity,etc [9, 10, 11, 12].Multi-Objective Optimization is the exploration of one or more<strong>de</strong>cision variables belonging to the function space, which simultaneouslysatisfy all constraints to optimize an objective functionvector that maps the <strong>de</strong>cision variables to two or more objectives.minimize/maximize( f k (s)),∀k ∈ [1,K] (1)Each <strong>de</strong>cision vector s={(s 1 , s 2 , .., s m )} represents accurate numericalqualities for a MOP. The set of all <strong>de</strong>cision vectors constitutesthe <strong>de</strong>cision space. The set of <strong>de</strong>cision vectors that simultaneouslysatisfies all the constraints is called feasible set (F). The objectivefunction vector ( f ) maps the <strong>de</strong>cision vectors from the <strong>de</strong>cisionspace into a K-dimensional objective space Z∈R K , z= f (s),f (s)={ f 1 (s), f 2 (s),..., f K (s)}, z∈Z, s∈F.In or<strong>de</strong>r to compare the solutions of a given MOP with K≥2 objectives,instead of giving a scalar value to each solution, a partialor<strong>de</strong>r is <strong>de</strong>fined according to Pareto-dominance relations, as <strong>de</strong>tailedbelow.Or<strong>de</strong>r relation between <strong>de</strong>cision vectors: Let s and s’ be two<strong>de</strong>cision vectors. The dominance and incomparability relations ina minimization problem are:{s dominates s ′ (s ≺ s ′ ) i f ff k (s) < f k (s ′ ) ∧ f k ′(s) ≯ f k ′(s′ ), ∀k ′ ≠ k ∈ [1,K]{s, s ′ are incomparable (s ∼ s ′ ) i f ff k (s) < f k (s ′ ) ∧ f k ′(s) > f k ′(s′ ), k ′ ≠ k ∈ [1,K]Pareto-optimal solution: A solution s is called Pareto-optimal ifthere is no other s’∈F, such that f (s’)< f (s). All the Pareto-optimalsolutions <strong>de</strong>fine the Pareto-optimal set, also called Pareto Front.Non-dominated solution: A solution s∈F is non-dominated withrespect to a set S ′ ∈F if and only if ̸ ∃s’∈S ′ , verifying that s ′ ≺s.Obtaining a set of non-dominated solutions is not the only importantobjective when solving this kind of problem. Obtaining a wi<strong>de</strong>and evenly distributed Pareto Front is also of key importance becausesuch a set of solutions is more useful for the <strong>de</strong>cision makingprocess. This happens because a wi<strong>de</strong> and evenly distributedPareto Front h(2)(3)ALIO-EURO <strong>2011</strong> – 22


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>3. IMPLEMENTED MOEASThe following MOEAs have been used to perform the experimentsnee<strong>de</strong>d to gather the <strong>da</strong>ta used in this paper:• NSGA-II, Non-dominated Sorting Genetic Algorithm II [13].It makes use of a population as well as a temporary helperpopulation where it stores the <strong>de</strong>scen<strong>da</strong>nt individuals. Itthen joins both populations and classifies them by usinga fast non-dominated sorting to separate the solutions intoseveral fronts, with a domination relationship between them.To generate the next population, only the first fronts arekept, while the other solutions are disregar<strong>de</strong>d. As an estimationof solution <strong>de</strong>nsity, the Crowding distance is calculated,in or<strong>de</strong>r to use a crowding comparison operator togui<strong>de</strong> the selection process towards a uniform front. In thisway, the population holds the Pareto front and becomes thesolution at the end of the procedure.• PESA, Pareto Envelope-based Selection Algorithm [14].This MOEA uses a hypergrid for analyzing the <strong>de</strong>nsity informationof the individuals. PESA keeps the non-dominatedindividuals in an archive, up<strong>da</strong>ting it each time a newsolution is inserted by removing the old solutions that becomeindifferent or dominated by the new one. The archiveholds the Pareto front, which becomes the solution at theend of the procedure.• SPEA2, Strength Pareto Evolutionary Algorithm [15]. Ituses a strength indicator in or<strong>de</strong>r to measure the solutionquality of the individuals stored in the archive. At the endof the procedure, the archive becomes the final solution,storing the generated Pareto front. The main operations inthis MOEA consist of generating the fitness of the solutions,calculating the <strong>de</strong>nsity information for each solutionwithin the solution set, and then truncating the archive onceit becomes full, by removing the worst quality solutions inthe <strong>de</strong>nsest areas.• msPESA, Mixed Spreading PESA [16]. This MOEA is a<strong>de</strong>rivative of PESA that implements a different hypergridpolicy allowing the grid resolution to increase without penalizingperformance. In this case, the hypergrid has onedimension less than the PESA hypergrid, so the memory requirementsare greatly reduced for larger populations. Thelogic behind this consists of using the same number of cellsin the grid as there are solutions. I<strong>de</strong>ally this would meanthat as the algorithm optimizes the Pareto front, the solutionswould end up evenly spread alongsi<strong>de</strong> the front. Oninserting a solution into the archive, it performs a localsearch procedure in or<strong>de</strong>r to improve the quality of the solution,or it even inserts more than one possible solution.Inserting a new solution into the archive does not enforcea strong elitism, since all the solutions are kept, and theyare only removed when the archive is full. This increasesgenetic variety during the first iterations of the MOEA.4. PROBLEM DEFINITION: COURSE TIMETABLINGThe implemented MOEAs use the problem proposed by Di Gasperoand Schaerf [4] , which consi<strong>de</strong>rs q lectures (c 1 ,. . . , c q ), p periods(1,..., p) and m rooms (r 1 ,. . . ,r m ). Each course c i consists of l i periodsthat will be scheduled in different time slots with s i assignedstu<strong>de</strong>nts. Each room r j has a capacity cap j , <strong>de</strong>fined by the numberof available seats. There are also g lecture groups called curricula,such that any pair of courses of a curriculum have stu<strong>de</strong>nts in common.The objective of the problem is to satisfy every hard constraint ineach and every one of the final solutions of the problem, while thesoft constraints may not be fully satisfied, <strong>de</strong>teriorating the solutionquality. The following <strong>de</strong>finitions show the constraints for abasic <strong>de</strong>finition of this timetabling problem:Lectures (hard) The number of lectures of course c i must be exactlyl i .Room Occupancy (hard) Two distinct lectures cannot take placein the same period and room.Conflicts (hard) Lectures of courses in the same curriculum ortaught by the same teacher must be scheduled at differenttimes.Availabilities (hard) Lecturers may not be available for some periods.Room Capacity (soft) The number of stu<strong>de</strong>nts that attend a coursemust be less than or equal to the number of seats in each ofthe rooms that host its lectures.Minimum Working Days (soft) The set of periods p is split inwd <strong>da</strong>ys of p/wd periods each (assuming that p is divisibleby wd). Each period therefore belongs to a specific week<strong>da</strong>y. The lectures of each course c i must be spread over aminimum number of <strong>da</strong>ys d i (with d i ≤ l i and d i ≤ wd).Curriculum Compactness (soft) The <strong>da</strong>ily schedule of a curriculumshould be as compact as possible, avoiding isolated lectures,i.e. one lecture for a given curriculum that is not adjacentto any other lecture within the same <strong>da</strong>y.There are other categories of constraints and requirements existingon a practical level, rather than on an aca<strong>de</strong>mic one, such as:Lecture Management A teacher must not give lectures in morethan 4 consecutive periods.4.1. Timetabling SolverAs an initial treatment, an attempt to schedule the classes is ma<strong>de</strong>by sorting the rooms in <strong>de</strong>scending or<strong>de</strong>r of available seats, whichgreatly helps to schedule the initialization of the Individuals (ofthe initial population, which has not yet been evolved). This pretreatmenttries to fit all the lectures in time slots where they fit an<strong>da</strong>re not violating any hard constraints. Individuals that are createdfrom another one (<strong>de</strong>scen<strong>da</strong>nts) clone them (they become exactcopies). This behavior helps to reduce the amount of hard constraintviolations.During the evaluation of each Individual the violations of hardconstraints are checked. In case of violation, it will most likelyhappen during the first generations because Individuals that complywith the hard constraints have not yet evolved. Once a violationof a hard constraint happens, then the evaluation procedurewill try to correct it by randomly making additional changes to theschedule in a mutation-like manner. This will always be applie<strong>da</strong>fter the mutation operation. Only changes that do not producehard constraint violations are allowed. This means that both themutation operation and the additional corrections performed at thebeginning of the evaluation process allow valid individuals to appearafter a brief time interval. Once the hard constraints have beenremoved, all the optimization efforts will be centered on minimizingthe violations of soft constraints.Mutations follow a pattern inspired by Simulated Annealing, whichmeans that as the number of function evaluations increases, theamount of time slot exchanges slowly <strong>de</strong>creases. At the beginningof the procedure, up to three movements are ma<strong>de</strong> in the mutation.At the end of the process only one change is allowed. Choosingthe amount of initial maximum movements is related with performanceissues, since each movement implies checking for compliancewith all the hard constraints beforehand, higher numbers ofALIO-EURO <strong>2011</strong> – 23


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>changes impair performance significantly. No crossover operationhas been implemented in or<strong>de</strong>r to avoid generating timetables thatviolate hard constraints (the constraint on the amount of lecturesfor each course).The objectives chosen for optimization are the sum of the valuesof CurriculumCompactness, RoomCapacity and MinimumWorkingDaysas the first objective, and CurriculumCompactness as thesecond one. The intention is both to minimize the whole set ofobjectives, while placing a special focus on the importance of havinga <strong>de</strong>nse time schedule in or<strong>de</strong>r to reduce the problem of <strong>de</strong>adhours that is so inconvenient for both teachers and stu<strong>de</strong>nts. Thisalso allows for easy sorting based on the first objective in or<strong>de</strong>rto i<strong>de</strong>ntify the best global solutions, while in some situations it ismore interesting to choose solutions with a higher penalty for CurriculumCompactnessbecause it usually has an impact on the otherconstraints. Usually, the higher the penalty on CurriculumCompactness,the lower the penalty on the other objectives.The problem instance is loa<strong>de</strong>d in memory as a set of linked objects,which allows easy analysis of the relations between the differentcourses, rooms, curricula and constraints. With that information,the timetable is constructed as a string-in<strong>de</strong>xed vector thatholds a matrix of courses. The string in<strong>de</strong>x represents the assignedroom while the matrix of courses it references is the timetable assignedto that room, using the matrix in<strong>de</strong>xes to represent the timeperiod and <strong>da</strong>y of the week.5. EXPERIMENTAL RESULTSThe results obtained by the MOEAs <strong>de</strong>pend on the implementationof the individual, because the operations nee<strong>de</strong>d to build a proper,working, timetable are not as simple as the operations nee<strong>de</strong>d tooptimize the ZDTn functions used as benchmarks. Furthermore,representing a timetable as well as groups of stu<strong>de</strong>nts, teacher andspace constraints implies additional challenges to add to the evolutionaryoperations.The configuration parameters for the experiments were 100 individualsfor archive size in PESA and msPESA (10 for their workingpopulations), 100 individuals for SPEA2 archive and workpopulation, and 100 for NSGA-II (its helper population has thesame size as the main one). The local search parameter for msPESAis to generate 10 new individuals with two moves each, and all theprocedures were set to finish after performing 10 6 function evaluations.Table 1 shows the best results found by the tabu search procedureused in [4] as a reference to compare with the results generatedby the MOEAs implemented for this thesis. Note that in the originalsettings for the results obtained with the tabu search, there isno specification of any limits in the amount of time or number offunction evaluations used in the experiments.The experiments with MOEA have been performed by choosingthe soft constraints as objectives. The assigned weights are 1x foreach violation of RoomCapacity, 1x for each violation of CurriculumCompactness and 5x for each violation of MinimumWorking-Days. In the tabu search procedure, the sum of all penalizationsgenerates the value of the solution. Therefore, the lower the sum,the better the solution.An interesting convergence phenomenon appeared when performingthe experiments: different solutions shared the same penalizationscore. This means that as the experiments progress further, thePareto front tends to converge towards a local minimum, unless bychance a better timetable is found, which effectively substitutes ina few generations all the solutions with the previous penalization.Since the Pareto dominance criterion is not met, due to the convergenceto the best solution, it is far more difficult for the MOEAsto solve the timetabling problem with this criterion. This is whyin table 1 the solutions are given as a single scalar (the best solutionfound after calculating the weight of all the penalizations,of all the solutions returned by the MOEAs), instead of giving thePareto fronts generated by each procedure. The values given arethe result of the weighted sum of the objectives, as used for thegeneration of the Optimal solution of the different instances..Test1 Test2 Test3 Test4Optimal Solution 214 8 36 43NSGA-II 364 52 99 84SPEA2 253 59 66 97PESA 236 28 81 68msPESA 235 11 61 67Table 1: Comparison of the best solution found by each procedureafter 10 6 function evaluations. The optimal solution is given asreference [4].As table 1 illustrates, PESA and msPESA are the best proceduresfor this problem after running 1,000,000 objective function evaluations.6. CONCLUSIONSTable 1 shows that msPESA is the best procedure in all situations,with the limit of 10 6 evaluations imposed on the procedures. Theuse of a local search procedure allowed it to improve the solutionquality faster than other MOEAs. Though it does not reach optimalresults, it comes close, especially for the problems test1 and test2.Given the ad<strong>de</strong>d difficulties to obtain solutions to the timetablingproblem, these results are interesting, consi<strong>de</strong>ring how close thePESA-based methods were to the optimal solution for some of thetest instances.7. ACKNOWLEDGEMENTSThis work has been financed by the Spanish Ministry of Innovationand Science (TIN2008-01117) and the Excellence Project of Junta<strong>de</strong> An<strong>da</strong>lucía (P07-TIC02988), in part financed by the EuropeanRegional Development Fund (ERDF).8. REFERENCES[1] D. Corne, P. Ross, and H. lan Fang, “Evolutionarytimetabling: Practice, prospects and work in progress,” in In<strong>Proceedings</strong> of the UK Planning and Scheduling SIG Workshop,Strathcly<strong>de</strong>, 1994.[2] B. Paechter, A. Cumming, H. Luchian, and M. Petriuc, “Twosolutions to the general timetable problem using evolutionarymethods,” in proceedings of the IEEE Conference on EvolutionaryComputation, vol. 1994, 1994.[3] E. K. Burke and S. Petrovic, “Recent research directions inautomated timetabling,” European Journal of OperationalResearch, vol. 140, no. 2, pp. 266 – 280, 2002.[4] L. Di Gaspero and A. Schaerf, “Neighborhood portfolioapproach for local search applied to timetabling problems,”Journal of Mathematical Mo<strong>de</strong>ling and Algorithms, vol. 5,no. 1, pp. 65–89, 2006. [Online]. Available: http://www.diegm.uniud.it/satt/papers/DiSc06.pdf[5] P. Kostuch, “The university course timetabling problemwith a three-phase approach,” in Practice and Theory ofAutomated Timetabling V, ser. Lecture Notes in ComputerALIO-EURO <strong>2011</strong> – 24


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Science, E. Burke and M. Trick, Eds. Springer Berlin/ Hei<strong>de</strong>lberg, 2005, vol. 3616, pp. 109–125. [Online].Available: http://dx.doi.org/10.1007/11593577_7[6] A. Schaerf, “A survey of automated timetabling,” ArtificialIntelligence Review, vol. 13, pp. 87–127,1999, 10.1023/A:1006576209967. [Online]. Available:http://dx.doi.org/10.1023/A:1006576209967[7] T. Cooper and J. Kingston, “The complexity of timetableconstruction problems,” in <strong>Proceedings</strong> of the First InternationalConference on the Practice and Theory of AutomatedTimetabling (ICPTAT ’95), 1995, pp. 511–522.[8] D. Datta, C. M. Fonseca, and K. Deb, “A multi-objective evolutionaryalgorithm to exploit the similarities of resource allocationproblems,” J. of Scheduling, vol. 11, no. 6, pp. 405–419, 2008.[9] K. Deb, Multi-Objective Optimization using EvolutionaryAlgorithms. John Wiley & Sons, 2001.[10] E. Talbi, Metaheuristics: From Design to Implementation.New York: John Wiley & Sons, Inc., 2009.[11] C. C. Coello, G. Lamont, and D. van Veldhuizen, EvolutionaryAlgorithms for Solving Multi-Objective Problems,2nd ed., ser. Genetic and Evolutionary Computation. Berlin,Hei<strong>de</strong>lberg: Springer, 2007.[12] M. Voorneveld, “Characterization of pareto dominance,” OperationsResearch Letters, vol. 31, no. 1, pp. 7 – 11, 2003.[13] K. Deb, A. Pratab, S. Agrawal, and T. Meyarivan, “A FastElitist Non-Dominated Sorting Genetic Algorithm for Multi-Objective Optimization: NSGA-II,” IEEE Transactions onevolutionary computation, vol. 6, no. 2, pp. 182–197, 2002.[14] D. Corne, J. Knowles, and M. Oates, “The Pareto EnvelopebasedSelection Algorithm for Multiobjective Optimization,”in <strong>Proceedings</strong> of the Parallel Problem Solving from NatureVI Conference, M. Schoenauer, K. Deb, G. Rudolph, X. Yao,E. Lutton, J. J. Merelo, and H.-P. Schwefel, Eds. Paris,France: Springer. Lecture Notes in Computer Science No.1917, 2000, pp. 839–848.[15] E. Zitzler, M. Laumanns, and L. Thiele, “SPEA2: Improvingthe Strength Pareto Evolutionary Algorithm,” Gloriastrasse35, CH-8092 Zurich, Switzerland, Tech. Rep. 103, 2001.[16] C. Gil, A. Márquez, R. Baños, M. Montoya, and J. Gómez,“A hybrid method for solving multi-objective globaloptimization problems,” Journal of Global Optimization,vol. 38, no. 2, pp. 265–281, 2007. [Online]. Available:http://www.springerlink.com/content/f3n1284ur211p587ALIO-EURO <strong>2011</strong> – 25


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Automated Design of Software Architectures for Embed<strong>de</strong>d Systems usingEvolutionary Multiobjective OptimizationR. Li ∗ R. Etemaadi ∗ M.T.M. Emmerich ∗ M.R.V. Chaudron ∗∗ Lei<strong>de</strong>n Institute of Advanced Computer Science (LIACS), Lei<strong>de</strong>n UniversityPostbus 9512, 2300RA, Lei<strong>de</strong>n, The Netherlands{ruili, etemaadi, emmerich, chaudron}@liacs.nlABSTRACTThe <strong>de</strong>sign of software architecture for embed<strong>de</strong>d system is one ofthe big challenges in the research field of mo<strong>de</strong>rn software engineering.It requires software architects to address a large numberof non-functional requirements that can be used to quantify theoperation of system. Furthermore, these quality attributes oftenconflict with each other, for instance, improving system performanceoften needs more powerful hardware, which could increasethe production cost and power consumption in the meantime. Inmost cases, software <strong>de</strong>signers try to find a set of good architecturesby hand. However because of large and combinatorial <strong>de</strong>signspace, this process is very time-consuming and error-prone. Asa consequence, architects could easily end up with some suboptimal<strong>de</strong>signs. In this paper, we introduce our AQOSA (AutomatedQuality-driven Optimization of Software Architecture) toolkit whichcan improve these aforementioned non-functional properties in anautomated manner. More precisely, beginning with some initial architectures,AQOSA toolkit can use its optimizer to not only produceseveral alternatives, but also apply tra<strong>de</strong>-off analysis to thesenewly created architectures according to multiple attributes of interests.Keywords: Component-Based Software Architecture, EvolutionaryMultiobjective Optimization1. INTRODUCTIONMo<strong>de</strong>rn embed<strong>de</strong>d systems are large and complicated and thereforedifficult to <strong>de</strong>velop and maintain. For example, real-time systems,which nowa<strong>da</strong>ys are intensively applied to application domainssuch as automobile and multimedia, are often built to guaranteethe safety and robustness requirements. To meet these requirementsmakes the <strong>de</strong>sign of real-time systems very challenging.Un<strong>de</strong>r such circumstances, software architecture which is an importantfield of study in software engineering receives more andmore attentions in the last few years. More technically speaking,software architectures <strong>de</strong>scribe various aspects of the system,mostly their <strong>de</strong>ployment, behavioral, and structural features. Withthem, <strong>de</strong>signers have the opportunity to analyze the quality propertiesof software at a high level and thus can make optimal architectural<strong>de</strong>cisions to satisfy the quality attributes at the very earlyarchitectural stage of the project.In many cases, quality properties conflict with each other, that is,improving one quality property can have a negative impact on others,and thus to construct a system that satisfies all its requirementscould be difficult. One possible solution is to use optimizationtechniques to generate several feasible architectures according toinitial mo<strong>de</strong>ls and then select optimal solutions from all alternativesthrough the tra<strong>de</strong>-off analysis with respect to all quality requirements.In current practice, this process is normally performed manuallyto the system <strong>de</strong>sign. The drawback of this is that it can be timeconsumingand error-prone work, especially for large and complexarchitectures. For complex applications, having some of this workautomated could be a consi<strong>de</strong>rable cost saver. To this end we proposeour AQOSA toolkit which was <strong>de</strong>veloped to automaticallyimprove the non-functional properties of an architectural <strong>de</strong>signand thus enable architects to focus on the higher-level <strong>de</strong>sign <strong>de</strong>cisions.The paper is organized as follows. Section 2 summaries some existingmethods which are different from ours. Section 3 explainsour proposed AQOSA toolkit, especially the execution procedure,in <strong>de</strong>tail. The case study as well as some experimental results ispresented in Section 4. Finally, conclusions and future works aregiven in Section 5.2. RELATED WORKAs we emphasized at the very beginning of this paper, it is almostimpossible for software architects to manually find optimal architecture<strong>de</strong>signs from not only large but also discontinuous <strong>de</strong>signsearch space. Researchers have proposed several approaches, especiallysome metaheuristic-based methods which can automatethis process. For instance, Martens et al. [1] introduced approachwhich could automatically improve software architectures mo<strong>de</strong>lledwith the Palladio Component Mo<strong>de</strong>l based on tra<strong>de</strong>-off analysisof performance, reliability, and cost.ArcheOpterix [2] is another generic framework which optimize architecturemo<strong>de</strong>ls with evolutionary algorithms. It supports onlyone <strong>de</strong>gree of freedom for exploration, that is allocation of softwarecomponents. Besi<strong>de</strong>s, two quality criteria (<strong>da</strong>ta transmissionreliability and communication overhead) are <strong>de</strong>fined and theevaluation is based on formal mathematical analysis. Similar toMarten’s approach, ArchiOpterix suffers from the limitation onsearch freedom and has chance to be trapped by some suboptimalsolutions.To alleviate this issue, our proposed AQOSA toolkit, which <strong>de</strong>ploysboth advanced mo<strong>de</strong>l technology and evolutionary multiobjectiveoptimization algorithms with specially <strong>de</strong>signed geneticencoding scheme, allows not only more quality attributes but alsomore complex <strong>de</strong>grees of freedom like exploration of architecturetopology.3. AQOSA TOOLKITThe <strong>de</strong>tailed working process of AQOSA toolkit is illustrated inFigure 1. As can be seen, the automated optimization process startswith some initial software architectures, which could be <strong>de</strong>signedALIO-EURO <strong>2011</strong> – 26


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>by domain experts by using some advanced mo<strong>de</strong>l <strong>de</strong>sign tools.Next, these architectures are evaluated and corresponding multiplequality criteria of interests are obtained. More specific, processorutilization, <strong>da</strong>ta flow latency, and cost metrics are addressed inthis study. At the current stage, the simulation-based approach 1is used for AQOSA evaluator. Note that the precision of evaluationis highly <strong>de</strong>pen<strong>de</strong>nt on the mo<strong>de</strong>ling <strong>de</strong>tails and the featuressupported by simulator.As mentioned earlier, some conflicting quality attributes, such asutilization and cost, are often involved in performance analysis.Thus the domination principle could be adopted by evolutionaryoptimizer for doing tra<strong>de</strong>-off analysis on quality attributes whichare extracted through an extractor based on our performance metrics.Some good architectures are then selected from current availablesolutions. Furthermore, the evolutionary optimizer could automaticallyproduce new candi<strong>da</strong>te architectures by using reproductionoperators like "crossover" and "mutation".toolkit can be easily exten<strong>de</strong>d to support other quantitative qualitycriteria of software architectures by introduce new evaluationplug-ins, i.e. for communication lines loads evaluation, we justnee<strong>de</strong>d to add a new listener which implements the measurementof the bus load to our simulation engine. Another advantage ofAQOSA is that it provi<strong>de</strong>s some very flexible API for the interactionbetween evaluator and various optimization frameworks suchas Opt4J and JMetal 2 .3.2. Evolutionary Optimizer3.2.1. Evolutionary multiobjective optimizationEvolutionary multiobjective optimization (EMO) [7] <strong>de</strong>rives fromsingle objective evolutionary optimization (EO) algorithms and isrecognized as a fast growing fields of research. It is relatively simpleto implement and wi<strong>de</strong>-spread applicable. In this work, tworepresentative multiobjective optimization algorithms (NSGAII [8]and SPEA2 [9]) from literatures are chosen and applied to one architecture<strong>de</strong>sign task for the car radio navigation (CRN) system.3.2.2. Search problem formulationFrom EMO algorithm perspective, architecture <strong>de</strong>sign problem canbe generalized as following optimization task (see Equation 3.2.2):min f m (x), m = 1,2,...,M (1)s.t. g j (x) 0j = 1,2,...,NFigure 1: The <strong>de</strong>tailed working scheme of AQOSA (AutomatedQuality-Driven Optimization of Software Architecture) toolkit.Next, we will explain some key components and related techniquesin <strong>de</strong>tail.3.1. Mo<strong>de</strong>ling and Evaluation EngineFor software architecture mo<strong>de</strong>ling, as a natural extension of previouswork [3] AQOSA integrates ROBOCOP [4] (Robust OpenComponent Based Software Architecture for Configurable DevicesProject) mo<strong>de</strong>ling language. Furthermore, AQOSA also supportsAADL [5] (Architecture Analysis & Design Language) which isnow wi<strong>de</strong>ly recognized industrial stan<strong>da</strong>rd in mo<strong>de</strong>ling embed<strong>de</strong><strong>da</strong>nd real-time architectures. The architect can easily <strong>de</strong>signthe initial architecture in OSATE (Open Source AADL Tool Environment)and then import it into AQOSA framework. To useADeS [6] as the core part of our AQOSA simulation engine, wema<strong>de</strong> some modifications of ADeS in scheduling and ad<strong>de</strong>d newfeatures for <strong>da</strong>ta flow latencies evaluating. More specifically speaking,our evaluation engine first loads an AADL mo<strong>de</strong>l and createsnecessary objects for simulation. After that, it generates systemevents based on the behaviour annex of the mo<strong>de</strong>l and follow theevents through the mo<strong>de</strong>l connections till end of flows. For complexand concurrent events, the scheduling module <strong>de</strong>ci<strong>de</strong>s whichprocess can take the processor.At present, we implement three quality properties: processor utilization,<strong>da</strong>ta flow latency and architecture cost. By <strong>de</strong>sign, AQOSA1 As compared to analysis-based approach.Here, x is a solution and can be of any domain, e.g., real or binary.In the given context, x could be a valid architecture fromembed<strong>de</strong>d system <strong>de</strong>sign domain. For each solution x, there existsm = 3 objectives, i.e. f 1 : Processor utilization, f 2 : Cost, andf 3 : Data flow latency. g j (x) represents a number of constraintswhich any feasible solution must satisfy. The aim is not only provi<strong>de</strong>one optimal solution but rather to provi<strong>de</strong> a broad variety ofnondominated solutions representing tra<strong>de</strong>-offs in the three objectives.3.2.3. Generic <strong>de</strong>gree of freedom to explorationWith specially <strong>de</strong>signed genotype representation, the following <strong>de</strong>greesof freedom to exploration are implemented: (1) System hardwaretopology (hypergraph), i.e. processor/bus can be ad<strong>de</strong>d orremoved from the system, (2) Allocation of service instances, (3)Replacement between different hardwares, i.e. one component canbe replaced by its counterparts from available hardware repository.Figure 2 shows three system topologies which are supported andvalid for car radio navigation (CRN) architecture <strong>de</strong>sign (i.e. casestudy in Section 4).4. CASE STUDY AND EXPERIMENTAL RESULTS4.1. Car Radio Navigation SystemTo vali<strong>da</strong>te our proposed AQOSA toolkit, we applied it to onebenchmark application - the car radio navigation (CRN) system[10]. The CRN system is constructed according to the componentbasedparadigm. An overview of the software architecture is <strong>de</strong>pictedin Figure 3.As can be seen, the CRN system contains three major functionalblocks:2 http://opt4j.sourceforge.net and http://jmetal.sourceforge.netALIO-EURO <strong>2011</strong> – 27


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>AQOSA 20 runs (≈ 10 hours). The resulting archive of optimalsolutions can be visualized in the 3-D Pareto front with respect toprocessor utilization, cost, and <strong>da</strong>ta flow latency in Figure 5.Figure 2: Possible topologies supported by genotype representation:Single processor no<strong>de</strong> (left), Two processor no<strong>de</strong>s with singlebus (middle), and Three processor no<strong>de</strong>s with single bus (right).Figure 3: Overview of the car radio navigation system functionality.• The Man-Machine Interface (MMI), that takes care of allinteractions with the end-user, such as handling key inputsand graphical display output.• The Navigation functionality (NAV) is responsible for <strong>de</strong>stinationentry, route planning and turn-by-turn route gui<strong>da</strong>ncegiving the driver visual advices. The navigation functionalityrelies on the availability of map <strong>da</strong>tabase and positioninginformation.• The Radio functionality (RAD) is responsible for tunnerand volume control as well as handling of traffic messagechannel information services.The major challenge is to <strong>de</strong>termine a set of optimal architectureswith respect to quality attributes such as processor utilization, <strong>da</strong>taflow latency, and cost. Technically speaking, we investigate howto distribute these aforementioned functionalities over the availableresources (processor no<strong>de</strong> in Figure 2) to meet some globalrequirements. Vector representation in Figure 4 illustrates how thegenotype is used to <strong>de</strong>scribe possible architecture topologies (Figure2) as well as mapping of services.4.2. Experimental Setup and ResultsThe experimental setup is as follows: two stan<strong>da</strong>rd evolutionarymultiobjective optimization algorithms from Opt4J, Non-dominatedSorting Genetic Algorithm (NSGA-II) and Strength Pareto EvolutionaryApproach 2 (SPEA2), will be used. Furthermore, thefollowing parameter settings are adopted: initial population size:50, parent population size: 25, number of offspring: 25, archivesize: 100, number of generation: 50, crossover rate is set to 0.95,constant mutation probability is 0.01. For each algorithm we runFigure 4: Genotype vector for possible software architectures representation(884,736 possibilities).Figure 5: Resulting Pareto front approximations of archive population(non-dominant solutions) after 50 generations of one typicalrun of SPEA2. Colors are used to distinguish between differentfound architecture topologies.An interesting finding is that resulting pareto front consists of threesegmentation (with clearly gap in between). This could be the resultof discontinuities in the search space caused by structural transitions.By i<strong>de</strong>ntifying and mapping each individual from archiveback to corresponding <strong>de</strong>sign architecture, solutions from samesegmentation share the same architectural topology 3 (i.e. Figure2). This discovery is consistent with our un<strong>de</strong>rstanding of CRNsystem, for instance, solutions with topology 3 (solutions with bluecolor) normally have lower processor utilization and higher costfor the hardware. On the contrary, solutions with topology 1 (redcolor) have higher processor utilization and lower cost.Figure 6: Plot between two objectives of archive population (nondominantsolutions): Cost vs. Processor utilization (left) and Costvs. Data flow latency (right).The 2-D plot of two quality attributes is presented in Figure 6. Inthis way, the software architect can make tra<strong>de</strong>-off <strong>de</strong>cision mucheasier. For instance, the left plot shows the processor utilizationover the cost per candi<strong>da</strong>te architecture while the right one indicatesthe <strong>da</strong>ta flow latency over cost. There is no obvious conflictbetween processor utilization and <strong>da</strong>ta flow latency and the correspondingplot is exclu<strong>de</strong>d here. Further more, both the attainmentsurface of one typical run of SPEA2 and the box-plots of the hypervolumeindicator [11] for ref. point (1,1,1) T of archive populationfor NSGA-II, SPEA2, and random search over 20 runs arepresented in Figure 7From figure 7 (left), it gets clear that final solutions from archiveare mutually non-dominated with respect to three quality attributesinvestigated. Another observation is that NSGA-II and SPEA2show the comparable performance with each other (stu<strong>de</strong>nt’s t-test3 All three algorithms which we studied show the same behaviour.ALIO-EURO <strong>2011</strong> – 28


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Figure 7: The dominated Hypervolume approximation of one typicalrun of SPEA2 (left) and the box-plots of the hypervolume indicatorfor NSGA-II, SPEA2, and random search on CRN <strong>de</strong>signproblem over 15 runs (right).with 1% confi<strong>de</strong>nce level), and the results are very similar. Randomsearch, by contrast, shows worst performance.5. CONCLUSIONS AND OUTLOOKWe presented so-called AQOSA (Automated Quality-driven Optimizationof Software Architecture) toolkit. It not only can helpsoftware architects to reduce the workload for mo<strong>de</strong>ling and evaluatingreal-world problems, but also can automatically improvequality attributes by using evolutionary multiobjective optimizers.We applied AQOSA on the car radio navigation (CRN) system.The preliminary results are very promising.For future research several questions are of interest: First, morechallenging application (i.e., from automobile industry) will bemo<strong>de</strong>led and tested by using AQOSA. Secondly, besi<strong>de</strong>s aforementione<strong>da</strong>ttributes which we studied in this work other nonfunctionalqualities such as power consumption and safety will beintegrated. Algorithms such as SMS-EMOA [12] are also worthinvestigating for the resulting many-objective problems.6. ACKNOWLEDGEMENTSThis work has been supported by the Dutch national project OMECA(Optimization of Modular Embed<strong>de</strong>d Computer-vision Architectures)and European project SCALOPES (an ARTEMIS projecton SCalable LOw Power Embed<strong>de</strong>d platformS).7. REFERENCES[1] A. Martens, H. Koziolek, S. Becker, and R. Reussner, “Automaticallyimprove software architecture mo<strong>de</strong>ls for performance,reliability, and cost using evolutionary algorithms,”in <strong>Proceedings</strong> of the first joint WOSP/SIPEW internationalconference on Performance engineering, 2010, pp. 105–116.[2] A. Aleti, S. Björnan<strong>de</strong>r, L. Grunske, and I. Mee<strong>de</strong>niya,“Archeopterix: An exten<strong>da</strong>ble tool for architecture optimizationof AADL mo<strong>de</strong>ls,” in ICSE 2009, MOMPES Workshop2009, May 16, 2009, Vancouver, Cana<strong>da</strong>, 2009, pp. 61–71.[3] R. Li, M. R. Chaudron, and R. C. La<strong>da</strong>n, “Towards automatedsoftware architectures <strong>de</strong>sign using mo<strong>de</strong>l transformationsand evolutionary algorithms,” in GECCO (Companion).ACM, 2010, pp. 2097–2098.[4] E. Bon<strong>da</strong>rev, M. R. Chaudron, and P. <strong>de</strong> With, “A processfor resolving performance tra<strong>de</strong>-offs in component-based architectures,”in Component-Based Software Engineering, ser.LNCS, vol. 4063, 2006, pp. 254–269.[5] P. H. Feiler, D. Gluch, and J. J. Hu<strong>da</strong>k, “The architectureanalysis & <strong>de</strong>sign language (AADL): An introduction,”Carnegie Mellon University, Technical Report CMU/SEI-2006-TN-011, 2006.[6] R. S. Jean-François Tilman, Amélie Schyn, “Simulation ofsystem architectures with AADL,” in <strong>Proceedings</strong> of 4thInternational Congress on Embed<strong>de</strong>d Real-Time Systems,ERTS 2008., 2008.[7] K. Deb, “Multiobjective optimization,” J. e. a. Branke, Ed.Springer-Verlag, 2008, ch. Introduction to Evolutionary MultiobjectiveOptimization, pp. 59–96.[8] K. Deb, S. Agrawal, A. Pratap, and T. Meyarivan, “A fastelitist non-dominated sorting genetic algorithm for multiobjectiveoptimization: NSGA-II,” in Parallel Problem Solvingfrom Nature PPSN VI, ser. LNCS, 2000, vol. 1917, pp.849–858.[9] E. Zitzler, M. Laumanns, and L. Thiele, “SPEA2: Improvingthe Strength Pareto Evolutionary Algorithm for MultiobjectiveOptimization,” Tech. Rep., 2002.[10] E. Wan<strong>de</strong>ler, L. Thiele, M. Verhoef, and P. Lieverse, “Systemarchitecture evaluation using modular performance analysis:a case study,” Int J Softw Tools Technol Transfer (STTT),vol. 8, no. 6, pp. 649–667, 2006.[11] E. Zitzler, L. Thiele, M. Laumanns, C. Fonseca, andV. <strong>da</strong> Fonseca, “Performance assessment of multiobjectiveoptimizers: an analysis and review,” IEEE Trans. on EvolutionaryComputation, vol. 7, no. 2, pp. 117–132, April 2003.[12] N. Beume, B. Naujoks, and M. Emmerich, “SMS-EMOA:Multiobjective selection based on dominated hypervolume,”European Journal of Operational Research, vol. 181, no. 3,pp. 1653–1669, 2007.ALIO-EURO <strong>2011</strong> – 29


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>New Characterizations for Subfamilies of Chor<strong>da</strong>l GraphsL. Markenzon ∗1 P.R.C. Pereira † C.F.E.M. Waga ‡∗ NCE - Universi<strong>da</strong><strong>de</strong> Fe<strong>de</strong>ral do Rio <strong>de</strong> JaneiroP. O. Box: 2324, RJ, Brazil 20010-974markenzon@nce.ufrj.br† Instituto Militar <strong>de</strong> <strong>Engenharia</strong>Praça General Tibúrcio, 80, Rio <strong>de</strong> Janeiro, Brazil 22290-270prenato@ime.eb.br‡ IME - Universi<strong>da</strong><strong>de</strong> do Estado do Rio <strong>de</strong> JaneiroRua São Francisco Xavier, 524, Rio <strong>de</strong> Janeiro, Brazil, 20550-900waga@ime.uerj.brABSTRACTIn this paper, we give new characterizations for some subfamiliesof chor<strong>da</strong>l graphs, such as k-intercats and SC k-trees, based onproperties of their minimal vertex separators. We also establishthe relationship among these families and interval graphs.Keywords: Chor<strong>da</strong>l graph, k-tree, ur-chor<strong>da</strong>l1. INTRODUCTIONChor<strong>da</strong>l graphs are an extensively studied class of graphs, as theirpeculiar clique-based structure allows a more efficient solution formany algorithmic problems. The investigation of new propertiesof the family brings up the possibility of solving problems moreefficiently, with a different approach.In this context, the minimal vertex separators play a <strong>de</strong>cisive role.Their <strong>de</strong>termination has been already studied in at least two recentpapers [1, 2]. The presentation of a very simple algorithm [3] toperform this task renews the chance to find better results for severalproblems. Based on properties of the minimal vertex separatorsof chor<strong>da</strong>l graphs and their multiplicities, we propose in this papernew characterizations for some known subfamilies of chor<strong>da</strong>lgraphs such as k-intercats and SC k-trees, which generalizes mopsand maximal planar chor<strong>da</strong>l graphs. The new structural characterizationslead to simple and efficient recognition algorithms. Weare also able to prove inclusion relations among these families andother subfamilies of chor<strong>da</strong>l graphs such as interval graphs.2. BACKGROUNDBasic concepts about chor<strong>da</strong>l graphs are assumed to be known andcan be found in Blair and Peyton [4] and Golumbic [5]. In thissection, the most pertinent concepts are reviewed.Let G = (V,E) be a graph, with |E| = m, |V | = n > 0. The setof neighbors of a vertex v ∈ V is <strong>de</strong>noted by Ad j(v) = {w ∈ V |(v,w) ∈ E}. For any S ⊆ V , we <strong>de</strong>note G[S] the subgraph of Ginduced by S. S is a clique when G[S] is a complete graph. Avertex v is said to be simplicial in G when Ad j(v) is a clique in G.A subset S ⊂ V is a separator of G if two vertices in the same connectedcomponent of G are in two distinct connected components1 Partially supported by grant 305372/2009-2, CNPq, Brazil.of G[V − S]. The set S is a minimal separator of G if S is a separatorand no proper set of S separates the graph. A subset S ⊂ V is avertex separator for non-adjacent vertices u and v (a uv-separator)if the removal of S from the graph separates u and v into distinctconnected components. If no proper subset of S is a uv-separatorthen S is a minimal uv-separator. When the pair of vertices remainsunspecified, we refer to S as a minimal vertex separator. Itdoes not necessarily follow that a minimal vertex separator is alsoa minimal separator.The next theorem presents a characterization of chor<strong>da</strong>l graphs interms of minimal vertex separators.Theorem 1. [5] A graph is chor<strong>da</strong>l if and only if every minimalvertex separator of it induces a clique.The clique-intersection graph of a chor<strong>da</strong>l graph G is the connectedweighted graph whose vertices are the maximal cliques ofG and whose edges connect vertices corresponding to non-disjointmaximal cliques. Each edge is assigned an integer weight, givenby the cardinality of the intersection between the maximal cliquesrepresented by its endpoints. Every maximum-weight spanningtree of the clique-intersection graph of G is called a clique-tree ofG.Theorem 2. [4] Let G = (V,E) be a chor<strong>da</strong>l graph and T =(V T ,E T ) a clique-tree of G. The set S ⊂ V is a minimal vertex separatorof G if and only if S = Q ′ ∩Q ′′ for some edge (Q ′ ,Q ′′ ) ∈ E T .Observe that the set of minimal vertex separators related to oneclique-tree is actually a multiset, since the same minimal vertexseparator can appear several times. Blair and Peyton [4] provedthat, for a chor<strong>da</strong>l graph G, the same multiset is always obtained.Theorem 3. Let G = (V,E) be a chor<strong>da</strong>l graph. The multiset S ∗of the minimal vertex separators of G is the same for every cliquetreeof G.From Theorem 3 it is clear that |S ∗ | = l − 1, being l the numberof maximal cliques of G. We <strong>de</strong>fine the multiplicity of the minimalvertex separator S, <strong>de</strong>noted by µ(S), as the number of times thatS appears in S ∗ . The set of minimal separators S (S ∗ withoutrepetitions) has cardinality η.Two important subfamilies of chor<strong>da</strong>l graphs, the k-trees and theinterval graphs, can be <strong>de</strong>fined as follows [6].Definition 1. A k-tree, k > 0, can be inductively <strong>de</strong>fined as follows:ALIO-EURO <strong>2011</strong> – 30


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>• Every complete graph with k vertices is a k-tree.• If G = (V,E) is a k-tree, v /∈ V and Q ⊆ V is a k-clique ofG, then G ′ = (V ∪{v},E ∪{{v,w}|w ∈ Q}) is also a k-tree.• Nothing else is a k-tree.The simplicial vertices of a k-tree are also called k-leaves.Definition 2. An interval graph is the intersection graph of a setof intervals on the real line. It has one vertex for each interval inthe set, and an edge between every pair of vertices correspondingto intervals that intersect.3. GENEALOGY OF CHORDAL GRAPHSInterval graphs and k-trees are well known in the literature. Ourgoal is to establish the relation among these families and threeother genealogical branches of chor<strong>da</strong>l graphs. The first branch,<strong>de</strong>fined by Proskurowski [7], is the family of k-caterpillars and its<strong>de</strong>scen<strong>de</strong>nt, the k-intercats. The second one, <strong>de</strong>fined by Kumar andMadhavan [8], is the family of ur-chor<strong>da</strong>l graphs and its <strong>de</strong>scen<strong>de</strong>nt,the ur-interval graphs. The last one, <strong>de</strong>fined by Markenzonet al. [9], is the family of SC k-trees and its <strong>de</strong>scen<strong>de</strong>nt, the k-pathgraphs. The <strong>de</strong>finitions of all these families are reviewed in thissection.Kumar and Madhavan <strong>de</strong>fined several families based on structuralproperties of the clique-tree. We are going to focus on two of thesefamilies.Definition 3. [8] A chor<strong>da</strong>l graph is called uniquely representablechor<strong>da</strong>l graph (briefly ur-chor<strong>da</strong>l graph) if it has exactly one cliquetree. An interval graph that is uniquely representable is called anur-interval graph.Theorem 4 presents a characterization of ur-chor<strong>da</strong>l graphs.Theorem 4. [8] Let G = (V,E) be a connected chor<strong>da</strong>l graph. Gis an ur-chor<strong>da</strong>l graph if and only if (i) there is no proper containmentbetween any two minimal vertex separators and (ii) allminimal vertex separators have multiplicity 1.The concept of a k-path appeared first in [10], as a generalizationof paths. It is the base of the formal <strong>de</strong>finition of k-path graphs.Definition 4. [10] In a graph G = (V,E), a k-path of length p > 0is a sequence 〈B 0 ,C 1 ,B 1 ,C 2 ,B 2 ,...,C p ,B p 〉, where:• B i ⊂ V , 0 ≤ i ≤ p, are distinct k-cliques of G;• C i ⊆ V , 1 ≤ i ≤ p, are distinct (k + 1)-cliques of G;• B i−1 ⊂ C i , B i ⊂ C i and no other k-clique B j , 0 ≤ j ≤ p,j ≠ i − 1 and j ≠ i, is a subset of C i , 1 ≤ i ≤ p.Definition 5. [9] Let G = (V,E) be a k-tree with n > k vertices.G is a k-path graph if there is a maximal k-path 〈B 0 ,C 1 ,B 1 ,...,C p ,B p 〉, p > 0, such that the subgraph of G induced by C 1 ∪ ... ∪C p is isomorphic to G.Observe that k-paths and k-path graphs are often confused. However,for k > 1, the concepts can be quite distinct; actually, thereare k 2 different maximal k-paths in a k-path graph; the k-cliquesB 1 ,...,B p−1 belong to all maximal k-paths.The recognition of a k-tree as a k-path graph can be easily accomplished,due to the characterization provi<strong>de</strong>d by the next theorem.Theorem 5. [9] Let G = (V,E) be a k-tree with n > k +1 vertices.G is a k-path graph if and only if G has exactly two simplicialvertices.The inductive <strong>de</strong>finition of a simple-clique k-tree (SC k-tree) follows.Note that its construction is similar to the one presented inDefinition 1, except that it is more restrictive. It is worth to mentiontwo particular cases of the family: SC 2-trees are the maximalouterplanar graphs (mops) and SC 3-trees, the maximal planarchor<strong>da</strong>l graphs.Definition 6. [9] A Simple Clique k-tree (SC k-tree), k > 0, can beinductively <strong>de</strong>fined as follows:• Every complete graph with k + 1 vertices is a SC k-tree.• If G = (V,E) is a SC k-tree v /∈ V and Q ⊂ V is a k-cliqueof G not previously chosen in the existing SC k-tree, thenG ′ = (V ∪ {v},E ∪ {{v,w}|w ∈ Q}) is also a SC k-tree.• Nothing else is a SC k-tree.The <strong>de</strong>finition of k-caterpillars and k-intercats is also based on theconcept of k-paths and were presented in [7]. Firstly we <strong>de</strong>fine thebody of a graph.Definition 7. Let G be a chor<strong>da</strong>l graph and H the set of its simplicialvertices. We call G[V − H], the subgraph induced by V − H,the body of G.Definition 8. Let G be a k-tree and P its body. G is a k-caterpillarif P is: (i) an empty graph or (ii) a complete graph or (iii) a k-pathgraph.Definition 9. Let G be k-caterpillar and P its body. G is an interiork-caterpillar (k-intercat, for short) if: (i) P is an empty graphor (ii) P is a complete graph with k vertices or (iii) there is a maximalk-path in P 〈B 0 ,C 1 ,B 1 ,...,C p ,B p 〉 such that for any k-leaf vof G, v is adjacent to all vertices of some k-clique B i .4. NEW CHARACTERIZATIONSIn this section we present three theorems that establish the relationsamong all the families mentioned. It is interesting to note that thesetheorems actually provi<strong>de</strong> new characterizations for some of thesefamilies such as the SC k-trees and the k-intercats. For the latter,the characterization leads to a simple linear recognition algorithm.Theorem 6. Let G = (V,E) be a k-tree with n > k + 1 vertices.The three following statements are equivalent:1. G is a SC k-tree.2. All minimal vertex separators of G have multiplicity one, that isη = n − k − 1.3. G is an ur-chor<strong>da</strong>l graph.Proof:(1 ⇐⇒ 2) Definition 1 provi<strong>de</strong>s the construction of a k-tree G. Itis possible to build at the same time the clique-tree of G: each newvertex v, together with the k-clique Q, chosen in the current graph,forms a new maximal clique and, consequently, a new vertex ofthe clique-tree. Two maximal cliques of G have the same subsetQ; so, Q is a minimal vertex separator of G. By Definition 6, in aSC k-tree Q can be chosen only once.(2 ⇐⇒ 3) Kumar and Madhavan [8] proved that a chor<strong>da</strong>l graphis uniquely representable if and only if (i) there is no proper containmentbetween any two minimal vertex separators and (ii) allminimal vertex separators have multiplicity 1. By Rose [11], everyminimal vertex separator of a k-tree has cardinality k; so, thereis no containment between them.□ALIO-EURO <strong>2011</strong> – 31


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>The concept of asteroi<strong>da</strong>l triple is fun<strong>da</strong>mental for a compact characterizationof interval graphs. Three vertices u,v,w of G forman asteroi<strong>da</strong>l triple (AT) if for every pair of them there is a pathconnecting the two vertices that avoids the neighborhood of theremaining vertex. Brandstadt et al. [6] refer to the following theorem:Theorem 7. G is an interval graph if and only if G is chor<strong>da</strong>l andcontains no AT.Besi<strong>de</strong>s the efficient recognition of k-intercats, the next theoremalso shows that a k-tree is an interval graph if and only if it is ak-intercat.Theorem 8. Let G be a k-tree with η ≥ 2 minimal vertex separatorsand P its body. The three following statements are equivalent:1. G is a k-intercat.2. G is an interval graph.3. P has exactly η − 2 minimal vertex separators.Proof:(1⇒2) Let 〈B 0 ,C 1 ,B 1 ,...,C p ,B p 〉 be a longest k-path of G. Let G ′be the subgraph of G induced by the vertices of this k-path. G ′ hastwo simplicial vertices (Theorem 5): v ′ ∈ B 0 and v ′′ ∈ B p . As G ′is a k-path graph, it is an interval graph [12]. Let H be the set ofsimplicial vertices of G. By <strong>de</strong>finition, each w ∈ H, except v ′ andv ′′ , is adjacent to a k-clique B i , 1 ≤ i ≤ p − 1.Let us add a vertex v ∈ H to G ′ and suppose, by absurd, that vertexv with vertices u and w of G ′ form an asteroi<strong>da</strong>l triple. Vertex vis adjacent to some B i , 1 ≤ i ≤ p − 1. As B i = C i ∩C i+1 , B i is aminimal vertex separator of G ′ . The removal of B i separates G ′ intwo components. Two cases can happen:case 1) B i separates u and w. As B i is a minimal vertex separator,all paths linking u and w cannot avoid the neighborhood of v.case 2) After removing B i , u and w belong to the same connectedcomponent. Since u and w are not adjacent, they belong to differentmaximal cliques of G ′ . The clique-tree of a k-path graph is apath. As v is adjacent to B i , the vertex corresponding to the newmaximal clique C ′ can be inserted between cliques C i and C i+1 .Suppose, without loss of generality, that u ∈ C q and u /∈ C q+1 ,i < q. Suppose also that w ∈ C t , t > q. B q separates u and wand it belongs to the neighborhood of u. All paths between v andw cannot avoid B q . So, it is impossible to have an asteroi<strong>da</strong>l tripleand G is an interval graph.(2⇒3) Let T = ({Q 1 ,...,Q p },{(Q i ,Q i+1 )|1 ≤ i ≤ p − 1}) be aclique-tree of G such that T is a path. We know that simplicialvertices belong to just one maximal clique, and we know that in ak-tree at most one simplicial vertex belongs to a maximal clique.So, Q 1 = v ′ ∪ S 1 and Q p = v ′′ ∪ S p .The body P of G (and its clique-tree) is obtained by the removalof all simplicial vertices of G. This task will be performed in twosteps. Firstly, we remove all vertices of H − {v ′ ,v ′′ }, being H theset of simplicial vertices of G. Let v ∈ Q i , i ≠ 1, p, be a simplicialvertex and Q i = {v} ∪ S i . As |Q i ∩ Q i+1 | = |Q i ∩ Q i−1 | = k,then Q i−1 ∩ Q i+1 = S i . So, the maximal clique Q i does not existanymore and so the corresponding vertex of the clique-tree;(Q i−1 ,Q i+1 ) is a new edge in the clique-tree. Observe that S i isa minimal vertex separator (because it is an edge) of the cliquetreeof the remaining graph. After the removal of all vertices ofH − {v ′ ,v ′′ }, the remaining graph is a k-path graph.Secondly, we remove vertices v ′ and v ′′ . All minimal vertex separatorsof a k-path graph are distinct. So, after the removal of thesetwo vertices, the maximal cliques Q 1 and Q p do not belong to Pand the two minimal vertex separators S 1 and S p are not minimalvertex separators of P.(3⇒1) By Definition 9, P is subgraph of G; G is a k-tree, so Pis also a k-tree. As all simplicial vertices of G were removed, avertex of P belongs to at least one minimal vertex separator of G.Let v be a simplicial vertex of P. The minimal vertex separatorthat contains v in G is not a minimal vertex separator of P. In a k-tree, there are not adjacent simplicial vertices. So, as P has η − 2minimal vertex separators, P has exactly two simplicial verticesand P is a k-path graph.Let 〈B 0 ,C 1 ,B 1 ,...,C p ,B p 〉 be a maximal k-path of G. Observethat 〈B 1 ,C 2 , B 2 ,...,C p−1 ,B p−1 〉 is a maximal k-path of P and onlyB 1 and B p−1 are not minimal vertex separators of P. So, all simplicialvertices of G are adjacent to a k-clique B i , 1 ≤ i ≤ p − 1,i.e., G is a k-intercat.By <strong>de</strong>finition, we know already that ur-interval graphs are intervalgraphs; in [11], Pereira et al. proved that k-path graphs are alsointerval graphs. Recalling that an interval graph has a clique-treethat is a path, the following theorem shows that the k-path graphsactually satisfy the <strong>de</strong>finition of three important families.Theorem 9. A graph G is a k-tree, an interval graph and an urchor<strong>da</strong>lgraph if and only if it is a k-path graph.Proof:(⇒) By Theorem 6, a k-tree that is an ur-chor<strong>da</strong>l has all minimalvertex separators with multiplicity one. So, a simplicial vertex ofG is adjacent to exactly one minimal vertex separator B of G andB is not a minimal vertex separator of P. By Theorem 8 the bodyP of a k-tree that is an interval graph has η − 2 minimal vertexseparators. So G has exactly two simplicial vertices, i.e, G is ak-path graph.(⇐) By <strong>de</strong>finition a k-path graph is a k-tree and Pereira et al.proved that k-path graphs are interval graphs. Let 〈B 0 ,C 1 ,B 1 ,...,C p ,B p 〉 be a maximal k-path of G. Observe that B 1 ,B 2 ,...,B p−1are the η = n−k −1 minimal vertex separators of G. By Theorem6 G is an ur-chor<strong>da</strong>l graph.Figure 1 shows all results covered in this paper, showing the hierarchyof subfamilies. Note that an arrow indicates that a family issubfamily of its parent. If more than one arrow arrives at a no<strong>de</strong>,the family is the intersection of the parent families.ur-chor<strong>da</strong>lur-intervalchor<strong>da</strong>lintervalSC k-treek-path graphk-treek-caterpillark-intercatFigure 1: Relationship among k-trees, ur-chor<strong>da</strong>l and intervalgraphs.□□ALIO-EURO <strong>2011</strong> – 32


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>5. ACKNOWLEDGMENTThis work is supported by grant 305372/2009-2, CNPq, Brazil.6. REFERENCES[1] L.S. Chandran and F. Grandoni, "A linear time algorithmto list the minimal separators of chor<strong>da</strong>l graphs", DiscreteMath., vol.306, pp. 351-358, 2006.[2] P.S. Kumar and C.E.V. Madhavan, "Minimal vertex separatorsof chor<strong>da</strong>l graphs", Discrete Appl. Math., vol. 89, pp.155-168, 1998.[3] L. Markenzon and P.R.C. Pereira, "One-phase algorithm forthe <strong>de</strong>termination of minimal vertex separators of chor<strong>da</strong>lgraphs", Internat. Trans. in Oper. Res., vol. 17, pp. 683-690,2010.[4] J.R.S. Blair and B. Peyton, "An introduction to chor<strong>da</strong>lgraphs and clique trees", in Graph Theory and Sparse MatrixComputation, IMA vol. 56, 1993, pp. 1-29.[5] M.C. Golumbic, Algorithmic Graph Theory and PerfectGraphs, 2 nd edition, Aca<strong>de</strong>mic Press, New York, 2004.[6] A. Brandstädt, V.B. Le, and J. Spinrad, Graph Classes - aSurvey, SIAM Monographs in Discrete Mathematics and Applications,1999.[7] A. Proskurowski, "Separating subgraphs in k-trees: cablesand caterpillars", Discrete Math., vol.49, pp. 275-285, 1984.[8] P.S. Kumar and C.E.V. Madhavan, "Clique tree generalizationand new subclasses of chor<strong>da</strong>l graphs", Discrete Appl.Math., vol.117, pp. 109-131, 2002.[9] L. Markenzon, C.M. Justel, and N. Paciornik, "Subclassesof k-trees: characterization and recognition", Discrete Appl.Math., vol.154, pp. 818-825, 2006.[10] L.W. Beineke and R.E Pippert, "Properties and characterizationsof k-trees", Mathematika, vol.18, pp. 141-151, 1971.[11] D.J. Rose, "On simple characterizations of k-trees", DiscreteMath., vol. 7, pp. 317-322, 1974.[12] P.R.C. Pereira, L. Markenzon, and O. Vernet, "A cliquedifferenceencoding scheme for labelled k-path graphs", DiscreteAppl. Math., vol.156, pp. 3216-3222, 2008.ALIO-EURO <strong>2011</strong> – 33


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Efficient Algorithms for Regionalization: an Approach Based on Graph PartitionGustavo Silva Semaan ∗ José André <strong>de</strong> Moura Brito † Luiz Satoru Ochi ∗∗ Instituto <strong>de</strong> Computação - Universi<strong>da</strong><strong>de</strong> Fe<strong>de</strong>ral Fluminense, IC-UFFRua Passo <strong>da</strong> Pátria 156 - Bloco E - 3 o an<strong>da</strong>r, São Domingos, CEP: 24210-240, Niterói, RJ, Brasil{gsemaan, satoru}@ic.uff.br† Escola Nacional <strong>de</strong> Ciências Estatísticas - Instituto Brasileiro <strong>de</strong> Geografia e Estatística, ENCE-IBGERua André Cavalcanti 106, sala 403, CEP: 20231-50, Rio <strong>de</strong> Janeiro, RJ, Brasiljose.m.brito@ibge.gov.brABSTRACTThis paper proposes new approaches based on the GRASP andEvolutionary algorithms for the resolution of a specific regionalizationproblem. This problem can be mapped on a capacity andconnectivity graph partition problem. A review of literature showingthat the algorithms work only with the edges of the MinimumSpanning Tree is presented. In this case, the algorithms act on theoriginal graph, in or<strong>de</strong>r to increase the possibilities of vertex migration.Results obtained from the application of such algorithmsover a set of real <strong>da</strong>ta suggested that the use of original graphsthrough them is a new efficient way to solve this problem.Keywords: Graph Partition Problem, Clustering, Regionalization,Metaheuristics1. INTRODUCTIONAccording to [1, 2], regionalization is a clustering procedure appliedto spatial objects with a geographic representation, whichgroups them into homogeneous contiguous regions and ClusterAnalysis is a multivariate technique used to group objects togetherbased on a selected similarity measure, in such way that objects inthe same cluster are very similar and objects in different clustersare quite distinct [3].Consi<strong>de</strong>ring a given set with n objects X = {x 1 ,..,x n } , it must extractpartitions from the set X in k different clusters C i , respectingthe following three conditions:k⋃C i = Xi=1C i ≠ /0,1 ≤ i ≤ kC i ∩C j = /0,1 ≤ i, j ≤ k,i ≠ jThe cluster analysis is a fun<strong>da</strong>mental technique to experimentalsciences in which the classification of elements into groups is <strong>de</strong>sirable. As examples of these fields it is possible to cite: biology,medicine, economy, psychology, marketing, statistic among others[4].2. GRAPH PARTITION PROBLEMSeveral clustering problems can be mapped on graph partition problems.Thisconsists in grouping the vertexes of the graphs in differentsubsets (clusters), according to their similarities, by using afitness function [1, 5, 6]. Moreover, this regionalization problemconsi<strong>de</strong>rs the following restrictions:• Connectivity: the vertexes grouped in each cluster must beconnected.• Minimum Capacity: associated total to one of the variablesmust be higher than minimum capacity submitted as parameter.The high combinatorial possibilities of the clustering problemssuggests the use of metaheuristic algorithms [7]. This algorithmcan reach a typical optimal solution which is very close to globalsolution, in some cases the global optimal, in a reasonable amountof time. So, papers about clustering problems, including graphpartition problem that consi<strong>de</strong>r additional restrictions such as connectivityand capacity had been wi<strong>de</strong>ly reported in literature.Some Groups [8, 9] had proposed heuristics algorithms for the capacityclustering problem, while others [1, 2, 10] had suggeste<strong>da</strong>lgorithms for the regionalization problem, in which the connectivityrestriction was consi<strong>de</strong>red (Automatic Zoning Procedure -AZP and the Spatial ’K’luster Analysis by Tree Edge Removal -SKATER).The problem presented in this paper consi<strong>de</strong>rs both connectivityand capacity restrictions into partition graph problem. It is importantto un<strong>de</strong>rline that, excepting the AZP, the other works referencedthat consi<strong>de</strong>red the connectivity restriction were based onMinimum Spanning Tree (MST) Partition Method. This method iscomposed by two steps:1. Construction of a MST from the graph which represents theproblem.2. Formation of sets of clusters through of partitioning of MST.According to the connectivity restriction, a natural solution for theproblem will consist of building a MST T from G, respecting thesmaller values of d i j (1).d i j =√ p∑(xi s − xs j )2 (1)s=1In this way, these areas are geographically immediate neighbors,and homogeneity, regarding a set of p variables associated to populationaland environmental known characteristics. These variables,which will be represented by x s , s = {1,..,p}, are also called indicators(associated variables to each vertex).Consi<strong>de</strong>ring these indicators and using the distances d i j between iand j neighbors vertexes are calculated. The distances d i j representthe homogeneity <strong>de</strong>gree, i.e., the proximity among values from pvariables associated to all vertexes to be aggregated.Once provi<strong>de</strong>d one tree T and a number k of partitions (clusterto be generated), it is possible to extract (k − 1) edges from T,ALIO-EURO <strong>2011</strong> – 34


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong><strong>de</strong>fining, this way, a set of K subtrees T j , j={1,.., k}. Each one ofthese subtrees will be associated to one cluster.The connectivity property can be observed in each of the subtrees(clusters). Thus, the solution for the problem will consist of partitioningT in k subtrees T j , j={1,.., k} associated to cluster whatsatisfies the capacity restriction and results in the lower possiblevalue for a fitness function(2).p nf (T ) = ∑ ∑ (x i j − x j ) 2 (2)j=1 i=1The case of AZP was based on the spatial object neighbor structureto assure the connectivity restriction and acts, basically, on themigration of the objects in or<strong>de</strong>r to minimize a fitness solution.Figure 1: Adjacency relations between objects [1].According to 1, follows the <strong>de</strong>scriptions of the items: (1) connectivitygraph, (2) minimum Spanning Tree and (3) an example ofsolution.3. PROPOSED ALGORITHMSReview of literature showed that the proposed algorithms workonly on the edges of MST. In or<strong>de</strong>r to increase the possibilitiesof vertex migration this work presents new heuristic algorithmsthat act with the original submitted graph of the problem. Thisproposal enables and facilitates the formation of not only feasible,which the restriction of capacity is respected, but also betterquality solutions.According to [6], a good <strong>da</strong>ta structure for the problem is extremelyimportant to the algorithms performance and it can be <strong>de</strong>cisivefor a fast convergence and quality of the obtained solutions.The group-number structure was used to representation of the solution,where the in<strong>de</strong>x of vector represents the vertex of the graphand its content represents the cluster to which the vertex belongs(also used by [5, 6, 11] ).The proposed approach consists in creating solutions using theMST Partition Method through the constructive heuristics, and so,refining its using local search procedures. It was used versions oflocal search that consi<strong>de</strong>r the original graph, and not only the MSTbuilt.3.1. Constructive HeuristicsTwo versions of constructive heuristics were proposed, assuringthe connectivity restriction through MST Partition Method, bothconsi<strong>de</strong>ring the concepts of GRASP Metaheuristic (Greedy RandomizedA<strong>da</strong>ptive Search Procedures [12]).While a first version worked aiming to build feasible solutions,which the restriction of capacity is respected, the second versionacted in or<strong>de</strong>r to minimize the fitness solution, in<strong>de</strong>pen<strong>de</strong>ntly ofthe restriction of capacity.Both versions act to generate k partitions, removing (k − 1) edgesfrom T, since the hierarchical division strategy was used and, initially,all the vertexes belong to the same cluster.The Constructive Heuristic 1 (CH1) was proposed by [11] and consistsin, after the selection of the cluster (associated with a subtreeT i ) that must be partitioned (what have the high fitness function),to evaluate all the possibilities of edge removal in or<strong>de</strong>r to minimizethe fitness function. This way, must be removed the edge ofhigh value of (3) of the subtree T i , generation two new subtrees T 1iand T 2i . C edge = f (T i ) − ( f (T 1i ) + f (T i 2 )) (3)Although it is a greedy procedure which has an expensive computationalcost, it was applied on the building of the initial solutionfor the proposed algorithm. In or<strong>de</strong>r to make this algorithm semigreedy,it was used a Restricted Candi<strong>da</strong>te List (RCL), which theα high edges (according C edge value) are selected and, one of themis randomly selected, aiming to divi<strong>de</strong> the selected cluster.The Constructive Heuristic 2 (CH2) was based on the CH1 but, inthis version, intending to obtain valid solutions. In this case, theselection of the cluster that must be partitioned occurs by capacitycriteria, in which the cluster with higher capacity must be selected.Moreover, the algorithm is also semi-greedy and a RCL was used.In or<strong>de</strong>r to build valid solutions, the CH2 acts dividing the selectedcluster C w (subtree T w ) in the clusters C w1 and C w2 and, afterwards,one of them must have its capacity minimized and the capacitycriteria respected.3.2. Local Search ProceduresSix versions of Local Search (LS) were used consi<strong>de</strong>ring:• MST: only the edges of the MST built.• Original Graph: all edges from the original submitted graph.• Feasible Solutions: construction of valid solutions.• Better Solutions: to minimize the fitness solution, in<strong>de</strong>pen<strong>de</strong>ntof the restriction of capacity.Table 1 ilustrates the distributions of the Local Search versionsamong the consi<strong>de</strong>ring properties.Property LS1 LS2 LS3 LS4 LS5 LS6MST x x xOriginal Graph x x xFeasible Solutions x x xBetter Solutions x x xTable 1: Properties by Local Search versions.Descriptions of the Local Search versions:• LS1: uses the edges that were selected during the clusterpartition. Basically, the procedure verifies if one and onlyone cluster associated to vertexes of the edge is penalized(if it has capacity less than the minimum capacity). In thiscase, the vertex is migrated to this cluster, aiming to regeneratethe solution.• LS2: realizes migrations of vertexes based on the originalsubmitted graph of the problem, aiming to regenerate theinfeasible solutions.• LS3: realizes migrations of vertex based on the original submittedgraph of the problem aiming to minimize the fitness’solution.ALIO-EURO <strong>2011</strong> – 35


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>• LS4 and LS5: work joining adjacent clusters in which existsan edge connecting vertexes of this clusters, and after,dividing this cluster using, respectively, the CH1 and CH2procedures.• LS6: was based on the known clustering algorithm of theliterature, the K-Means [13, 4] but, in this case, the restrictionsof this problem were consi<strong>de</strong>red.3.3. Additional Comments about the ImplementationThis paper proposes Evolutionary Algorithms (EA)[12] that bringtogether the construtives and local search procedures. It followsthe other implemented techniques:• Crossover: the vertexes migration occur by the 1-point typecrossover operator. It was necessary to verify if the newsolutions have k clusters and if the clusters are connected.• Mutation: it was used random vertex migration, aiming toperturb the solution.• Elitism: The best found solutions are saved and inserted tothe next population in or<strong>de</strong>r to improve quality by using theothers procedures.• Minimum Capacity: Total associated to one of the variables.This value can be either submitted as parameter or calculate<strong>da</strong>t the begin of the algorithm, which β is the fit factor,k a number of clusters, n a number of vertexes and x s i thevariable s associate with the vertex i (4).nCap Min = (β/k). ∑ xi s (4)i=1In the experiments, were consi<strong>de</strong>red only two versions of EA:• EAOG: Evolutionary Algorithm that consi<strong>de</strong>r the originalsubmitted graph. It was used: LS2, LS3, LS6, Elitism, CH1or CH2.• EAMST: Evolutionary Algorithm that consi<strong>de</strong>r only the edgesof the MST. It was used: LS1, LS4, LS5, Crossover, Mutation,Elitism, CH1 or CH2.4. COMPUTATIONAL RESULTSA real set of twenty six instances from Brazilian DemographicCensus (<strong>da</strong>ta for public use) was used for the experiments. Moreover,the algorithms presented were co<strong>de</strong>d in Ansi C, running on aIntel Centrino II 2,4 GHz processor and 4GB RAM.Table 2 presents properties of the used instances, where each vertexis a weigthed area. A weighted area is a small geographicalarea formed by a mutually exclusive enumeration areas (clusterof census segments), which comprise, each one of them, a set ofrecords of households and people. And the associated variablesare: total of houses, total of domiciles, total of person, sum ofsalaries, sum of time of instruction or study, sum of salary percapita,average time of instruction or study of the responsible.Aiming to calibrate the parameters, several preliminary experimentswere run based on the selected set of instances. The obtainedparameters were: k=3 (clusters), PopulationSize=10 solutions,StopCriteria=100 generations, Crossover =80%, Mutation=5%and α=5. The crossover and mutation have a high probability sinceits execution is evaluate in or<strong>de</strong>r to form only feasible solutions.Although real applications can <strong>de</strong>fine the Minimum Capacity foreach instance, in this experiment was fixed β = 30%.Id |Vertex| |Edge| Id |Vertex| |Edge|1 21 58 14 178 7912 61 286 15 121 5673 409 2020 16 75 3594 73 350 17 114 5025 14 46 18 133 6206 18 59 19 195 8687 89 363 20 68 3078 16 60 21 181 8439 57 236 22 151 56010 375 1769 23 86 38811 179 882 24 155 72212 74 357 25 461 238513 231 1172 26 285 1451Table 2: Real instances of Brazilian Demographic Census.In the experiment, each algorithm was executed over the same instancetwenty times. The elapsed time and the gap associated withthe best known result of the each instance were obtained.The tables 3 and 4 present, respectively, the best of this resultsby EA version for each instance and some statistics about this experiment.The EAOG obtained best results for all the instances,however, its average of elapsed time was higher then EAMST versions.Gap(AEGO,EAMST ) = 100 ∗ | f AEGO − f AEMST |f AEGO(5)Id Gap Id Gap Id Gap1 26.97 10 43.33 19 54.082 7.1 11 61.85 20 16.443 5.82 12 40.31 21 41.864 20.3 13 51.23 22 39.055 11.71 14 91.09 23 60.66 3.97 15 65.76 24 48.967 78.84 16 35.38 25 26.468 17.44 17 56.49 26 56.259 59.59 18 84.02Table 3: Gap between EAOG and EAMST.Average Time EAOG 269 secondsEAMST 133 secondsGap (EAOG, EAMST) Min 3.97%Max 91.09%Mean 42.49%Median 42.59%Gap [Best Known reference] EAOG 4.00%EAMST 51.00%Table 4: Statistics.In or<strong>de</strong>r to analyze the results, three categories were created accordingto the Gap values of the best solution known: Best (Gap =0%), Interesting (Gap ≤ 5%) and Bad (Gap > 70%).The table 5 presents the results by categories.Since the AEOG reached best results but its elapsed time washigher than of AEMST, both algorithms were submitted to a newexperiment. They were run one hundred times, over three amongthe bigger selected instances and, in this experiment, the StopCriteriawas a maximum time (300 seconds) or the solution reach thetarget value, submitted as parameters.ALIO-EURO <strong>2011</strong> – 36


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Categories EAOG EAMSTBest 40% 12%Interesting 60% 17%Bad 0% 29%Table 5: Results by categories.In this experiment all the AEOG executions reached the target,while the AEMST had probabilities of 52%, 55% and 38% for theinstances 4, 13 and 22, respectively.Despite the algorithm had obeyed the stipulated processing time,the EAMST continued limited in best local solutions, while theEAOG obtained new different solutions, that could not be formedthrough the only MST method. Moreover, the AEOG reached thetarget of the instances 4, 13 and 22 at 40, 10 and 10 seconds, respectively.5. CONCLUSIONSIn this paper two versions of constructive heuristics were proposed,both consi<strong>de</strong>ring the concepts of GRASP Metaheuristic. Afterwards,six local search procedures were used aiming to refine thesolutions, in or<strong>de</strong>r to increase <strong>de</strong> solutions’ quality or regenerateinfeasible solutions.Two Evolutionary Algorithms were presented, bring together theconstrutives and local search procedures: The EAOG (based onthe Original Graphs) and EAMST (based only on edges of MST).It was possible to confirm that the procedures that acted with theoriginal submitted graph increase the possibilities of vertex migrationand thus facilitated the formation of both valid as betterquality solutions.The computational results showed that the use of Constructive Heuristicsthat consi<strong>de</strong>r only edges of MST together a local search proceduresand the use of Original Graphs are an interesting alternativeto solve this problem, improving both the solution’s quality as thequantity of formation of valid solutions.These results indicate that the proposed heuristics are an efficientway to solve this problem. Besi<strong>de</strong>s, as another ways to solve itwe can cite: the use of Pathrelinking in or<strong>de</strong>r to integrate intensificationand diversification in search for new best solutions [12]; to<strong>de</strong>velope and analyze the use of other metaheuristics, such as: IteratedLocal Search (ILS), Variable Neighborhood Search (VNS),Tabu Search or a hybrid heuristic version [12].6. ACKNOWLEDGMENTSTo all the teachers and stu<strong>de</strong>nts of the Computer Institute at UFF( http://www.ic.uff.br ) and CAPES ( http://www.capes.gov.br) for the financial support.7. REFERENCES[1] R. M. Assunção, M. C. Neves , G. Câmara, C. Freitas,“Efficient regionalization techniques for socio-economic geographicalunits using minimum spanning trees,” InternationalJournal of Geographical Information Science, vol. 20,no. 7, pp. 797–811, 2006.[2] M.J. Smith, M. F. Goodchild, P. A. Longley, GeospatialAnalysis : a Comprehensive Gui<strong>de</strong> to Principles, Techniquesand Software Tools. Troubadour Publishing Limited, 2009.[3] J. Han and M. Kamber, Data Mining: Concepts and Techniques.Morgan Kaufmann, 2006.[4] H. C. Romesburg, Cluster Analysis for Researchers. LuluPress, 2004.[5] C. R. Dias, L. S. Ochi, “Efficient evolutionary algorithms forthe clustering problems in directed graphs,” in Proc. of theIEEE Congress on Evolutionary Computation (IEEE-CEC),Canberra, Austrália, 2003, pp. 983–988.[6] D. Doval, S. Mancoridis, B. S. Mitchell, “Automatic clusteringof software systems using a genetic algorithm,” in Proc.of the Int. Conf. on Software Tools and Engineering Practice,Pittsburgh, USA, 1999, pp. 73–81.[7] P. Hansen, B. Jaumard, “Cluster analysis and mathematicalprogramming,” Mathematical Programming, vol. 79, pp.191–215, 1997.[8] S. W. Scheuerer, “A scatter search heuristic for the capacitatedclustering problem,” European Journal of OperationalResearch, vol. 169, 2006.[9] H. M. Shieh, M. D. May, “Solving the capacitated clusteringproblem with genetic algorithms,” Journal of the Chinese Instituteof Industrial Engineers, vol. 18, 2001.[10] R. M. Assuncao, J. P. Lage, A. E. Reis, “Analise <strong>de</strong> conglomeradosespaciais via arvore geradora minima,” RevistaBrasileira <strong>de</strong> Estatística, 2002.[11] G. S. Semaan, L. S. Ochi, J. A. M. Brito, “An efficient evolutionaryalgorithm for the aggregated weighting areas problem,”in International Conference on Engineering Optimization,2008.[12] F. Glover, Handbook of Metaheuristics. Kluwer Aca<strong>de</strong>micPublishers, 2003.[13] J. MacQueen, “Some methods for classification and analysisof multivariate observations,” in <strong>Proceedings</strong> of 5th BerkeleySymposium on Mathematical Statistics and Probability,1967.ALIO-EURO <strong>2011</strong> – 37


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Lagrangean based algorithms for the Weight-Constrained Minimum SpanningTree ProblemCristina Requejo ∗ Eulália Santos ∗ †∗ Department of Mathematics, University of Aveiro3810-193 Aveiro, Portugalcrequejo@ua.pt† School of Technology and Management, Polytechnic Institute of Leiria2411-901 Leiria, Portugaleulalia.santos@ipleiria.ptABSTRACTThe Weight-Constrained Minimum Spanning Tree problem (WMST)is a NP-hard combinatorial optimization problem having importantapplications in the telecommunication networks <strong>de</strong>sign andcommunication networks. We use simple but effective Lagrangeanbased algorithms to compute lower and upper bounds. Computationalresults show that the algorithms are fast and present smallgap values.Keywords: Weight-constraints, Constrained minimum spanningtree, Lagrangean relaxation, Heuristics1. INTRODUCTIONIn this work we discuss Lagrangean based algorithms for the Weight-Constrained Minimum Spanning Tree problem (WMST).Consi<strong>de</strong>r an undirected complete graph G = (V,E), with no<strong>de</strong> setV = {0,1,...,n − 1} and edge set E = {{i, j}, i, j ∈ V,i ≠ j}. Associatedwith each edge e = {i, j} ∈ E consi<strong>de</strong>r nonnegative integercosts c e and nonnegative integer weights w e . The WeightMinimum Spanning Tree problem (WMST) is to find a spanningtree T = (V T ,E T ) in G ( V T ⊆ V and E T ⊆ E) of minimum costC(T ) = ∑ e∈ET c e and with total weight W(T ) = ∑ e∈ET w e not exceedinga given limit W. This combinatorial optimization problemis NP-hard [1, 2].The WMST is known un<strong>de</strong>r several different names. It was firstmentioned in Aggarwal, Aneja and Nair [1], un<strong>de</strong>r another name,the MST problem subject to a si<strong>de</strong> constraint. In this paper the authorspropose an exact algorithm to solve the problem that uses aLagrangian relaxation to approximate a solution combined with abranch and bound strategy. This kind of solution approach can alsobe found in the work of Shogan [3]. The paper of Ravi and Goemans[4] <strong>de</strong>scribes an approximate scheme. In [5] Xue presents asimple but efficient primal-dual algorithm to find approximate solutions.Another approach to solve the problem is given in Hong,Chung and Park [6] where the authors propose a fully polynomialbicriteria approximation scheme. Hassin and Levin [7] adopt thei<strong>de</strong>as in [4] and add to them an application of a matroid intersectionalgorithm. Yama<strong>da</strong>, Watanabe and Kataoka [2] consi<strong>de</strong>r aweight-constrained maximum spanning tree problem. They provethe problem is NP-hard, use a local search heuristic to obtain upperbounds, a Lagrangian relaxation to obtain lower bounds, usea branch-and-bound algorithm to solve the problem and propose amethod to accelerate the computation. The authors refer that theresults can be easily applied to the minimization case. Henn [8]presents a compilation of results and existing algorithms to solvethe problem.A related approach is to inclu<strong>de</strong> the weight of the tree as a secondobjective instead of a hard constraint. The resulting problem is thebi-objective spanning tree problem ( [9, 10, 11, 12, 13, 14, 15],among many others).The WMST appears in several real applications and the weight restrictionsare mainly concerned with a limited budget on installation/upgradingcosts. A general application is related with the upgra<strong>de</strong>and <strong>de</strong>sign of physical systems, somehow connected througha minimum spanning tree, when there is a budget restriction. Onesuch application arises in the areas of communication networksand network <strong>de</strong>sign, in which information is broadcast over a minimumspanning tree. There are several problems that consi<strong>de</strong>r the<strong>de</strong>sign of the enhancement of the performance of an un<strong>de</strong>rlyingnetwork by carrying out upgra<strong>de</strong>s at certain no<strong>de</strong>s and/or edgesof the network. Upgrading a no<strong>de</strong> corresponds to installing fasterswitching equipment at that no<strong>de</strong>. Such upgra<strong>de</strong> reduces the communication<strong>de</strong>lay along each edge emanating from the no<strong>de</strong>. Similarly,upgrading an edge corresponds to replacing an existing linkwith a new type of link. Moreover, costs/profits is not the onlymeaning for the weights. Edge weights may represent the <strong>de</strong>layof an edge or the logarithm of the reciprocal of the reliability ofan edge [5]. Another example (see [8, 16]) arising in communicationnetworks problems, is the minimum cost reliability constrainedspanning tree. In this application we are given a set ofno<strong>de</strong>s in the plane that can communicate with each other. The objectiveis to connect the no<strong>de</strong>s. The cost of a connection mightbe mo<strong>de</strong>led by the distance of the no<strong>de</strong>s and the reliability of aconnection by its fault probability. We now want to compute aminimum cost connection (spanning tree) such that its total faultprobability is beyond a given limit. The interest from the telecommunicationscommunity arises from the great <strong>de</strong>al of emphasis onthe need to <strong>de</strong>sign communication protocols that <strong>de</strong>liver certainperformance guarantees. This need is the result of an explosivegrowth in high bandwidth real time applications that require <strong>de</strong>mandingQoS (Quality of Service) guarantees. It is for this reasonthat the WMST has assumed great importance in telecommunicationsnetwork applications.There are several studies of Lagrangean based approximation algorithmseither to general constrained combinatorial optimizationproblems, cf. [17], or to weight/resource constrained shortest pathproblems, cf. [18, 19]. The WMST has received only brief referencesand computational results are almost non existing. Wewill <strong>de</strong>scribe Lagrangean based algorithms to the WMST and obtaincomputational results. To present the Lagrangean relaxationto the WMST in Section 4, we <strong>de</strong>scribe a general formulation tothe problem in Section 2. We discuss some properties of the prob-ALIO-EURO <strong>2011</strong> – 38


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>lem in Section 3 and a solution procedure in Section 5. We presentexisting settings and propose a different setting to obtain approximatetrees in the solution procedure. Computational results toassess the quality of the discussed procedures will be shown inSection 6.2. A FORMULATION FOR THE WMSTSeveral formulations are well known for the MST (see Magnantiand Wolsey [20]). In [21] natural and exten<strong>de</strong>d formulations forthe WMST are discussed. To obtain formulations to the WMSTone can easily a<strong>da</strong>pt a MST formulation.It is well known (see Magnanti and Wolsey [20]) that orientedformulations (based on the un<strong>de</strong>rlying directed graph) leads, ingeneral, to tighter formulations (formulations whose lower boundsprovi<strong>de</strong>d by the linear relaxations are closer to the optimum values).Thus, henceforward we consi<strong>de</strong>r the corresponding directedgraph, with root no<strong>de</strong> 0, where each edge e = {0, j} ∈ E is replacedwith arc (0, j) and each edge e = {i, j} ∈ E,i ≠ 0, is replaced withtwo arcs, arc (i, j) and arc ( j,i), yielding arc set A = {(i, j), i ∈V \ {0}, j ∈ V,i ≠ j}. These arcs inherit the cost and weight of theancestor edge.Henceforward, let P L be the linear programming relaxation of formulationP and let ϑ(P) be the optimal value of P.Consi<strong>de</strong>r the original variables, the binary variables x i j (for all(i, j) ∈ A) indicating whether arc (i, j) is in the MST solution [20].Two classical formulations on the space of the original variablesfor the MST can be consi<strong>de</strong>red. In or<strong>de</strong>r to ensure the connectivityof the feasible solutions and to prevent the existence of circuits inthe feasible solutions, one formulation uses the cut-set inequalitiesand the other formulation uses circuit elimination inequalities.The linear relaxation of both mo<strong>de</strong>ls provi<strong>de</strong> the same bound [20].However the number of inequalities in both sets increase exponentiallywith the size of the mo<strong>de</strong>l. It is well known that in or<strong>de</strong>r toensure connectivity/prevent circuits, instead of using one of thosefamilies with an exponential number of inequalities, one can usecompact exten<strong>de</strong>d formulations. The well-known MulticommodityFlow formulation (MF) using the additional flow variables canbe consi<strong>de</strong>red. In this formulation the connectivity of the solutionis ensured through the flow conservation constraints together withthe connecting constraints [20]. These three formulations for theMST are easily a<strong>da</strong>pted for the WMST through the inclusion ofa weight constraint. Therefore a formulation to the WMST is asfollows.(WMST ) min ∑ c i j x i j(i, j)∈As.t. x ∈ (MST ) (1)∑(i, j)∈Aw i j x i j ≤ W. (2)Where x = (x i j ) ∈ R |A| and (MST ) represents a set of inequalities<strong>de</strong>scribing the convex hull of the (integer) solutions of the MSTand can use one of the sets of inequalities referred previously (thecircuit elimination inequalities, the cut-set inequalities, the flowconservation constraints together with the connecting constraints)plus the following constraints∑ x i j = 1 j ∈ V (3)i∈Vx i j ∈ {0,1} (i, j) ∈ A. (4)Constraint (2) is the weight constraint and we emphasize that theabove formulation without the weight constraint is a formulationfor the MST [20].If the inci<strong>de</strong>nce vector x = (x i j ) ∈ R |A| represents an (integer) MSTsolution, and subgraph T = (V,A T ), A T ⊆ A, of G = (V,A) thecorresponding tree, then C(T ) = ∑ (i, j)∈A c i j x i j = ∑ (i, j)∈AT c i j andW(T ) = ∑ (i, j)∈A w i j x i j = ∑ (i, j)∈AT w i j . Furthermore, if we <strong>de</strong>finea matrix of non-negative profits p i j associated to each arc (i, j) ∈A, then we use P(T ) = ∑ (i, j)∈A p i j x i j = ∑ (i, j)∈AT p i j .3. SOME PROPERTIES OF THE WMSTThe well know Minimum Spanning Tree problem (MST) is to fin<strong>da</strong> spanning tree T c = (V,A Tc ), A Tc ⊆ A, on G = (V,A) of minimumcost C(T c ) = ∑ (i, j)∈ATc c i j and for this combinatorial optimizationproblem there are several polynomial algorithms such asSollin’s, Kruskal’s and Prim’s algorithm (see [22] for <strong>de</strong>scriptionsof these algorithms). An additional constraint to the MST suchas the one we use (the total tree weight W(T c ) = ∑ (i, j)∈ATc w i jmust not exceed a given limit W) turns the MST into a NP-hardproblem [1]. Consi<strong>de</strong>r a companion problem to the WMST, theMinimum-weight Spanning Tree problem that is to find a spanningtree T w = (V,A Tw ), A Tw ⊆ A, on G = (V,A) of minimum weightW(T w ) = ∑ (i, j)∈ATw w i j .T c and T w are two spanning trees of G, T c of minimum cost andT w of minimum weight. Moreover, these trees give us upper andlower bounds on the optimal value of the problemC(T c ) ≤ ϑ(WMST ) ≤ C(T w )and we can assume the following proposition.Proposition 1. There exists an optimal solution for the WMST ifand only ifW(T w ) ≤ W ≤ W(T c ).Clearly, if W(T w ) > W, then the WMST has no solution. Furthermore,we have the following.Proposition 2. If W(T c ) ≤ W, then T c is an optimal solution forthe WMST.Consi<strong>de</strong>r another companion problem to the WMST. Define somenon-negative profits p i j associated to each arc (i, j) ∈ A which arelinear combination of the cost and weight associated to each arc,p i j = aw i j +bc i j with real scalars a,b. The Minimum-profit SpanningTree problem that is to find a spanning tree T p = (V,A Tp ),A Tp ⊆ A, on G of minimum profit P(T p ) = ∑ (i, j)∈ATp p i j . If a = 0and b = 1 then we have T p ≡ T c . If a = 1 and b = 0 then we haveT p ≡ T w .4. LAGRANGEAN RELAXATIONIn or<strong>de</strong>r to <strong>de</strong>rive a Lagrangean relaxation attach the Lagrangeanmultiplier λ to the weight constraint (2) and dualize the constraintin the usual Lagrangean way. This leads to the following relaxedproblem.(WMST λ ) − λW + min ∑ (c i j + λw i j )x i j(i, j)∈As.t. x = ∈ (MST )For every non-negative multiplier λ, the tree solutions to this relaxedproblem give us lower bounds on the optimum value, i.e.ϑ(WMST λ ) ≤ ϑ(WMST ).ALIO-EURO <strong>2011</strong> – 39


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>For a given non-negative value of the Lagrangean multiplier λ,the relaxed problem WMST λ can be solved using any well knownpolynomial algorithm to solve the MST [22]. Moreover, if for eachmultiplier λ we <strong>de</strong>fine the profits p λ i j = c i j + λw i j , thenϑ(WMST λ ) = −λW + P(T p λ ).Classically a Lagrangean relaxation is solved using a subgradientoptimization procedure [23]. The subgradient optimizationprocedure starts by initializing the Lagrangean multipliers. After,iteratively, solves the relaxed problem WMST λk, then actualizesthe Lagrangean multiplier λ k by setting, at each iteration k ,λ k+1 = max{0,λ k + s k d k } using a direction d k and a step-size s k ,and finally verifies some stopping criteria.An appropriate choice for the step size s k produces a convergentmethod. We can use [23]s k = ρ C(T w) − ϑ(WMST λk)(∑ (i, j)∈A w i j xi k j −W)d = ρ C(T w) − P(T p λ k) + λ k Wk (W(T p λ k) −W)d kwith 0 < ρ < 2 and using the upper bound C(T w ) to approximatethe optimum value of the problem. Observe that for the tree solutionx k = (x k i j ) of the Lagrangean relaxed problem WMST λ k, correspondingto T p λ k, we have ϑ(WMST λk) = −λ k W + P(T p λ k) andW(T p λ k) = ∑ (i, j)∈A w i j x k i j .5. SOLUTION PROCEDUREIn or<strong>de</strong>r to obtain an approximate solution to the WMST we proposethe following general algorithm.AlgorithmStep 1 Obtain an upper bound.Find a spanning tree T w = (V,A Tw ), A Tw ⊆ A, on G of minimumweight W(T w ) = ∑ (i, j)∈ATw w i j .If W(T w ) > W, then there is no solution. STOP. Otherwise,set T α = T w .Step 2 Obtain a lower bound.Find a spanning tree T c = (V,A Tc ), A Tc ⊆ A, on G of minimumcost C(T c ) = ∑ (i, j)∈ATc c i j .If W(T c ) ≤ W, then T c is an optimal solution. STOP. Otherwise,set T β = T c .Step 3 Compute an approximate tree.Compute profits p i j for every (i, j) ∈ A.Find a spanning tree T p = (V,A Tp ), A Tp ⊆ A, on G of minimumvalue P(T p ) = ∑ (i, j)∈ATp p i j .Compute P(T p ), W(T p ) and C(T p ).Step 4 Stopping criteria.If W(T p ) ≤ W then up<strong>da</strong>te upper bound, i.e. if C(T p ) C(T β ) replace T β by T p .If |P(T α ) − P(T p )| ≤ tol, thenT α is the approximate solution, STOP.Go To Step 3.The subgradient optimization scheme perfectly fits this algorithmlayout. Now we will discuss settings for the non-negative profitsp i j = aw i j + bc i j , with real scalars a,b, associated to each arc(i, j) ∈ A and their up<strong>da</strong>te at each iteration. We will consi<strong>de</strong>r settingsfor the profits p i j characterized by associating a parameter,the Lagrangean multiplier, to the weights, a = λ k , and a parameterwith value equal to one to the costs, b = 1. Two examples of suchsettings will be given next.Jüttner et al. [19] built up the Lagrangian Relaxation Based AggregatedCost (LARAC) algorithm which solves the Lagrangianrelaxation of the constrained shortest path (CSP) problem. In [24]the equivalence of the LARAC algorithm and other algorithms in[17, 18, 19] is shown. Using the i<strong>de</strong>as of these algorithms, the firstsetting is a = λ k = C(T α) −C(T β )W(T β ) −W(T α ) .If the Held, Wolfe and Crow<strong>de</strong>r [25] direction is to be consi<strong>de</strong>redd k = ∑ (i, j)∈A w i j xi k j −W = W(T p λ k)−W, leading to the second settinga = λ k = max{0,λ k−1 + ρ C(T w) − P(T pλ k−1) + λ k−1 W}W(T pλ k−1) −Wand initializing λ 0 = C(T w)−C(T c )W(T c )−W .6. COMPUTATIONAL RESULTSComputational results will assess the quality of the approximatesolutions obtained with each setting of the profits.At the moment we present some computational results of the approximationalgorithms for instances to the weight-constrained minimumspanning tree problem on complete graphs and between150 and 300 no<strong>de</strong>s. Costs and weights are generated based onEucli<strong>de</strong>an distances combined with Pisinger’s [26] instances andW = W(T c)+W(T w )2 .|V | W(Tw) W W(Tc) C(Tc) C(Tw) C(Tp)150 824 4197 7570 781 7529 1114200 866 5890 10914 890 10557 1154250 958 6921 12884 1004 12925 1361300 1080 8281 15481 1082 14588 1470Table 1: Computational results.Preliminary computational results show that the algorithms are fastand present small gap values. For the instances in Table 1 thebound obtained is equal for both profits settings and its value isshown in the last column.An extensive computational experience is performed to completethis section.7. ACKNOWLEDGEMENTSThe research of the authors was supported by Center for Researchand Development in Mathematics and Applications (CIDMA) bothfrom the Portuguese Foun<strong>da</strong>tion for Science and Technology (FCT),cofinanced by the European Community Fund FEDER/POCI 2010.8. REFERENCES[1] V. Aggarwal, Y. P. Aneja, and K. P. K. Nair, “Minimal spanningtree subject to a si<strong>de</strong> constraint,” Computers and OperationsResearch, vol. 9, pp. 287–296, 1982.[2] T. Yama<strong>da</strong>, K. Watanabe, and S. Kataoka, “Algorithmsto solve the knapsack constrained maximum spanning treeproblem,” International Journal of Computer Mathematics,vol. 82, pp. 23–34, 2005.[3] A. Shogan, “Constructing a minimal-cost spanning tree subjectto resource constraints and flow requirements,” Networks,vol. 13, pp. 169–190, 1983.ALIO-EURO <strong>2011</strong> – 40


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>[4] R. Ravi and M. Goemans, “The constrained minimum spanningtree problem,” in <strong>Proceedings</strong> of the ScandinavianWorkshop on Algorithmic Theory, ser. Lecture Notes in ComputerScience, vol. 1097, 1996, pp. 66–75.[5] G. Xue, “Primal-dual algorithms for computing weightconstrainedshortest paths and weight-constrained minimumspanning trees,” in Performance, Computing, and CommunicationsConference, 2000. IPCCC ’00. Conference Proceedingof the IEEE International, 2000, pp. 271 –277.[6] S.-P. Hong, S.-J. Chung, and B. H. Park, “A fully polynomialbicriteria approximation scheme for the constrained spanningtree problem,” Operations Research Letters, vol. 32, pp.233–239, 2004.[7] R. Hassin and A. Levin, “An efficient polynomial time approximationscheme for the constrained minimum spanningtree problem using matroid intersection,” SIAM Journal onComputing, vol. 33, no. 2, pp. 261–268, 2004.[8] S. Henn, “Weight-constrained minimum spanning tree problem,”Master’s thesis, Department of Mathematics, Universityof Kaiserslautern, Kaiserslautern, Germany, 2007.[9] K. A. An<strong>de</strong>rsen, K. Jörnsten, and M. Lind, “On bicriterionminimal spanning trees: an approximation,” Computers andOperations Research, vol. 23, pp. 1171–1182, 1996.[10] G. Zhou and M. Gen, “Genetic algorithm approach on multicriteriaminimum spanning tree problem,” European Journalof Operational Researc, vol. 114, pp. 141–152, 1999.[11] G. Chen, S. Chen, W. Guo, and H. Chen, “The multi-criteriaminimum spanning tree problem based genetic algorithm,”Information Sciences, vol. 177, pp. 5050–5063, 2007.[12] F. Sourd and O. Spanjaard, “A multiobjective branch-andbound:application to the bi-objective spanning tree problem,”INFORMS Journal on Computing, vol. 20, pp. 472–484, 2008.[13] D. Rocha, E. Goldbarg, and M. Goldbarg, “A new evolutionaryalgorithm for the biobjective minimum spanningtree problem,” in <strong>Proceedings</strong> of the ISDA 07, InternationalConference on Intelligent Systems Design and Applications,2007, pp. 735 –740.[14] M. Davis-Moradkhan, W. Browne, and P. Grindrod, “Extendingevolutionary algorithms to discover tri-criterion and nonsupportedsolutions for the minimum spanning tree problem,”in <strong>Proceedings</strong> of the 11th Annual conference on Geneticand evolutionary computation, ser. GECCO ’09, 2009,pp. 1829–1830.[15] S. Monteiro, E. Goldbarg, and M. Goldbarg, “A new transgeneticapproach for the biobjective spanning tree problem,” in2010 IEEE Congress on Evolutionary Computation (CEC),2010, pp. 1 –5.[16] K. Mehlhorn and M. Ziegelmann, “CNOP - a package forconstrained network optimization,” in Algorithm Engineeringand Experimentation, ser. Lecture Notes in ComputerScience, 2001, vol. 2153, pp. 17–31.[17] D. Blokh and G. Gutin, “An approximation algorithm forcombinatorial optimization problems with two parameters,”Australasian Journal of Combinatorics, vol. 14, pp. 157–164, 1996.[18] G. Handler and I. Zang, “A dual algorithm for the constrainedshortest path problem,” Networks, vol. 10, pp. 293–310, 1980.[19] A. Jüttner, B. Szviatovszki, I. Mécs, and Z. Rajkó, “Lagrangerelaxation based method for the QoS routing problem,” in<strong>Proceedings</strong>. IEEE INFOCOM, 2001, pp. 859–868.[20] T. Magnanti and L. Wolsey, “Optimal trees,” in NetworkMo<strong>de</strong>ls, ser. Handbooks in Operations Research and ManagementScience, Vol. 7, M. Ball, T. Magnanti, C. Monma,and G. Nemhauser, Eds. North-Holland: Elsevier SciencePublishers, 1995, pp. 503–615.[21] C. Requejo, A. Agra, A. Cerveira, and E. Santos, “Formulationsfor the weight-constrained minimum spanning treeproblem,” in <strong>Proceedings</strong> of the International Conference onNumerical Analysis and Applied Mathematics, ser. AIP Conference<strong>Proceedings</strong>, vol. 1281, 2010, pp. 2166–2169.[22] R. Ahuja, T. Magnanti, and J. Orlin, Network Flows: Theory,Algorithms and Applications. Prentice-Hall, 1993.[23] N. Shor, Minimization Methods for Non-Differentiable Functions.Springer-Verlag, 1985, english translation.[24] Y. Xiao, K. Thulasiraman, G. Xue, and A. Jüttner, “The constrainedshortest path problem: Algorithmic approaches an<strong>da</strong>n algebraic study with generalization,” AKCE InternationalJournal of Graphs and Combinatorics, no. 2, pp. 63–86,2005.[25] M. Held, P. Wolfe, and H. Crow<strong>de</strong>r, “Vali<strong>da</strong>tion of subgradientoptimization,” Mathematical Programming, vol. 6, pp.62–88, 1974.[26] D. Pisinger, “Where are the hard knapsack problems?”DIKU, University of Copenhagen, Denmark, Technical Report2003/08, 2003.ALIO-EURO <strong>2011</strong> – 41


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>A Heuristic and an Exact Method for Pattern Sequencing ProblemsLuigi De Giovanni ∗ Gionata Massi † Ferdinando Pezzella † Marc E. Pfetsch ‡Giovanni Rinaldi § Paolo Ventura §∗ Dipartimento di Matematica Pura e Applicata, Università <strong>de</strong>gli Studi di Padovavia Trieste 63, 35121 Padova (Italy)luigi@math.unipd.it† Dipartimento di Ingegneria Informatica Gestionale e <strong>de</strong>ll’AutomazioneUniversità Politecnica <strong>de</strong>lle Marche – via Brecce Bianche 12, Ancona (Italy){massi,pezzella}@diiga.univpm.it‡ Institute for Mathematical Optimization, Technische Universität BraunschweigPockelsstraße 14, 38106 Braunschweig (Germany)m.pfetsch@tu-bs.<strong>de</strong>§ Istituto di Analisi <strong>de</strong>i Sistemi e Informatica - Antonio Ruberti, CNRviale Manzoni 30, 00185 Roma (Italy){rinaldi,ventura}@iasi.cnr.itABSTRACTIn many applications, a suitable permutation of patterns (electroniccircuit no<strong>de</strong>s, cutting patterns, product or<strong>de</strong>rs etc.) has to be foundin or<strong>de</strong>r to optimize over some given objective function, so givingrise to the so-called Open Stack Problems. We focus on the GateMatrix Layout Problem, where electronic circuits are obtained byconnecting gates and one seeks a gate layout permutation that minimizesconnection costs un<strong>de</strong>r restrictions on the circuit area. Inthe literature, the connection costs and the circuit area are alsoknow as Time of Open Stacks and Maximum Number of OpenStacks, respectively. We propose a genetic algorithm providingheuristic solutions, and a branch-and-cut algorithm, based on anew linear integer programming formulation and representing, atour best knowledge, the first exact approach in the literature. Thealgorithms are un<strong>de</strong>r extensive test, and preliminary results on realinstances are presented here.Keywords: Time of Open Stacks, Maximum Number of OpenStacks, Genetic Algorithms, Integer Linear Programming, Branchand-Cut1. INTRODUCTIONThe Gate Matrix Layout Problem is related to programmable logicarray folding in Very Large Scale Integration (VLSI) electroniccircuit <strong>de</strong>sign [1]. Roughly speaking, gates correspond to circuitno<strong>de</strong>s and different connections are required. Each connection involvesa subset of no<strong>de</strong>s and is called net. Figure 1(a) shows anexample where 7 gates (vertical lines) have to be connected accordingto 5 different nets, <strong>de</strong>scribed by dots of the same row: netA connects gates 1, 3 and 5, net B connects gates 1, 4, 5 and 6 etc.Wires are used to create connections, one for each net, as shownin Figure 1(b). Note that, to connect the gates of a net, it may benecessary to cross other gates not inclu<strong>de</strong>d in the net, <strong>de</strong>pendingon the gate layout sequence. Also, a single connection track can beused to place non-overlapping net wires, as shown in Figure 1(c)for nets D and E. The total wire length <strong>de</strong>termines the connectioncost, while the number of tracks <strong>de</strong>termines the total circuit area,which may be limited by <strong>de</strong>sign constraints or efficiency issues.ABCDE1 2 3 4 5 6 7ABCDE1 2 3 4 5 6 7ABCD1 2 3 4 5 6 7(a) (b) (c)Figure 1: Sample gate matrix: connection requests (a), wired nets(b) and connection tracks (c).ABCDE1 3 5 2 4 6 7ABCDE1 3 5 2 4 6 7 1 3 5 2 4 6 7AB(a) (b) (c)Figure 2: Sample gate matrix: an improved gate sequence.Both indicators give an estimate of the circuit layout efficiencyand <strong>de</strong>pend on how gates are sequenced. The gate layout of Figure1 requires 19 wire units and 4 tracks, corresponding to the maximumnumber of overlapping net wires. A better layout is shownin Figure 2, using 15 wire units and 3 tracks.We <strong>de</strong>fine the Gate Matrix Layout Problem (GMLP) as the problemof finding a gate permutation such that the connection cost isminimized and the number of required tracks is limited. The problemis NP-Hard and has several applications in different fields [2].For example, in production planning, gates correspond to articles,nets to client or<strong>de</strong>rs and wires represent the occupation of <strong>de</strong>dicatedor<strong>de</strong>r stacks (and related loading facilities) over all the or<strong>de</strong>rprocessing time, <strong>de</strong>pending on the article production sequence.The same stack can be used for non-overlapping or<strong>de</strong>rs and onewants to find a production sequence that minimizes the total stackoccupation time, un<strong>de</strong>r the restriction that the maximum numberof overlapping or<strong>de</strong>rs, that is the maximum number of simultaneouslyopen stacks during the production process, is at most theDECEALIO-EURO <strong>2011</strong> – 42


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>number of available stacks, as <strong>de</strong>termined by plant layouts. Similarly,in cutting stock environments, the items (corresponding tonets in GMLP) obtained from panels sawed according to given cuttingpatterns (corresponding to gates) are heaped on stacks aroundthe sawing machine. Stacks remain open during all the productiontime of the related item and, again, the same stack (correspondingto track) can be used for items whose production does not overlapover time. The problem is to find a cutting pattern permutationthat minimizes the total stack opening time, provi<strong>de</strong>d that the maximumnumber of simultaneously open stacks during the cuttingprocess must not exceed a given threshold, which is a parameterof the sawing center. In the literature, the total stack occupationtime and the maximum number of simultaneously open stacks areknown as Time of Open Stacks (TOS) and Maximum number ofOpen Stacks (MOS), respectively. In GMLP, the wire length correspondsto TOS, and the number of required tracks correspondsto MOS. Note that a given gate sequence may not be feasible becausethe number of required tracks (MOS) exceeds the numberof available tracks as <strong>de</strong>termined by the restrictions on the circuitarea.We can characterize an instance of GMLP by a production matrixM ∈ {0,1} m×n and a parameter λ ∈ Z + representing the numberof available tracks and, hence, an upper bound for MOS, meaningthat all the sequences having MOS greater than λ are not feasible.Rows of M are associated with nets, columns with gates,and M(i, j) = 1 if and only if net i inclu<strong>de</strong>s gate j. A solutionof GMLP consists in a sequence φ : [1,...,n] → [1,...,n], whereφ( j) indicates the layout position of gate j. Such a solution <strong>de</strong>finesa new matrix M φ obtained from M by permuting its columnsaccording to φ. From M φ we obtain a stack matrix ¯M φ by switchingto 1 any 0 of M φ between two 1s in the same row. Therefore¯M φ (i, j) = 1 if and only if, according to φ, the wire of net i inclu<strong>de</strong>sor crosses gate j. Figure 3 reports the production matrix of theM =1 2 3 4 5 6 71 0 1 0 1 0 01 0 0 1 1 1 00 1 0 1 0 0 11 1 1 0 0 0 00 0 0 1 0 1 1(a)M φ =1 3 5 2 4 6 71 1 1 0 0 0 01 1 1 1 1 1 00 0 0 1 1 1 11 1 1 1 0 0 00 0 0 0 1 1 1(b)Literature on pattern sequencing problems is rich and related todifferent application fields and solution techniques. Nevertheless,most works consi<strong>de</strong>r MOS minimization ([3, 4, 5, 6], among others),and TOS is sometimes used to heuristically drive the searchof good MOS sequences (see for example [7, 8]). Just a few workstake TOS optimization explicitly into account. Among the mostrecent ones, we cite [9], proposing a Constructive Genetic Algorithm,where GMLP is solved by integrating genetic operators, localsearch and schemata filling heuristics, and [10], where a biobjectiveapproach is consi<strong>de</strong>red for an application in the paper industry,and the set of Pareto-optimal solutions is approximated bya genetic algorithm improved by initial heuristics and local search.In this paper, we focus on GMLP, i.e. on pattern sequencing problemswhere TOS has to be minimized un<strong>de</strong>r restrictions on MOS,and we propose two algorithms: the first one, <strong>de</strong>scribed in Section2, aims at <strong>de</strong>termining both an as low as possible thresholdλ for the number of tracks (MOS), and a feasible sequence witha low connection cost (TOS); the second one starts from this sequenceand minimizes the wire length (TOS), provi<strong>de</strong>d that MOSmust not exceed λ (Section 3). The first algorithm is based on agenetic approach with a composite and dynamic <strong>de</strong>finition of thefitness function. The second algorithm exploits the flexibility ofa new integer programming formulation based on the propertiesof consecutive-ones matrices and solved by branch-and-cut. Anextensive computational campaign is in progress, and preliminaryresults on real GMLP instances are presented in Section 4.2. GENETIC ALGORITHMThe aim of the first algorithm for GMLP is twofold. First, we needto <strong>de</strong>termine an appropriate threshold λ for MOS, which may benot a priori known. For example, in production or cutting stock environments,the limitation on the number of available stacks maybe too restrictive, so that no feasible sequence exists and temporarywarehousing is necessary. We thus want to take λ as lowas possible, to limit temporary warehousing and preserve processefficiency. Second, we seek for a feasible sequence that, beyondminimizing MOS, has also a good TOS, to mimimize connectioncosts. Note that this may also speed-up the branch-and-cut algorithmfor TOS optimization, as a good initial incumbent solutionis available. We consi<strong>de</strong>r a genetic approach: genetic algorithmsiteratively evolve a population of several individuals according tothe principle of natural selection. Each individual enco<strong>de</strong>s a particularsolution and, at each generation, new individuals are obtainedby selecting parents and combining their features. In or<strong>de</strong>r to obtainbetter and better solutions, a fitness value is associated to eachindividual: the fitter the individuals, the more they are likely tobe selected as parents and to transmit their features to new generations.The Genetic Algorithm for GMLP (GAG) is sketched inFigure 4. Individuals are enco<strong>de</strong>d as columns sequences, and theFigure 3: Sample Production Matrix M (a), and Stack Matrix M φ(φ = [1,3,5,2,4,6,7]) with switched elements in italics (b).sample gate matrix of Figure 2 and the stack matrix of sequence[1,3,5,2,4,6,7]. Note that MOS and TOS for a given sequence φcan be easily obtained from ¯M φ . The length of the wire requiredby net i is the distance (in number of gates) between the first andthe last gate of i, equal to the number of 1s in the i-th row of ¯M φ ,minus 1 (the first gate must not be consi<strong>de</strong>red). Therefore, thelength of the wire for a single net is the sum of the entries of therelated row of ¯M φ minus 1 and TOS is the sum of all the entriesof ¯M φ , minus m. MOS is the maximum number of 1s appearing inany of the columns of ¯M φ . Summarizing, given a {0,1}-matrix M,GMLP is to find a column permutation having MOS not greaterthan λ and minimizing TOS.1. Determine individuals of the initial population2. Repeat (for each generation)3. Repeat (for each offspring)4. Select two parents5. Generate offspring by crossover6. Apply mutation to offspring7. Until a set of new individuals are generated8. Replace old individuals with new ones9. Refine the fittest individuals by local search10. A<strong>da</strong>pt fitness criteria11. Until termination conditions are satisfied12. Return the best individual found.Figure 4: Sketch of the Genetic Algorithm for GMLP.initial population is obtained in part heuristically, in part by randomcolumns permutations (step 1). The operator to recombineindividuals and obtain offspring for the new generation (steps 3to 7) is the Or<strong>de</strong>r Crossover, borrowed from the Traveling SalesmanProblem. After selecting two parents, two new individuals aregenerated: each individual inherits a subsequence from one parentand the remaining elements are filled-in in the relative or<strong>de</strong>r of theother parent. To avoid premature convergence, new individualsun<strong>de</strong>rgo a mutation, with a given probability: mutation exchangesALIO-EURO <strong>2011</strong> – 43


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>the position of two randomly chosen columns. The new generationis obtained by replacing with the new offspring all the individuals,but an elite set of the fittest ones and a steady set chosen at random(step 8). Before starting the next iteration, a refinement operatorexplores the 2-OPT neighborhood of most promising individualsand replaces them with local optima (step 9). GAG terminatesafter a fixed number of generations, returning the best individualfound so far.With respect to stan<strong>da</strong>rd genetic algorithms, GAG introduces somenew features, which have experimentally shown to significantlyimpact on its performance, and are mainly related to the fitnessfunction <strong>de</strong>finition and to the refinement operator. The fitnessfunction is used to gui<strong>de</strong> the selection mechanism and, accordingto the twofold aim of GAG, both MOS and TOS has to betaken into account. MOS is related to critical subsequences and isvery unlikely to change un<strong>de</strong>r small sequence perturbations. Furtherindicators are thus necessary to discriminate fittest individualsand, as discussed in [7], TOS is not enough: in fact, both MOSand TOS measure the whole sequence and may hi<strong>de</strong> good localfeatures. We thus propose two new indicators, based on relationsbetween close columns in a given sequence φ: NEW, which sumsup the 1s in one column of M φ not contained in the previous one,and IOS, the maximum increment in the number of 1s from onecolumn of ¯M φ to the following one. Summarizing, the fitness ofan individual is a weighted sum of MOS, TOS, NEW and IOS.Further, we propose to dinamically change the weights during theevolution (step 10), and three settings are used to obtain differentsearch phases: during the first generations, emphasis is on MOSoptimization, with negligible weights to TOS, NEW and IOS; thenGAG switches to a second setting, aiming at obtaining better TOS,while diversifying the population and emphasis is on TOS, NEWand IOS; finally, the search is gui<strong>de</strong>d again toward MOS optimizationand the related weight is increased, to minimize λ and fin<strong>da</strong> good feasible solution. Concerning the refinement operator, astan<strong>da</strong>rd implementation of the 2-OPT local search may be computationallyexpensive. Several speeding-up tricks has been <strong>de</strong>vised,whose <strong>de</strong>tails are beyond the scope of this short paper. Wejust mention that the refinement is applied with a low frequencyto a few individuals, and that an incremental neighbor evaluationhas been implemented, based on some invariance properties of thestack matrix (the same incremental evaluation is applied to offspringgenerated by crossover).3. EXACT BRANCH-AND-CUT PROCEDUREGiven a matrix A ∈ R m×n , the minor A IJ is the submatrix of A<strong>de</strong>fined by the or<strong>de</strong>red subsets I and J of rows and columns, respectively.Let [A] p,q be the set of all minors of A of size p × q.Given two matrices A,B ∈ R m×n in the following we will <strong>de</strong>noteby 〈A,B〉 the inner product of A and B. A {0,1}-matrix A has theconsecutive ones property for rows (or, briefly, A is C1P) if thecolumns of A can be permuted so to obtain a strict C1P matrix,that is a {0,1} matrix such that in each row the ones appear consecutively,i.e. in each row they can not appear two 1s separatedby one or more 0s. According to this <strong>de</strong>finition we can now stateour formulation for GMLP as follows: given M ∈ {0,1} m×n andλ ∈ Z + , minimize ∑ i∈{1,...,m}, j∈{1,...,n} X(i, j) withX is C1P (1)X(i, j) ≥ M(i, j), ∀i ∈ 1,...,m, ∀ j ∈ 1,...,n (2)λ ≥m∑ X(i, j), ∀ j ∈ 1,...,n (3)i=1X ∈ {0,1} m×n . (4)A feasible solution X of the previous system is then a {0,1}-matrix(constraint (4)), obtained by turning 0s of M into 1s (constraints(2)), and such that there exists a sequence φ of its columns suchthat X = ¯M φ (constraint (1)). Constraints (3) ensure that the numberof stacks contemporary open by the solution X does not exceedthe given value λ and the objective function corresponds toTOS. Still, in or<strong>de</strong>r to obtain an integer linear program, we haveto translate constraint (1) into linear inequalities. Tucker [11] gavea characterization of the C1P matrices using five special matricesTk 1,Tk 2,Tk 3,T4 ,T 5 , called Tucker minor. In particular, T 4 and T 5have fixed dimension, while Tk 1,Tk 2,and T k3 have dimension <strong>de</strong>pendingon parameter k (for example, the minor Tk1 for k = 4 isshown in Figure 5(a)). Tucker proved that a matrix A ∈ {0,1} m×n⎛⎞1 1 0 0 0 00 1 1 0 0 00 0 1 1 0 0⎜0 0 0 1 1 0⎟⎝0 0 0 0 1 1⎠1 0 0 0 0 1(a)⎛⎞1 1 0 0 0 −1−1 1 1 0 0 0−1 0 1 1 0 0⎜−1 0 0 1 1 0⎟⎝−1 0 0 0 1 1 ⎠1 −1 0 0 0 1(b)Figure 5: The Tucker minor T4 1 (a) with the corresponding coefficientsof the Oswald-Reinelt matrix F 14 (b) <strong>de</strong>fining the validinequality 〈F 14 ,X IJ 〉 ≤ 11.is C1P if and only if none of its minors is a Tucker minor. Morerecently, Oswald and Reinelt used the Tucker characterization inor<strong>de</strong>r to provi<strong>de</strong> a <strong>de</strong>scription of the C1P matrices in terms of linearinteger programming. In<strong>de</strong>ed they first <strong>de</strong>fined the {0,1,−1}matrices F 1k , F 2k , F 3 , and F 4 (see Figure 5(b) for an example) andproved the following:Theorem 1 ([12, 13]). A matrix X ∈ {0,1} m×n is C1P if and onlyif all the following OR-inequalities are satisfied:〈F 1k ,X IJ 〉 ≤ 2k + 3, ∀ X IJ ∈ [A] k+2,k+2 , ∀ k ≥ 1; (5)〈F 2k ,X IJ 〉 ≤ 2k + 3, ∀ X IJ ∈ [A] k+2,k+3 , ∀ k ≥ 1; (6)〈F 3 ,X IJ 〉 ≤ 2k + 3, ∀ X IJ ∈ [A] 4,6 ; (7)〈F 4 ,X IJ 〉 ≤ 2k + 3, ∀ X IJ ∈ [A] 4,5 ; (8)We can then use such a characterization to get a linear integer formulationof GMLP by replacing constraint (1) with the set of inequalities(5),...,(8). Observe that here, differently from the formulationproposed by Baptiste in [6], one does not need to takeexplicitly into account the or<strong>de</strong>r of the columns of X. Therefore,let X ∗ be the optimal solution of such a linear integer optimizationprogram. Then X ∗ is a C1P matrix and we can now applythe so-called PQ-tree procedure [14] that, in linear time, returns acolumns sequence φ ∗ that turns X ∗ into a strict C1P matrix.Observe here that, as it corresponds to the number of minors of theinput matrix M, the number of constraints (5) and (6) grows exponentiallywith the size of M (the number of inequalities of type (7)and (8), even if not exponential, is boun<strong>de</strong>d by a high polynomialin m and n). This implies that the proposed formulation cannot beused explicitly but its linear relaxation must be solved by a cuttingplanes procedure. Oswald and Reinelt [13] <strong>de</strong>fined a polynomialtime algorithm to exactly separate inequalities (5),. . . , (8), but herewe implemented a heuristic separation routine that is similar to theone proposed in [12]. In particular, given a fractional solution ˜X,we round its values to the corresponding closest integers so to obtainthe matrix ¯X and then, using the PQ-tree algorithm [14], wecheck if ¯X is C1P. In case ¯X is not C1P, the PQ-tree algorithm producesas output a Tucker minor of ¯X and we use the correspondingOswald and Reinelt inequality as a cutting plane. Although, becauseof the rounding procedure, the separation routine we implementedis not exact, all the integer solution that do not correspondto C1P matrices are cut off. This implies that the solution provi<strong>de</strong>dALIO-EURO <strong>2011</strong> – 44


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>by the branch-and-cut algorithm <strong>de</strong>scribed above is the optimal solutionof the GMLP instance given as input.4. COMPUTATIONAL RESULTSThe proposed approach for GMLP has been implemented in C++and run on a 2.1 GHz Intel Core2 processor. For the branch-andcutprocedure, we have used the SCIP 1.00.7 framework [15] andCplex 11.0 as linear programming solver. The algorithm is currentlyun<strong>de</strong>r extensive test: in this abstract we present preliminaryresults on a benchmark of real instances from VLSI industry proposedin [5]. Concerning GAG, we have experimentally set thenumber of generations to min{20n,500}, the number of individualsto min{10n,500} and, besi<strong>de</strong>s other parameters, the fitnessfunction weights shown in Table 1. The results are reported inUp to iteration MOS TOS NEW IOS35% 0.70 0.16 0.07 0.0750% 0.10 0.50 0.20 0.20100% 0.95 0.05 0.00 0.00Table 1: GAG fitness function weight settings.Table 2 and compare GAG with the Constructive Genetic Algorithm[9] (CGA). Instance name and size are shown in the firstcolumn. Column λ is the threshold on MOS, corresponding to theminimum MOS found by GAG. The same MOS is also found byCGA and, for all the instances, it corresponds to proven optimalor best known (instance W4) MOS. Following columns summarizethe results of 10 trials of CGA and GAG. SR λ is the successrate, that is, the percentage of trials obtaining a MOS = λ. TOS,Avg and Dev are, respectively, the best found TOS, the averageTOS and the stan<strong>da</strong>rd <strong>de</strong>viation over the feasible sequences havingMOS = λ. Note that Avg and Dev refer to the top five trials,as just this information is available from [9]. T(s) is the averagecomputational time, in seconds, over all the 10 trials. The branchand-cutprocedure has been run, with a time limit of 1 hour, withthe aim of improving over the TOS provi<strong>de</strong>d by GAG, or prove itsoptimality un<strong>de</strong>r the constraint MOS ≤ λ: the last two columnsof Table 2 report the obtained TOS (proven optima in bold) andthe time to prove optimality or to find the improved solution (initalics). First, we observe that, for two instances, CGA provi<strong>de</strong>snon-feasible TOS (in italics), as they are below the optimal solution.For all the remaining instances but one, GAG provi<strong>de</strong>s betterTOS. GAG shows also more reliable: it finds the best MOSmore frequently than CGA and it has lower average TOS (exceptW4). Running times are comparable, taking into account that CGAran on a 266 MHz processor. We remark that the TOS shown inTable 2 come from feasible sequences, that is, sequences whoseMOS does not exceed λ. In fact, minimizing TOS and MOS isnot equivalent, as shown in [2], and GAG was able to find nonfeasiblesolutions with better TOS: for example, one trial on W4obtained TOS = 1633 with MOS = 28 and one trial on v4000 obtainedTOS = 52 with MOS = 6. Concerning B&C, it proves theoptimality of four instances, and improves over the TOS provi<strong>de</strong>dby GAG in two cases (MOS is always equal to λ).5. CONCLUSIONSWe have presented a genetic approach (GAG) and a branch-andcutprocedure (B&C) for GMLP, a pattern sequencing problem<strong>de</strong>aling with TOS minimization un<strong>de</strong>r restrictions on MOS. GAGintroduces a dynamic weighted sum of TOS, MOS and other newperformance indicators as fitness function, to take into accountboth global and local features of the pattern sequences. B&C is, toour best knowledge, the first algorithm <strong>de</strong>signed to find proven optimalTOS un<strong>de</strong>r constraints on MOS: it is based on the propertiesof C1P matrices and it is flexible enough to accommo<strong>da</strong>te differentobjectives or performance constraints. Preliminary results on realinstances show that GAG normally outperforms previous literatureresults, and that, in some cases, B&C is able to prove the optimalityof the proposed GMLP solutions. Ongoing research inclu<strong>de</strong>sa better calibration of GAG parameters, extensive tests to betterassess the performance of the approach, more sophisticated fitnessfunction weights setting (cycling between settings, choosing settingsbased on landscape analysis etc.), and the improvement ofB&C efficiency on large instances.6. REFERENCES[1] R. Möhring, “Graph problems related to gate matrix layoutand PLA folding,” Computing, vol. 7, pp. 17–51, 1990.[2] A. Linhares and H. H. Yanasse, “Connections betweencutting-pattern sequencing, VLSI <strong>de</strong>sign, and flexible machines,”Computers & Operations Research, vol. 29, pp.1759–1772, 2002.[3] J. C. Becceneri, H. H. Yanasse, and N. Y. Soma, “A methodfor solving the minimization of the maximum number ofopen stacks problem within a cutting process,” Computersand Operations Reasearch, vol. 31, pp. 2315–2332, 2004.[4] G. Chu and P. J. Stuckey, “Minimizing the maximum numberof open stacks by customer search,” Lecture Notes in ComputerScience, vol. 5732, pp. 242–257, 2009.[5] Y. H. Hu and S. J. Chen, “GM_Plan: A Gate MatrixLayout Algorithm based on Artificial Intelligence PlanningTechniques,” IEEE Transactions on Computer-Ai<strong>de</strong>d Design,vol. 9, pp. 836–845, 1990.[6] B. M. Smith and I. P. Gent, Eds., <strong>Proceedings</strong> of IJCAI’05 –Constraint Mo<strong>de</strong>lling Challenge 2005, Edimburgh, Jul. 31,2005.[7] L. De Giovanni, G. Massi, and F. Pezzella, “An a<strong>da</strong>ptive geneticalgorithm for large-size open stack problems,” DMPA,Università di Padova, Tech. Rep., 2010.[8] A. C. M. d. Oliveira and L. A. N. Lorena, “Pattern SequencingProblems by Clustering Search,” Lecture Notes in ComputerScience, vol. 4140, pp. 218–227, 2006.[9] ——, “A Constructive Genetic Algorithm for Gate MatrixLayout Problems,” IEEE Transactions on Computer-Ai<strong>de</strong>dDesign of Integrated Circuits and Systems, vol. 21, no. 8, pp.969–974, 2002.[10] A. Respício and M. E. Captivo, Metaheuristics: Progress asReal Problem Solvers. Ibaraki T., Nonobe K. and YagiuraM. (Eds.), Eds. Swets & Zeitlinger, 2005, ch. Bi-objectiveSequencing of Cutting Patterns – An Application for the PaperIndustry, pp. 227–241.[11] A. Tucker, “A structure theorem for the consecutive 1’s property,”J. Combinatorial Theory Ser. B, vol. 12, pp. 153–162,1972.[12] M. Oswald and G. Reinelt, “Constructing new facets of theconsecutive ones polytope,” in Combinatorial Optimization– Eureka, You Shrink! Papers Dedicated to Jack Edmonds,5th International Workshop, Aussois, 2001, ser. LNCS,M. Jünger, G. Reinelt, and G. Rinaldi, Eds. Springer-Verlag,2003, vol. 2570, pp. 147–157.[13] ——, “Computing optimal consecutive ones matrices,” inThe Sharpest Cut, The Impact of Manfred Padberg and HisWork, ser. Optimization, M. Grötschel, Ed. MPS/SIAM,2004, pp. 173–184.ALIO-EURO <strong>2011</strong> – 45


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>CGA GAG B&CInst. (m × n) λ SR λ TOS Avg Dev T(s) SR λ TOS Avg Dev T(s) TOS T(s)Wli (11×10) 4 100% 18 18.0 0.0% 0.5 100% 24 24.0 0.0% 0.0 24 5Wsn (17×25) 8 100% 104 106.6 3.6% 1.5 100% 97 97.6 0.6% 0.3 96 48v4000(10×17) 5 100% 53 53.3 1.7% 0.5 40% 58 58.3 5.0% 0.1 56 42v4050(13×16) 5 100% 41 41.4 1.3% 0.5 100% 38 38.8 1.2% 0.1 38 23v4090(23×27) 10 90% 95 96.8 1.7% 2.0 100% 109 109.0 0.0% 0.4 – –V4470(37×47) 9 100% 246 262.4 5.6% 66.5 100% 237 242.6 1.3% 4.0 – –X0 (40×48) 11 80% 303 305.2 0.6% 75.6 100% 298 298.8 0.1% 5.6 – –W1 (18×21) 4 100% 39 39.8 4.6% 1.0 100% 39 39.8 2.8% 0.2 39 4W2 (48×33) 14 100% 235 257.2 8.5% 18.5 100% 233 233.0 0.0% 1.9 – –W3 (84×70) 18 50% 677 751.6 11.9% 306.3 100% 675 677.6 0.3% 82.2 – –W4 (202×141) 27 30% 1730 1805.0 3.3% 5224.7 70% 1701 2000.0 12.0% 94.6 – –– no optimal solution nor improvement after 1 hour computationTable 2: Results on VLSI instances.[14] K. S. Booth and G. S. Lueker, “Testing for the consecutiveones property, interval graphs, and graph planarity using pqtreealgorithms,” J. Comput. Syst. Sci., vol. 13, pp. 335–379,1976.[15] T. Achterberg, “Scip: Solving constraint integer programs,”Mathematical Programming Computation, vol. 1, no. 1, July2009.ALIO-EURO <strong>2011</strong> – 46


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>An integer programming framework for sequencing cutting patterns based oninterval graph completionIsabel Cristina Lopes ∗ † J.M. Valerio <strong>de</strong> Carvalho †∗ ESEIG, Polytechnic Institute of PortoRua D.Sancho I, 981, Vila do Con<strong>de</strong>cristinalopes@eu.ipp.pt† Department of Production and Systems, University of MinhoCampus <strong>de</strong> Gualtar, Bragavc@dps.uminho.ptABSTRACTWe <strong>de</strong>rived a framework in integer programming, based on theproperties of a linear or<strong>de</strong>ring of the vertices in interval graphs,that acts as an edge completion mo<strong>de</strong>l for obtaining interval graphs.This mo<strong>de</strong>l can be applied to problems of sequencing cutting patterns,namely the minimization of open stacks problem (MOSP).By making small modifications in the objective function and usingonly some of the inequalities, the MOSP mo<strong>de</strong>l is applied to anotherpattern sequencing problem that aims to minimize, not onlythe number of stacks, but also the or<strong>de</strong>r spread (the minimizationof the stack occupation problem), and the mo<strong>de</strong>l is tested.Keywords: Integer programming, Interval graphs, Sequencing cuttingpatterns1. INTRODUCTIONCutting stock operations require advanced planning. The classiccutting stock problem consists in <strong>de</strong>fining the cutting patterns witha cost minimization criterion that usually <strong>de</strong>pends on the waste ofthe cutting process. But even after the cutting patterns are <strong>de</strong>fined,there is more optimization that can be done in or<strong>de</strong>r to reduce thecost of the operations. The sequence in which the cutting patternswill be processed on the cutting equipment can be a relevantfactor for the efficiency of the operations, for the organization ofthe work area space, for the fulfillment of the customers’ or<strong>de</strong>rson time, or for the fastness of the <strong>de</strong>liveries to customers. Theseconcerns gave rise to several pattern sequencing problems, such asthe minimization of open stacks and the minimization of the or<strong>de</strong>rspread.In literature, pattern sequencing problems have been studied bothalone and integrated with the <strong>de</strong>termination of the cutting patterns.The most used approach is to solve the problem combining twostages, a first stage where the cutting patterns are <strong>de</strong>fined and asecond stage where the sequence of the implementation of the cuttingpatterns is <strong>de</strong>ci<strong>de</strong>d. This work is <strong>de</strong>voted to the second stage,when the cutting patterns are already <strong>de</strong>termined but the sequencein which they will be processed is still an open issue. The mainproblem addressed is the minimization of the maximum numberof open stacks, also called MOSP.This problem has been wi<strong>de</strong>ly studied in literature, but there areseveral other pattern sequencing problems, such as the minimizationof the or<strong>de</strong>r spread (MORP) and the minimization of discontinuities(MDP).The Minimization of Open Stacks Problem (MOSP) comes fromthe flat glass cutting industry, but it also has many applicationsin other cutting industries (woo<strong>de</strong>n panels, steel tubes, paper,...)as well as in other fields such as production planning, VLSI circuit<strong>de</strong>sign and in classic problems from graph theory. The MOSPproblem is based on the premise that the different items obtainedfrom cutting patterns are piled in stacks in the work area until allitems of the same size have been cut. Usually, machines processone cutting pattern at a time and the sequence in which preset cuttingpatterns are processed can affect the number of stacks thatremain around the machine.Due to space limitations and <strong>da</strong>nger of <strong>da</strong>mages on the stackeditems, it is advantageous to find a sequence for the patterns thatminimizes the number of different items that are being cut andtherefore the number of open stacks.The minimization of open stacks problem is known to have tightrelations with problems in graph theory such as treewidth, vertexseparation and the profile of a matrix. In studying these problems,we found a type of graphs called interval graphs that can play animportant role in this work.An interval graph is an undirected graph G such as its vertices canbe put into a one-to-one correspon<strong>de</strong>nce with a set of intervals Iof a linearly or<strong>de</strong>red set (like the real line) such that two verticesare connected by an edge of G if and only if their correspondingintervals have nonempty intersection. I is called an interval representationfor G. [1]These graphs can be used to <strong>de</strong>scribe a solution of the pattern sequencingproblems, by mo<strong>de</strong>ling the duration of the intervals intime in which the same piece type is being cut. Using several propertiesof this type of graphs we will see that it is possible to <strong>de</strong>rivea general framework that can be used to mo<strong>de</strong>l the minimizationof open stacks problem and to mo<strong>de</strong>l many related problems.MOSP is mo<strong>de</strong>led as an interval graph completion problem. Aninitial integer programming mo<strong>de</strong>l was <strong>de</strong>rived, using the additionof arcs to the graph and the properties of interval graphs to achievea solution, and based on the following characterization of intervalgraphs by Olariu:A graph G = (V,E) is an interval graph if and only if there existsa linear or<strong>de</strong>ring ϕ : V → {1,...,N} such that ∀i, j,k ∈ V : ϕ(i)


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>problem that aims to minimize, not only the number of stacks, butalso the or<strong>de</strong>r spread (the minimization of the stack occupationproblem) is consi<strong>de</strong>red, and the mo<strong>de</strong>l is tested.There is also another pattern sequencing problem called the Minimizationof Tool Switches (MTSP) which is addressed with thisframework, using the similarities between this problem and theMOSP, but for this problem the mo<strong>de</strong>l has a limited use.With the choice being integer programming, the formulation <strong>de</strong>velopedin this work can later be integrated in other integer programmingmo<strong>de</strong>ls for cutting stock problems, namely to create acombined mo<strong>de</strong>l of the stages one and two where the cutting stockpatterns are <strong>de</strong>fined and sequenced.2. MODELING THE MINIMIZATION OF OPEN STACKSConsi<strong>de</strong>r a cutting machine that processes just one cutting patternat a time. The items already cut that are equal are piled in stacks bythe machine. The stack of an item type remains near the machine ifthere are more items of that type to be cut in a forthcoming pattern.A stack is closed and removed from the work area only after allitems of that size have been cut, and immediately before startingto process the next cutting pattern. After a pattern is completelycut and before any stack is removed the number of open stacks iscounted. The maximum number of open stacks for that sequenceof patterns is called the MOSP number.There are often space limitations around the cutting machines,there is <strong>da</strong>nger of <strong>da</strong>mages on the stacked items, difficulty in distinguishingsimilar items, and in some cases there are handlingcosts of removing the stack temporarily to the warehouse. It is advantageousto minimize the number of open stacks, and that can bedone simply by finding an optimal sequence to process the cuttingpatterns.MOSP has been proved to be a NP-hard problem [3].As suggested in [4], an instance of the MOSP can be associatedwith a graph having a vertex for each item that is cut and an edgebetween two vertices if the corresponding items are present in thesame cutting pattern.To optimize the number of stacks, it is convenient to find the bestsequence to process the cutting patterns. Consi<strong>de</strong>ring that the patternsdo not appear explicitly in the MOSP graph constructed inthis way, how will we find that sequence for the cutting patterns?We will focus on finding a sequence to open the stacks, rather thanon sequencing the cutting patterns. That is not a problem, becauseit is possible to take a solution for the or<strong>de</strong>ring of the vertices ofthe graph and construct a sequence for the corresponding cuttingpatterns [5].Given an instance of the problem, we first build a graph G = (V,E),associating each item cut from the patterns to a vertex and creatingan arc joining vertex i and j if and only if items i and j are cutfrom the same pattern. This graph may not be an interval graphat the start, but we will add some arcs to it in such a way that itwill become one. We need this graph to become an interval graphbecause, if we associate each item to the interval of time in whichthe stack of that item is open, we can use the graph to mo<strong>de</strong>l whatintervals should occur simultaneously and what intervals shouldprece<strong>de</strong> others. According to the sequence in which the cuttingpatterns are processed, there may be more or less open stacks simultaneously.Each arc of the future interval graph means that, fora period of time, the two stacks (the respective vertices of the arc)will remain both open. The initial graph contains only the arcs thatmust be there, in any possible sequence in which the patterns canbe processed. The rest of the arcs that are ad<strong>de</strong>d later to the graphwill differ according to the sequence of the patterns. It is the choiceof these arcs that <strong>de</strong>fines which are the other simultaneously openstacks. Our mo<strong>de</strong>l for this problem consists in finding out whichedges should be ad<strong>de</strong>d to the original MOSP graph G = (V,E) inor<strong>de</strong>r to get an interval graph H = (V,E ∪ F) that minimizes themaximum number of simultaneously open stacks.2.1. The variablesWe set an or<strong>de</strong>ring for opening the stacks by assigning a numberto each item cut, with a bijective function ϕ : V → {1,...,N}. Thislinear or<strong>de</strong>ring of the vertices is set by the <strong>de</strong>cision variables x i j :{1 if ϕ(i) < ϕ( j)x i j =∀i, j ∈ V0 otherwiseNotice that x ii = 0 for any i ∈ V and also that we havex i j = 1 ⇔ x ji = 0These variables are setting an orientation into the arcs, for us tokeep track of the sequence of the items in the current instance. Ifx i j = 1 then item i starts being cut before the item j is, even thoughthe corresponding stacks may overlap or not, i.e., in spite of havingan arc between the two vertices or not.The other <strong>de</strong>cision variables that will be used are concerned to thearcs that are necessary to add to the original graph G = (V,E) toget an interval graph H = (V,E ∪ F) and, together with variablesx, <strong>de</strong>termine which intervals will overlap in the <strong>de</strong>sired intervalgraph. To <strong>de</strong>ci<strong>de</strong> which of these additional arcs are to be ad<strong>de</strong>d,we <strong>de</strong>fine a variable y i j for each arc [i j] that did not exist before inthe graph:{1 if [i j] /∈ F and ϕ(i) < ϕ( j)y i j =∀i, j ∈ V : [i j] /∈ E0 if [i j] ∈ F or ϕ(i) ≥ ϕ( j)Notice that y i j is 1 when the arc [i j] is NOT ad<strong>de</strong>d, because thevariable y i j works like an “eraser”variable. To get an intervalgraph, if we <strong>de</strong>ci<strong>de</strong>d to add to the original graph all the arcs thatwere missing, and then remove some of them - the ones that wedo not need to have an interval graph, then variable y is 1 for theseadditional arcs which are to be removed.Variables y <strong>de</strong>pend on the linear or<strong>de</strong>ring of vertices, so it followsthat there is an anti-reflexive relation:y i j = 1 ⇒ y ji = 0When y i j = 1, the arc [i j] is not nee<strong>de</strong>d in the interval graph, so,by <strong>de</strong>finition of interval graph, if there is not an arc [i j], then theintervals i and j do not intersect. Consequently, one of the intervalsshould finish before the other one starts. As i ≺ j, the interval iopens and finishes before the interval j starts. It means that thestacks for items i and j will never be open at the same time, sothey can share the same stack space.To explain the relations between the intervals horizontally, we willadd an extra set of variables z, based on the asymmetric representativesformulation for the vertex coloring problem by Campêlo etal. [6]. The value of the optimum of the MOSP is equal to the sizeof the biggest clique in the solution graph ω(H) and, because intervalgraphs are perfect graphs, it is equal to the chromatic numberof the graph χ(H), which is the number of colors nee<strong>de</strong>d to assignto the vertices of the graph such that there are no two adjacentvertices of the same color.If we assign colors to the vertices of the <strong>de</strong>sired interval graph,such that no two adjacent vertices have the same color, we cancount the maximum number of simultaneously open stacks by countingthe minimum number of different colors nee<strong>de</strong>d, because simultaneouslyopen stacks will get different colors, and stacks thatdo not overlap can have the same color.ALIO-EURO <strong>2011</strong> – 48


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>The variables that we will use are:{1 if vertex i represents vertex jz i j =0 otherwise∀i, j ∈ V : [i j] /∈ EHaving <strong>de</strong>veloped a fully functional integer programming mo<strong>de</strong>lfor the minimization of open stacks problem, we then exploresome variants of this mo<strong>de</strong>l.Note that if i ∈ V is a representative vertex then z ii = 1.We will use the variable K ∈ N to <strong>de</strong>note the maximum number ofsimultaneously open stacks.2.2. The main mo<strong>de</strong>lUsing this variables we present the following integer programmingmo<strong>de</strong>l for the MOSP:MinimizeSubject to:K0 ≤ x i j + x jk − x ik ≤ 1 ∀i, j,k = 1,...,N,i < j < k (1)y i j − x i j ≤ 0 ∀i, j = 1,...,N,i < j,[i j] /∈ E (2)y i j + x ji ≤ 1 ∀i, j = 1,...,N, j < i,[i j] /∈ E (3)y i j − x k j ≤ 0 ∀i, j,k = 1,...,N,k < j,[i j] /∈ E,[ik] ∈ E (4)y i j + x jk ≤ 1 ∀i, j,k = 1,...,N, j < k,[i j] /∈ E,[ik] ∈ E (5)0 ≤ y ik − y i j + x k j ≤ 1 ∀i, j,k = 1,...,N,k < j,[i j],[ik] /∈ E (6)0 ≤ y i j − y ik + x jk ≤ 1 ∀i, j,k = 1,...,N, j < k,[i j],[ik] /∈ E (7)j−1 NN∑ x i j + ∑ (1 − x ji ) − ∑ y i j + 1 ≤ K ∀ j = 1,...,N (8)i=1 i= j+1i=1[i j]/∈Ey i j + y ki ≤ 1 ∀i, j,k = 1,...,N with [i j],[ik] /∈ E,[ jk] ∈ E (9)y i j + y jk ≤ 1 ∀i, j,k = 1,...,N with [i j],[ jk] /∈ E,[ik] ∈ E (10)y i j + y lk ≤ 1 ∀i, j,k,l = 1,...,N with [i j],[kl] /∈ E,[ jl],[ik] ∈ E (11)y i j + y jk − y ik ≤ 1 ∀i, j,k = 1,...,N with [i j],[ jk],[ik] /∈ E (12)y ik + y ki + y jl + y l j ≤ 1y il + y li + y ik + y ki + y jl ++y l j + y jm + y m j + y mk + y km ≤ 3∀i, j,k,l = 1,...,N with i ≠ j ≠ k ≠ l,[ik],[ jl] /∈ E,[i j],[ jk],[kl],[li] ∈ E∀i, j,k,l,m = 1,...,N with i ≠ j ≠ k ≠ l ≠ m,[ik],[il],[ jl],[ jm],[km] /∈ E,[i j],[ jk],[kl],[lm],[mi] ∈ E(13)(14)N∑ z ii = K (15)i=1N N∑ ∑ z i j = N (16)i=1 j=1[i j]/∈E [i j]/∈EN∑ z i j = 1 ∀ j = 1,...,N (17)i=1[i j]/∈Ez i j ≤ y i j ∀i, j = 1,...,N with [i j] /∈ E (18)z i j + z ik − y jk − y k j ≤ 1 ∀i, j,k = 1,...,N with [i j],[ik],[ jk] /∈ E (19)z i j ≤ z ii ∀i, j = 1,...,N with [i j] /∈ E (20)z i j + z ik ≤ z ii ∀i, j,k = 1,...,N with j < k,[i j],[ik] /∈ E,[ jk] ∈ E (21)z i j + z ik + z il ≤ z ii∀i, j,k,l = 1,...,N with j < k < l,[i j],[ik],[il] /∈ E,[ jk],[kl],[l j] ∈ E(22)∀i, j,k,l,m = 1,...,N with j < k, j < l,k < m,z i j + z ik + z il + z im ≤ z ii (23)[i j],[ik],[il],[im] /∈ E,[ jk],[ jl],[ jm],[kl],[km],[lm] ∈ Ez il + z li + z ik + z ki + z jl ++z l j + z jm + z m j + z mk + z km ≤ 2∀i, j,k,l,m = 1,...,N with i ≠ j ≠ k ≠ l ≠ m,[ik],[il],[ jl],[ jm],[km] /∈ E,[i j],[ jk],[kl],[lm],[mi] ∈ E(24)x i j ∈ {0,1} ∀i, j = 1,...,N with i < j (25)y i j ∈ {0,1} ∀i, j = 1,...,N with i ≠ j,[i j] /∈ E (26)z i j ∈ {0,1} ∀i, j = 1,...,N with [i j] /∈ E (27)K ∈ N (28)3. MINIMUM INTERVAL GRAPH COMPLETIONThe main i<strong>de</strong>a behind the integer programming mo<strong>de</strong>l presented isthe completion of the MOSP graph with suitable fill edges, withthe purpose of constructing an interval graph. There are severaledge completion problems documented in literature [7]. Here weaddress the Minimum Interval Graph Completion, which searchesfor the minimum number of fill edges that should be ad<strong>de</strong>d to agraph to obtain an interval graph. With small changes in the objectivefunction and using some of the previous constraints, wecan build an integer programming mo<strong>de</strong>l for this problem in GraphTheory.We will not need the variables z i j because the number of stacksis irrelevant in the minimum interval graph completion problem.Therefore, inequalities (8), (15) to (24), (27) and (28) are dropedfor this case.The objective is simply completing the graph with the smallestnumber of edges to obtain an interval graph. The sum of all variablesy gives the number of edges that are not ad<strong>de</strong>d to the graphG when completing it to an interval graph H. By maximizing thissum, we get a minimum number of ad<strong>de</strong>d edges.More formally, the objective function for the minimum intervalgraph completion problem ismax ∑ y i j (29)[i j]/∈E4. MINIMIZING THE STACK OCCUPATIONThe mo<strong>de</strong>l we have <strong>de</strong>veloped for the minimization of open stackscan be used in another pattern sequencing problem, where the objectiveis to find an optimal sequence to process the cutting patternsin or<strong>de</strong>r to minimize the occupation of the stacks.The problem we address now is similar to minimizing the flowtime of the or<strong>de</strong>rs: besi<strong>de</strong>s having the minimum number of openstacks, we also want to minimize the sum of the time that the stacksremain open within the system.The sequence in which preset cutting patterns are processed canaffect the flow and total completion time, so it is <strong>de</strong>sirable to optimizethe occupation of the stacks to eliminate unnecessary dispersion.When consi<strong>de</strong>ring the MOSP, it is usual to find more than oneoptimal solution, in the sense that there is more than one sequenceof the cutting patterns that achieves the same maximum numberof open stacks. We may be interested in choosing between theseoptimal solutions of the MOSP according to a different criterion.A natural choice is the minimization of the or<strong>de</strong>r spread.Noticing that in most instances there are alternative optimal solutionsfor the MOSP, we tried to take the problem further and ad<strong>de</strong><strong>da</strong> second step with a new objective function: the minimization ofthe or<strong>de</strong>r spread. This pattern sequencing problem similar to theMOSP is also related with the minimum interval graph completionproblem.Our mo<strong>de</strong>l consists in finding out which arcs should be ad<strong>de</strong>d tothe original MOSP graph G = (V,E) in or<strong>de</strong>r to get an intervalgraph H = (V,E ∪ F) that minimizes the stack occupation whilekeeping the minimum number of simultaneously open stacks.The mo<strong>de</strong>l we present is divi<strong>de</strong>d in two steps. In a first step, theALIO-EURO <strong>2011</strong> – 49


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>minimum number of open stacks is <strong>de</strong>termined, and then in a secondstep, we search for a new sequence of the patterns that improvesthe total stack spread while using the optimal number ofopen stacks.In the first step the formulation is the same as before, with theobjective to minimize the maximum number of open stacks. Then,in the second step, the objective becomes the minimization of thestack spread. To minimize the average or<strong>de</strong>r spread is equivalentto minimizing the total stack spread. This is also equivalent tominimizing the number of fill-in zeros obtained in the matrix ofthe <strong>de</strong>scription of the cutting patterns after the columns have beenrearranged to match the sequence in which the patterns will beprocessed.This is done by minimizing the number of arcs that are ad<strong>de</strong>d to theMOSP graph in or<strong>de</strong>r to obtain an interval graph. As the variablesy i j are 1 when an arc is not ad<strong>de</strong>d to the graph, we can minimizethe number of ad<strong>de</strong>d arcs by maximizing the sum of the variablesy i j . Therefore the objective function in step 2 is expression (29).To guarantee that the optimal number of open stacks does not increasefrom step 1 to step 2, some of the inequalities have to bemodified accordingly. Let us <strong>de</strong>note the optimal number of openstacks found in step 1 by MOSP ∗ . For step 2, in the inequalities(8) and (15), the variable K is replaced by MOSP ∗ .5. COMPUTATIONAL RESULTSThe integer programming mo<strong>de</strong>ls were tested on the instances ofthe Constraint Mo<strong>de</strong>ling Challenge 2005, available at:http://www.cs.st-andrews.ac.uk/ ipg/challenge/instances.htmlThe instances were provi<strong>de</strong>d by the participants in the challengeand present different kinds of difficulty, such as size, sparsenessand symmetry. Computational tests were performed with ILOGOPL Development Studio 5.5 on an IntelRCore2 Duo T7200@2.00GHz 0.99GB RAM. For each instance, the best objectivevalue found by the mo<strong>de</strong>l, the best lower bound, the gap, the numberof no<strong>de</strong>s of the search tree and the runtime were recor<strong>de</strong>d.In small instances we found the optimal solution for MOSP in justa few seconds. In larger instances we found the optimal solutionin a few seconds as well, but it takes too long to prove that it isoptimal, specially in instances with many symmetries. In reallylarge instances the mo<strong>de</strong>ls could not be started because there wasnot enough memory to handle so many variables and inequalities.For the problem of minimizing the stack occupation, in the secondstep we were able to obtain the optimal solution in every instancestested. This second step allowed to reduce the or<strong>de</strong>r spread inalmost every instance, while maintaining the same optimal numberof open stacks. This reduction was very significant in many cases,<strong>de</strong>creasing around 75% of the number of ad<strong>de</strong>d edges.For the Minimum Interval Graph Completion Problem, in all ofthe instances tested, the optimal solution was reached and provedoptimal.6. ACKNOWLEDGEMENTSThis work was financially supported by the Portuguese Foun<strong>da</strong>tionfor Science and Technology (FCT) and supported by ESEIG - SuperiorSchool of Industrial Studies and Management - PolytechnicInstitute of Porto.7. REFERENCES[1] M. C. Golumbic, Algorithmic graph theory and perfectgraphs. New York: Aca<strong>de</strong>mic Press, 1980.[2] D. G. Corneil, S. Olariu, and L. Stewart, “The ultimate intervalgraph recognition algorithm? (Exten<strong>de</strong>d Abstract),” inSymposium on Discrete Algorithms, 1998, pp. 175–180.[3] A. Linhares and H. H. Yanasse, “Connections betweencutting-pattern sequencing, VLSI <strong>de</strong>sign, and flexible machines,”Computers & Operations Research, vol. 29, no. 12,pp. 1759–1772, 2002.[4] H. H. Yanasse, “Minimization of open or<strong>de</strong>rs - polynomialalgorithms for some special cases,” Pesquisa Operacional,vol. 16, no. 1, pp. 1–26, June 1996.[5] ——, “A transformation for solving a pattern sequencingproblem in the wood cut industry,” Pesquisa Operacional,vol. 17, no. 1, pp. 57–70, 1997.[6] M. Campêlo, V. A. Campos, and R. C. Corrêa, “On theasymmetric representatives formulation for the vertex coloringproblem,” Discrete Applied Mathematics, vol. 156, no. 7,pp. 1097 – 1111, 2008, GRACO 2005 - 2nd Brazilian Symposiumon Graphs, Algorithms and Combinatorics.[7] M. C. Golumbic, H. Kaplan, and R. Shamir, “On the complexityof DNA physical mapping,” Advances in Applied Mathematics,1994.ALIO-EURO <strong>2011</strong> – 50


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>OPTFRAME: A Computational Framework for Combinatorial OptimizationProblemsIgor Machado Coelho ∗ Pablo Luiz Araujo Munhoz ∗ Matheus Nohra Had<strong>da</strong>d †Vitor Nazario Coelho † Marcos <strong>de</strong> Melo Silva ∗ Marcone Jamilson Freitas Souza †Luiz Satoru Ochi ∗∗ Fluminense Fe<strong>de</strong>ral University, UFFNiteroi, RJ, Brazil{imcoelho, pmunhoz, mmsilva, satoru}@ic.uff.br† Fe<strong>de</strong>ral University of Ouro PretoOuro Preto, MG, Brazil{mathad<strong>da</strong>d, vncoelho}@gmail.com, marcone@iceb.ufop.brABSTRACTThis work presents OptFrame, a computational framework for the<strong>de</strong>velopment of efficient heuristic based algorithms. The objectiveis to provi<strong>de</strong> a simple C++ interface for common components oftrajectory and population based metaheuristics, in or<strong>de</strong>r to solvecombinatorial optimization problems. Since many methods arevery common in literature, we provi<strong>de</strong> efficient implementationsfor simple versions of these methods but the user can <strong>de</strong>velop“smarter” versions of the methods consi<strong>de</strong>ring problem-specificcharacteristics. Moreover, parallel support for both shared-memoryand distributed-memory computers is provi<strong>de</strong>d. OptFrame hasbeen successfully applied to mo<strong>de</strong>l and solve some combinatorialproblems, showing a good balance between flexibility and efficiency.Keywords: Framework, Metaheuristics, General Variable NeighborhoodSearch, TSP, Eternity II1. INTRODUCTIONIn the <strong>de</strong>velopment of optimization systems it is common to faceup with combinatorial NP-Hard problems. To produce algorithmsthat solve such problems is often a hard and long task, since thealgorithm must solve the problem with low gaps in short computationaltime. That is, the heuristic algorithm must find good solutionsat each execution. The solutions should be good enoughfor the application that uses the method and the elapsed time togenerate them must be acceptable in terms of the application. Oneway of speeding up the <strong>de</strong>velopment of such algorithms is by usingtools that provi<strong>de</strong> classic algorithms for combinatorial problems,both in practical and theoretical cases. This fact often motivatesthe use of a framework.The architecture of a framework, that typically follows the objectorientedparadigm, <strong>de</strong>fines a mo<strong>de</strong>l for co<strong>de</strong> reuse [1]. This factjustifies the <strong>de</strong>velopment of frameworks that seek to find goodsolutions for optimization problems by means of heuristics andmetaheuristics. Mainly because metaheuristics are essentially in<strong>de</strong>pen<strong>de</strong>ntof the addressed problem structure. In the context ofmetaheuristics <strong>de</strong>velopment, the <strong>de</strong>velopers that do not use anyframework or library in general expend much effort by writing andrewriting co<strong>de</strong>. Thus, the focus that should be at the problem andits efficient resolution is often directed to many programming aspects.This work presents OptFrame 1 , a white-box object oriented frameworkin C++ for the <strong>de</strong>velopment of efficient heuristic based algorithms.Our objective is to provi<strong>de</strong> a simple interface for commoncomponents of trajectory and population based metaheuristics.Since many methods are very used in literature we provi<strong>de</strong>efficient implementations for simple versions of these methods butthe user can <strong>de</strong>velop smarter versions of the methods consi<strong>de</strong>ringproblem-specific characteristics.The present work is organized as follows. Section 2 <strong>de</strong>scribessome optimization frameworks in literature. Section 3 <strong>de</strong>fines importantoptimization concepts about metaheuristics that are behindOptFrame architecture. In Section 4 we present OptFrame architecturein <strong>de</strong>tails. Section 5 conclu<strong>de</strong>s the work with some applicationsand benchmarks on the framework.2. FRAMEWORKS IN OPTIMIZATIONMany authors have already proposed frameworks for optimizationproblems, among which we cite: TabOO Buil<strong>de</strong>r [2], NP-Opt[3], HotFrame [1], EasyLocal++ [4], ParadisEO [5], iOpt [6] andjMetal [7]. Now, we present some of them in <strong>de</strong>tails.In [3] it is presented NP-Opt, a computational framework for NPclass problems. The framework proposes to minimize co<strong>de</strong> rewritingwhen the focused problem is changed. NP-Opt supports fivedistinct problems: Single Machine Scheduling, Parallel MachineScheduling, Flowshop Scheduling with job families, Grid MatrixLayout (VLSI <strong>de</strong>sign) and non-linear continuous function optimization.The built-in heuristic methods are based on Memetic andGenetic Algorithms, so as Multiple Start. The authors of NP-Optpoints to a co<strong>de</strong> reuse of 75% when <strong>de</strong>aling with a new problem.The framework is programmed in Java language.[1] present the C++ computational framework HotFrame, that sharessome similarities with OptFrame, proposed in this work. Hot-Frame, so as OptFrame, was firstly <strong>de</strong>signed for Iterated LocalSearch, Simulated Annealing and Tabu Search metaheuristics. An<strong>da</strong>lso in this sense HotFrame is very complete, since the authorsshow many implementation <strong>de</strong>tails and many variations of thesemetaheuristics. According to the authors a framework provi<strong>de</strong>sa<strong>da</strong>ptable software components, which encapsulate common domainabstractions. To <strong>de</strong>velop a framework requires solid knowledgein the consi<strong>de</strong>red domain.1 OptFrame website: http://sourceforge.net/projects/optframe/ALIO-EURO <strong>2011</strong> – 51


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>[4] point that local search is a common interest theme of scientificcommunity, at the same time that there isn’t a stan<strong>da</strong>rd softwarein this sense. So, the authors propose EasyLocal++, a computationalobject-oriented framework for the <strong>de</strong>sign and analysis oflocal search algorithms. According to the authors the architectureof EasyLocal++ allows co<strong>de</strong> modularization and the combinationof basic techniques and neighborhood structures. Some successfulapplications of EasyLocal++ are showed and according to the authorsEasyLocal++ provi<strong>de</strong>s flexibility enough for the implementationof many scheduling problems.ParadisEO [5] is a white-box object-oriented framework writtenin C++ and <strong>de</strong>dicated to the reusable <strong>de</strong>sign of parallel and distributedmetaheuristics. This framework is based on a conceptualseparation of the solution methods from the problems they are inten<strong>de</strong>dto solve. According to the authors, this separation gives theusers maximum co<strong>de</strong> and <strong>de</strong>sign reuse. ParadisEO provi<strong>de</strong>s somemodules that <strong>de</strong>als with population based metaheuristics, multiobjectiveoptimization, single-solution based metaheuristics, and italso provi<strong>de</strong>s tools for the <strong>de</strong>sign of parallel and distributed metaheuristics.ParadisEO, as the OptFrame, is one of the rare frameworksthat provi<strong>de</strong> parallel and distributed mo<strong>de</strong>ls. Their implementationis portable on distributed-memory machines as well ason shared-memory multiprocessors, as it uses stan<strong>da</strong>rd librariessuch as MPI, PVM and PThreads.The Intelligent Optimization Toolkit (iOpt), proposed by [6] can beseen as an IDE for the rapid construction of combinatorial problems.The iOpt takes as input problems mo<strong>de</strong>led in one-way constraintsand uses metaheuristics to solve them. The authors showhow to mo<strong>de</strong>l the Vehicle Routing Problem with iOpt and goodresults are reported. Finally, the authors conclu<strong>de</strong> that a better un<strong>de</strong>rstandingof the problem can be achieved by a fairer comparisonbetween heuristic methods.jMetal [7] is an object-oriented Java-based framework aimed atfacilitating the <strong>de</strong>velopment of metaheuristics for solving multiobjectiveoptimization problems (MOPs). According to the authors,this framework provi<strong>de</strong>s a rich set of classes which can beused as the building blocks of multi-objective metaheuristics; thus,taking advantage of co<strong>de</strong>-reusing, the algorithms share the samebase components, such as implementations of genetic operatorsand <strong>de</strong>nsity estimators, so making the fair comparison of differentmetaheuristics for MOPs possible.In general, frameworks are based on previous experience with theimplementation of many methods for different problems. In thiswork we also review some important concepts of combinatorialproblems and metaheuristics, in or<strong>de</strong>r to propose an architecturethat is both problem and heuristic in<strong>de</strong>pen<strong>de</strong>nt. The following sectionshows the theoretical mo<strong>de</strong>ling of combinatorial problems behindOptFrame architecture.3. METAHEURISTICSWe present now some important concepts of metaheuristics andcombinatorial optimization problems.Let S be a set of discrete variables s (called solutions) and f : S → Ran objective function that associates each solution s ∈ S to a realvalue f (s). We seek any s ∗ ∈ S such that f (s ∗ ) f (s),∀s ∈ S forminimization problems, or f (s ∗ ) f (s),∀s ∈ S for maximizationproblems. The solution s ∗ is called a global optimum.A function N associates a solution s ∈ S to a set N(s) ⊆ S (calledneighborhood of s). This is also an important concept in the subjectof heuristic based algorithms. This way, a neighbor s ′ of sis such that s ′ = s ⊕ m, where m is called a move operation. Thecost of a move m is <strong>de</strong>fined as ̂f = f (s ′ ) − f (s), which means thats ′ = s⊕m =⇒ f (s ′ ) = f (s)+ ̂f . So, a local optimum (in terms of aneighborhood N) is a solution s ′ such that f (s ′ ) f (s),∀s ∈ N(s ′ )for minimization problems, or f (s ′ ) f (s),∀s ∈ N(s ′ ) for maximizationproblems.Many combinatorial optimization problems are classified as NP-Hard and it is common to use heuristic algorithms to find goodsolutions for these problems. These methods have the capabilityof finding good local optimums in short computational times.Classical local search heuristics stop on the first local optimumfound. However, metaheuristics can go beyond the local optimumand thus these methods are able to produce final solutions of betterquality.4. OPTFRAMEOptFrame is a white-box object oriented framework in C++. Inthe following sections its implementation and <strong>de</strong>sign aspects arepresented and discussed.4.1. Representation and MemoryThe OptFrame framework is mainly based on two important structures:the solution representation and the memory.The Representation is the <strong>da</strong>ta structure used to represent a validsolution for a specific problem. For example, for the TravelingSalesman Problem (TSP) [8] a user may wish to represent the solutionas an array of integers. In this case, the representation in thisheuristic approach for TSP is vector < int > (in C++ language).On the other hand, the Memory is a set of auxiliary <strong>da</strong>ta structuresnee<strong>de</strong>d for a smarter version of the method.4.2. Solution and EvaluationThere are two important container classes 2 in OptFrame: Solutionand Evaluation. Solution carries a reference to a Representation ofthe problem, while a Evaluation carries a reference to a Memorystructure. To <strong>de</strong>velop a smarter version of the method, the informationin the Memory structure along with an earlier evaluationcan be used to reevaluate a Solution in a faster way, for example.4.3. EvaluatorsThe Evaluator concept is very important in OptFrame. It encapsulatesthe function f : S → R (<strong>de</strong>fined in Section 3) as an specificcase of its function f : S → E, where E = (R,R,M). The tuple Ecan be seen as the Evaluation class <strong>de</strong>fined in Subsection 4.2.The first value of the tuple E is the objective function value itselfand the second one is an infeasibility measure value. By evaluatinga solution this way you can implement heuristic methodsthat are able to see unfeasible solutions, by giving a high penaltyvalue to the infeasibility measure value. When the infeasibilitymeasure value is zero the solution is consi<strong>de</strong>red feasible. So, theevaluation function value over a solution consists in the sum ofob jective_ f unction_value + in f easibility_measure_value.The third value M of the tuple E is called memory <strong>de</strong>fined in Subsection4.1. In this context the memory can record some steps ofthe evaluation algorithm, so they won’t be repeated in future evaluations.This way, some future computational effort can be avoi<strong>de</strong>d.2 What we name here as a container class is in some ways related to withProxy Pattern [9] since the i<strong>de</strong>a is to carry a reference to an object (representationor memory) and to <strong>de</strong>lete it when the container itself is <strong>de</strong>stroyed.But in this case a container is also used to provi<strong>de</strong> some extra operationsover the carried object like printing, reference counting and cloning.ALIO-EURO <strong>2011</strong> – 52


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>There is also a more general <strong>de</strong>finition for the evaluation methodwhere the function f is <strong>de</strong>fined by f : (S,E) → E. This way itis possible to <strong>de</strong>velop smarter versions of a Evaluator by usinginformations of a previous evaluation E.4.4. MovesA move operation <strong>de</strong>fines a neighborhood structure. In OptFramethe Move class has two most important methods: canBeApplie<strong>da</strong>nd apply.The canBeApplied method of a Move object m returns true if theapplication of m to a solution s will produce a valid solution. Otherwiseit returns false. This is method is often used before theapply method.The apply method of a Move m to a solution s transforms s into aneighbor s ′ and returns another Move m that can undo the changesma<strong>de</strong> by m. Since complete copies of solutions are expensive operationsit is possible to avoid them by <strong>de</strong>veloping efficient implementationsof the reverse Move m.4.5. Neighborhood StructuresThere are three types of neighborhood structure in OptFrame: NS,NSSeq and NSEnum.NS is the simplest <strong>de</strong>finition of a neighborhood structure. It onlyrequires the user to <strong>de</strong>fine a move(s) method, that returns a randommove operation of the neighborhood type. Although not in focusof this paper, it is possible to <strong>de</strong>fine neighborhood structures forcontinuous problems optimization using this kind of structure.NSSeq is a more elaborated version of NS. It also requires the userto <strong>de</strong>fine a getIterator(s) method, that returns an object capableof generating moves of the neighborhood structure in a sequentialway. The returned object must implement the NSIterator interface,that itself implements the Iterator Pattern [9].NSEnum is the most complete <strong>de</strong>finition of a neighborhood structurein OptFrame. It provi<strong>de</strong>s an enumerable set of move operationsfor a given combinatorial problem. Although it only requiresthe user to <strong>de</strong>fine the move(int) and size() methods, with thesemethods it is possible to <strong>de</strong>fine <strong>de</strong>fault implementations for themove(s) and getIterator(s) methods of NS and NSSeq.4.6. Heuristic based methodsHeuristic methods are mainly divi<strong>de</strong>d in two classes: trajectorybased and population based methods [10].In or<strong>de</strong>r to maximize the co<strong>de</strong> reuse and to favor testing of HybridMetaheuristics [11], all heuristic methods should be implementedusing the Heuristic class abstraction. With this abstraction we havealready been able to implement the following methods: First Improvement,Best Improvement, Hill Climbing and other classicalheuristic strategies [12]; Iterated Local Search, Simulated Annealing,Tabu Search, Variable Neighborhood Search and other basicversions of many famous trajectory based metaheuristics [13]; and,finally, the basic versions of population based metaheuristics GeneticAlgorithm and Memetic Algorithm [13].So, there are four <strong>de</strong>finitions of the method exec and the user mustimplement at least two of them. For trajectory based heuristics, theuser must implement:void exec(Solution){ ... }void exec(Solution, Evaluation){ ... }For population based heuristics:void exec(Population){ ... }void exec(Population, FitnessValues){ ... }where: Population is a list of Solutions andFitnessValues is a list of Evaluations.The first one is the simplest version of the method while the secondis a more elaborated version. But if the user wish to implementonly one of them it is possible to implement one and the other oneonly calls the first.4.7. Other structuresSome metaheuristics may require specific structures, but they canalso be <strong>de</strong>fined in specific files, e.g., Perturbation for Iterated LocalSearch; Mutation and Crossover operators for Genetic and MemeticAlgorithms.5. COMPUTATIONAL EXPERIMENTS ANDCONCLUDING REMARKSThis work presents OptFrame, a white-box object oriented frameworkin C++ for the <strong>de</strong>velopment of efficient heuristic based algorithms.Our objective is to provi<strong>de</strong> a simple interface for commoncomponents of trajectory and population based metaheuristics.OptFrame’s architecture is inten<strong>de</strong>d to minimize the differencesamong co<strong>de</strong> and theoretical concepts of combinatorial optimization.Thus, this paper <strong>de</strong>scribes a C++ mo<strong>de</strong>ling of the framework,but this mo<strong>de</strong>l can also be applied to other programminglanguages, since generic programming features are available.As a benchmark for the framework, we propose to implement aheuristic algorithm based on General Variable Neighborhood Search[14] for two different optimization problems.The first problem is the classical Traveling Salesman Problem, andthe second is the Eternity II Puzzle optimization problem (more<strong>de</strong>tails on [15]). We also want to show the flexibility of the <strong>de</strong>velopedinterface by implementing the proposed heuristic in twodifferent programming languages: C++ and Java 3 .To guarantee that the algorithms will follow the same paths (evenon different languages), we have implemented the Mersenne Twister[16] random number generator, using the same seeds for both tests.Table 1 shows the average time (in seconds) of 10 executions of theproposed algorithm. “Java GCJ” is a compiled version of the Javaco<strong>de</strong> (using the most optimized flags); “Java JRE” is an interpretedversion of the Java co<strong>de</strong>; and “C++” is a compiled version of theco<strong>de</strong> using GCC compiler (with the most optimized flags).Table 1: Computational experimentsJava GCJ (s) Java JRE (s) C++ (s)Eternity II 121.60 33.08 8.35TSP 115.52 33.45 7.32As expected, in both problems C++ got the lowest computationaltimes, while the compiled Java version got the highest times. Theinterpreted version of Java was faster than the compiled one, butslower than C++ version.This way, OptFrame showed to be a good tool for a fair comparisonbetween heuristic methods for different optimization problems an<strong>de</strong>ven with different programming languages.3 The Java version of OptFrame is called JOptFrame and it is also availableon http://sourceforge.net/projects/joptframe/ALIO-EURO <strong>2011</strong> – 53


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>OptFrame is a free software licensed un<strong>de</strong>r LGPLv3. The <strong>de</strong>velopmentversion and newer stable version of OptFrame are available athttp://sourceforge.net/projects/optframe/. Ithas been successfully applied to mo<strong>de</strong>l many realistic optimizationproblems.Users are invited to visit our homepage and collaborate with theproject. Co<strong>de</strong> reuse must be maximized, with clear abstractionsbased on optimization concepts, but always keeping in mind thatthe target user should use only simple C++ on his/her co<strong>de</strong>.6. ACKNOWLEDGMENTSThe authors are grateful to CNPq (CT-INFO and UNIVERSAL),CAPES (PROCAD and PRO-ENG), FAPERJ and FAPEMIG thatpartially fun<strong>de</strong>d this research.7. REFERENCES[1] A. Fink and S. Voß, “HotFrame: a heuristic optimizationframework,” in Optimization Software Class Libraries,S. Voß and D. L. Woodruff, Eds. Boston: Kluwer Aca<strong>de</strong>micPublishers, 2002, pp. 81–154.[2] M. Graccho and S. C. S. Porto, “TabOOBuil<strong>de</strong>r: An objectorientedframework for building tabu search applications.” in<strong>Proceedings</strong> of the Third Metaheuristics International Conference,Angra dos Reis, Rio <strong>de</strong> Janeiro, 1999, pp. 247–251.[3] A. Men<strong>de</strong>s, P. França, and P. Moscato, “NP-Opt: an optimizationframework for np problems,” in <strong>Proceedings</strong> of theIV SIMPOI/POMS 2001, Guarujá, São Paulo, 2001, pp. 11–14.[4] L. D. Gaspero and A. Schaerf, “EasyLocal++: an objectorientedframework for the flexible <strong>de</strong>sign of local-search algorithms,”Softw. Pract. Exper., vol. 8, no. 33, pp. 733–765,2003.[5] S. Cahon, N. Melab, and E.-G. Talbi, “Paradiseo: A frameworkfor the reusable <strong>de</strong>sign of parallel and distributed metaheuristics,”Journal of Heuristics, vol. 10, no. 3, pp. 357–380, 2004.[6] R. Dorne, P. Mills, and C. Voudouris, “Solving vehicle routingusing iOpt,” in <strong>Proceedings</strong> of MIC 2005 - The 6th MetaheuristicsInternational Conference, Viena, Áustria, 2005.[7] J. J. Durillo, A. J. Nebro, F. Luna, B. Dorronsoro, andE. Alba, “jMetal: A java framework for <strong>de</strong>veloping multiobjectiveoptimization metaheuristics,” Departamento <strong>de</strong>Lenguajes y Ciencias <strong>de</strong> la Computación, University ofMálaga, E.T.S.I. Informática, Campus <strong>de</strong> Teatinos, Tech.Rep. ITI-2006-10, 2006.[8] D. L. Applegate, R. E. Bixby, V. Chvatal, and W. J. Cook,The Traveling Salesman Problem: A Computational Study.United Kingdom: Princeton University Press, 2006.[9] E. Gamma, R. Helm, R. Johnson, and J. Vlissi<strong>de</strong>s, DesignPatterns: Elements of Reusable Object-Oriented Software.Addison-Wesley, 1995.[10] C. Ribeiro and M. Resen<strong>de</strong>, “Path-relinking intensificationmethods for stochastic local search algorithms,” AT&T LabsResearch, Tech. Rep. NJ 07932, 2010.[11] C. Blum and A. Roli, Hybrid Metaheuristics. Springer,2008.[12] P. Hansen and N. Mla<strong>de</strong>nović, “First vs. best improvement:an empirical study,” Discrete Appl. Math., vol. 154, no. 5, pp.802–817, 2006.[13] F. W. Glover and G. A. Kochenberger, Handbook of Metaheuristics.Springer, 2003.[14] Hansen, Mla<strong>de</strong>novic, and Perez, “Variable neighborhoodsearch: methods and applications,” 4OR: Quarterly journalof the Belgian, French and Italian operations research societies,vol. 6, pp. 319–360, 2008.[15] I. M. Coelho, B. N. Coelho, V. N. Coelho, M. N. Had<strong>da</strong>d,M. J. F. Souza, and L. S. Ochi, “A general variable neighborhoodsearch approach for the resolution of the eternity ii puzzle,”in International Conference on Metaheuristics and NatureInspired Computing, Tunisia, Djerba Island, 2010, p. 3.[16] M. Matsumoto and T. Nishimura, “Mersenne twister: a 623-dimensionally equidistributed uniform pseudo-random numbergenerator,” ACM Trans. Mo<strong>de</strong>l. Comput. Simul., vol. 8,pp. 3–30, January 1998.ALIO-EURO <strong>2011</strong> – 54


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>RAMP: An Overview of Recent Advances and ApplicationsDorabela Gamboa ∗ César Rego †∗ Escola Superior <strong>de</strong> Tecnologia e Gestão <strong>de</strong> Felgueiras, CIICESI, GECAD, Instituto Politécnico do PortoApt. 205, 4610-156 Felgueiras, Portugaldgamboa@estgf.ipp.pt† School of Business Administration, University of MississippiMS 38677, USAcrego@bus.olemiss.eduABSTRACTThe Relaxation A<strong>da</strong>ptive Memory Programming (RAMP) metaheuristicapproach has been applied to several complex combinatorialoptimization problems, exhibiting an extraordinary performanceby producing state-of-the art algorithms. We <strong>de</strong>scribe someof these applications and consi<strong>de</strong>r mo<strong>de</strong>ling techniques and implementation<strong>de</strong>tails that proved effective in enhancing RAMP algorithms.Keywords: RAMP, Scatter Search, Cross-Parametric Relaxation,A<strong>da</strong>ptive Memory, Metaheuristics1. INTRODUCTIONIn recent years, innovations in metaheuristic search methods haveexpan<strong>de</strong>d our ability to solve hard problems, and have increasedthe size of problems that can be consi<strong>de</strong>red computationally tractable.Advances have notably come from <strong>de</strong>signs of variable-<strong>de</strong>pthneighborhood constructions [1, 2] and the application of a<strong>da</strong>ptivememory search methods originated by the framework of TabuSearch [3, 4], and from recent <strong>de</strong>velopments in the area of evolutionarymethods represented by the frameworks of Genetic Algorithms[5], Evolutionary Programming [6] and Scatter Search [7].Some of the most significant advances <strong>de</strong>rive from a marriage ofthe a<strong>da</strong>ptive memory Tabu Search approaches with the evolutionarymethod of Scatter Search (SS). Scatter Search embodies manyof the principles of Tabu Search, and the union of these methods istypically implicit in SS applications.A new advance has occurred with the emergence of RelaxationA<strong>da</strong>ptive Memory Programming (RAMP), a method that integratesAMP with mathematical relaxation procedures to produce a unifiedframework for the <strong>de</strong>sign of dual and primal-dual metaheuristicsthat take full advantage of a<strong>da</strong>ptive memory programming [8].The RAMP metaheuristic has been applied to several complexcombinatorial optimization problems, exhibiting an extraordinaryperformance by producing state-of-the art algorithms. We <strong>de</strong>scribesome of these applications and consi<strong>de</strong>r mo<strong>de</strong>ling techniques andimplementation <strong>de</strong>tails that proved effective in enhancing RAMPalgorithms.RAMP) and its primal-dual extension (PD-RAMP). The RAMPmethod, at the first level, operates by combining fun<strong>da</strong>mental principlesof mathematical relaxation with those of a<strong>da</strong>ptive memoryprogramming, as expressed in tabu search. The exten<strong>de</strong>d PD-RAMP method, at the second level, integrates the RAMP approachwith other more advanced strategies. We i<strong>de</strong>ntify specific combinationsof such strategies at both levels, based on Lagrangeanand surrogate constraint relaxation on the dual si<strong>de</strong> and on scattersearch and path relinking on the primal si<strong>de</strong>, in each instancejoined with appropriate gui<strong>da</strong>nce from a<strong>da</strong>ptive memory processes.The framework invites the use of alternative procedures for both itsprimal and dual components, including other forms of relaxationsand evolutionary approaches such as genetic algorithms and otherprocedures based on metaphors of nature.The implementation mo<strong>de</strong>l of a RAMP algorithm can be seen asan incremental process, starting with one of the simplest forms ofthe method and successively applying more complex forms, adjustingthe <strong>de</strong>sign of the algorithm based on the analysis of theresults obtained in previous levels of implementation in the questfor attaining the best results possible.An instance of such an incremental approach may be illustrated bythe application of the RAMP method to the Capacitated MinimumSpanning Tree (CMST) [9]. In this application, the <strong>de</strong>velopment ofthe RAMP algorithm involved the following incremental steps: (1)the <strong>de</strong>sign of a basic surrogate constraints relaxation coupled witha projection method based on a constructive heuristic; (2) the <strong>de</strong>signof an enhanced surrogate relaxation using cutting planes; (3)the <strong>de</strong>velopment of tabu search procedure used as an improvementmethod; (4) the implementation of a subgradient-based procedureto appropriately connect primal with dual components of the algorithm;(4) the <strong>de</strong>velopment of a scatter search solution combinationmethod to create compound memory structures.Recent applications featuring the <strong>de</strong>sign and implementation ofeffective RAMP algorithms in a variety of settings ranging fromfacility location to assignment and resource allocation <strong>de</strong>monstratethe effectiveness of this approach. These problems are classicalin combinatorial optimization and arise in numerous applications.The results obtained for these recognizably difficult problemsclearly <strong>de</strong>monstrate the superiority of the RAMP methodcomparatively to the current state of the art algorithms for the solutionof these problems.2. RAMPThe Relaxation A<strong>da</strong>ptive Memory Programming framework is embodiedin two approaches, its basic form (Simple RAMP or just3. CONCLUSIONSIn spite of its freshness, the RAMP framework has already showngreat potential by obtaining excellent results with every applica-ALIO-EURO <strong>2011</strong> – 55


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>tion of the method <strong>de</strong>veloped so far. In fact, in all these applications,the method revealed impressive effectiveness, frequentlyattaining optimal solutions for the problems tested, and in manycases, where the optimal solutions are unknown, the method findssolutions with better quality than the previously best known.4. ACKNOWLEDGEMENTSThe authors would like to acknowledge FCT, FEDER, POCTI,POSI, POCI, POSC, and COMPETE for their support to R&DProjects.5. REFERENCES[1] R. K. Ahuja, O. Ergun, J. B. Orlin, and A. P. Punnen, “A surveyof very large-scale neighborhood search techniques,” DiscreteApplied Mathematics, vol. 123, pp. 75–102, 2002.[2] C. Rego and F. Glover, “Ejection chain and filter-and-fanmethods in combinatorial optimization,” Annals of OperationsResearch, vol. 175, pp. 77–105, 2010.[3] F. Glover, “Tabu search - Part I,” ORSA Journal on Computing,vol. 1, pp. 190–206, 1989.[4] ——, “Tabu search - Part II,” ORSA Journal on Computing,vol. 2, pp. 4–32, 1990.[5] C. Reeves, Mo<strong>de</strong>rn Heuristic Techniques for CombinatorialProblems. Blackwell Scientific Publishing, 1993.[6] D. B. Fogel, “Evolutionary programming: An introductionand some current directions,” Statistics and Computing, vol. 4,pp. 113–130, 1994.[7] F. Glover, Scatter Search and Path Relinking. McGraw Hill,1999, pp. 297–316.[8] C. Rego, RAMP: A New Metaheuristic Framework for CombinatorialOptimization. Kluwer Aca<strong>de</strong>mic Publishers, 2005,pp. 441–460.[9] C. Rego, F. Mathew, and F. Glover, “Ramp for the capacitatedminimum spanning tree problem,” Annals of Operations Research,vol. 181, pp. 661–681, 2010.ALIO-EURO <strong>2011</strong> – 56


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>A Polyhedral Study of Mixed 0-1 SetsAgostinho Agra ∗Mahdi Doostmohammadi ∗∗ Department of Mathematics and CIDMAUniversity of Aveiro{aagra, mahdi}@ua.ptABSTRACTWe consi<strong>de</strong>r a variant of the well-known single no<strong>de</strong> fixed chargenetwork flow set with constant capacities. This set arises from therelaxation of more general mixed integer sets such as lot-sizingproblems with multiple suppliers. We provi<strong>de</strong> a complete polyhedralcharacterization of the convex hull of the given set.Keywords: Mixed Integer Set, Polyhedral Description, Valid Inequality,Convex Hull1. INTRODUCTIONWe consi<strong>de</strong>r mixed integer sets of the formX = {(w,z,y) ∈ R n + × B n × B |where N = {1,...,n}.}∑ w j ≤ Dy, (1)j∈Nw j ≤ Cz j , j ∈ N (2)These sets are much related with the well-known single no<strong>de</strong> fixedcharge network flow setW = {(w,z) ∈ R n + × B n | ∑ w j ≤ D,w j ≤ Cz j , j ∈ N}j∈Nwhile binary variables z j are associated with the arcs inci<strong>de</strong>nt tothe no<strong>de</strong> (see Figure 1), indicating whether the arc is open or not,binary variable y in associated with the no<strong>de</strong> itself. These binaryvariables allow us to mo<strong>de</strong>l cases where there are fixed costs associatedto the use of each arc and no<strong>de</strong>, respectively.w 1 ≤ Cz l♣♣❍ ✛✘❍❥✚✙w ✟ ✟✯n ≤ Cz n❄ DyFigure 1: Single no<strong>de</strong> fixed charge set.Here we investigate the polyhedral <strong>de</strong>scription of the convex hullof X, <strong>de</strong>noted by P. This study is motivated by the interest intightening more general mixed integer sets, and, in particular, thesingle-item Lot-sizing with Supplier Selection (LSS) problem. Inthe LSS problem a set of suppliers is given, and in each time periodone needs to <strong>de</strong>ci<strong>de</strong> a subset of suppliers to select and thelot-sizes. Let T be the set of production periods and N be the setof suppliers. We assume that d t > 0 is the <strong>de</strong>mand in period t ∈ T ,h t is unit holding cost, f p t and p t represent the production set-upcost and variable production cost in period t, respectively, and c jtand f s jt are variable and fixed sourcing set-up cost for supplier jin period t. D and C are production and supplying capacities. Inaddition, several types of <strong>de</strong>cision variables are <strong>de</strong>fined. We let x tbe the quantity produced in period t; s t be the stock level at the endof period t ∈ T ; w jt be the quantity sourced from supplier j ∈ Nin period t ∈ T . We <strong>de</strong>fine also the binary variables y t indicatingwhether there is a setup for production in period t or not, and z jttaking value 1 if and only if supplier j is selected in period t. TheLSS problem can be formulated as follows (see [5]):Min∑ h t s t + ∑ ∑ (p t + c jt )w jt + ∑ f p t y t + ∑ ∑ f s jt z jtt∈T t∈T j∈Nt∈T t∈T j∈Ns.t. s t−1 + x t = d t + s t : ∀t ∈ T, (3)x t ≤ Dy t : ∀t ∈ T, (4)x t = ∑ w jt : ∀t ∈ T, (5)j∈Nw jt ≤ Cz jt : ∀ j ∈ N,∀t ∈ T, (6)s 0 = s |T | = 0, (7)x t ,s t ≥ 0 : ∀t ∈ T, (8)w jt ≥ 0 : ∀ j ∈ N,∀t ∈ T, (9)y t ∈ {0,1} : ∀t ∈ T, (10)z jt ∈ {0,1} : ∀ j ∈ N,∀t ∈ T. (11)For a fixed t, set X arises from (4)-(6), (9)-(11). Valid inequalitiesfor W can be converted into valid inequalities for X.The polyhedral <strong>de</strong>scription of the convex hull of W, <strong>de</strong>noted byQ is given [4]. In [2] is studied the polyhedral characterizationa similar set where lower bounds are imposed on the flow on thearcs. Valid inequalities for SNFC sets with multiple upper andlower bounds also in [3].We study the polyhedral characterization of P. Although X is veryclose related to W, and valid inequalities for X can be easily convertedinto valid inequalities for W and vice-versa, we show that Phas, in general, many more facet-<strong>de</strong>fining inequalities than Q. Ourmain contribution is the full polyhedral <strong>de</strong>scription of P.2. POLYHEDRAL RESULTSIn this section we provi<strong>de</strong> a polyhedral characterization of P an<strong>de</strong>stablish the main differences between polyhedra P and Q. Weassume D > C > 0 and assume that C does not divi<strong>de</strong> D.We start by an intuitive result.Proposition 2.1. P and Q are full dimensional polyhedra.It is well-known, see [4], that in addition to inequalities <strong>de</strong>finingW, the following set of facet-<strong>de</strong>fining inequalities is enough to <strong>de</strong>-ALIO-EURO <strong>2011</strong> – 57


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>scribe Q.∑(w j − rz j ) ≤ D − ⌈ Dj∈SC ⌉r, S ⊆ N,|S| ≥ ⌈ D ⌉, (12)Cwhere r = D − ⌊ D C ⌋C.Polyhedral <strong>de</strong>scription of P is somewhat more complex. It is notdifficult to verify the following property relating valid inequalitiesfor the two sets.Proposition 2.2. The inequalityis valid for W, if and only ifis valid for X.∑ α j w j + ∑ β j z j ≤ αj∈N j∈N∑ α j w j + ∑ β j z j ≤ αyj∈N j∈NOne can also check that facet-<strong>de</strong>fining inequalities for Q are convertedinto facet-<strong>de</strong>fining inequalities for P. However, the conversedoes not hold in general.Next we introduce two families of valid inequalities for P.Proposition 2.3. Let D > C > 0 and assume D is not a multiple ofC. The inequalityis valid for X.w j ≤ Cy, j ∈ N, (13)Proof: Validity of (13) follows from (2) and z j ≤ 1.Proposition 2.4. Let D > C > 0 and assume D is not a multipleof C. Define r = D − ⌊ D C ⌋C. Let S 1,S 2 ⊆ N such that S 1 ∩ S 2 = /0and 0 ≤ |S 1 | < ⌈ D C ⌉, ⌈ D C ⌉ ≤ |S 1| + |S 2 | ≤ n. Then the followinginequality is valid for X.where k = ⌈ D C ⌉ − |S 1|.∑ w j + ∑ (w j − rz j ) ≤ (D − kr)y (14)j∈S 1 j∈S 2Proof: We prove the validity as follows. If y = 0, then constraint(1) implies that w j = 0,∀ j ∈ N. Since w j = 0, z j ≥ 0,∀ j ∈ N, andr > 0, so the inequality (14) is satisfied.If y = 1, then we take a k = ⌈ D C ⌉ − |S 1|. Inequality (14) can berepresented in the following way.∑ w j ≤ D + r( ∑ z j − k) (15)j∈S 1 ∪S 2 j∈S 2We consi<strong>de</strong>r the following two cases.(i) If ∑ j∈S2 z j ≥ k, then r(∑ j∈S2 z j − k) ≥ 0. So,∑ w j ≤ D ≤ D + r( ∑ z j − k)j∈S 1 ∪S 2 j∈S 2which shows that (15) is satisfied.□(ii) If ∑ j∈S2 z j = k − a with a ≥ 1, then we must prove that∑ j∈S1 ∪S 2w j ≤ D − ar. So by the assumption, <strong>de</strong>finitions ofk and r, and the fact that C > r,∑ w j = ∑ w j + ∑ w j ≤ C |S 1 | + ∑ Cz jj∈S 1 ∪S 2 j∈S 1 j∈S 2 j∈S 2= C(|S 1 | + ∑j∈S 2z j ) = C(|S 1 | + k − a)= C(⌈ D C ⌉ − a) = C(⌊ D C ⌋ + 1 − a)= C⌊ D C ⌋ −C(a − 1) ≤ C⌊ D ⌋ − r(a − 1)C= D − r − r(a − 1) = D − arTherefore (14) is valid for X.A key point not shown here is that (13) and (14) <strong>de</strong>fine facetsof P. From Proposition 2.2, valid inequalities for X are valid forW. However, consi<strong>de</strong>ring for instance (14) with S 1 ≠ /0, the correspondingvalid inequality for W∑ w j + ∑ (w j − rz j ) ≤ D − kr,j∈S 1 j∈S 2do not <strong>de</strong>fine a facet of Q since every point lying in the face <strong>de</strong>finedby the inequality must satisfy z j = 1, j ∈ S 1 .Example 2.5. Consi<strong>de</strong>r an instance with n = 4, D = 11, andC = 4. Using the software PORTA we obtain 57 facet-<strong>de</strong>fininginequalities for P and 18 facet-<strong>de</strong>fining inequalities for Q. Consi<strong>de</strong>ringthe case with y, we have the following examples of facet<strong>de</strong>fininginequalities for k = 1,2, and 3.w 1 + w 2 + w 4 − 3z 2 ≤ 8y : k = 1,w 1 + w 2 + w 3 − 3z 2 − 3z 3 ≤ 5y : k = 2,w 1 + w 2 + w 3 − 3z 1 − 3z 2 − 3z 3 ≤ 2y : k = 3.Note that for k = 3, there exist 5 facet-<strong>de</strong>fining inequalities for Pand these inequalities appear in Q as a facet-<strong>de</strong>fining inequalitiesby setting y = 1. However for k = 1 and k = 2 the correspondinginequalities for Q, obtained by setting y = 1 are not facet-<strong>de</strong>fining.Next we establish the main result.Theorem 2.6. The <strong>de</strong>fining inequalities of X with the inequalities(13) and (14) suffice to <strong>de</strong>scribe the convex hull of P.3. CONCLUSION AND FUTURE RESEARCHWe provi<strong>de</strong> a polyhedral <strong>de</strong>scription of a mixed 0-1 set which canbe regar<strong>de</strong>d as a variant of the single no<strong>de</strong> fixed charge networkflow set where setups are associated to the no<strong>de</strong> and to each arc.We consi<strong>de</strong>r the constant capacitated case. Although this set ismuch related to the well-known set W (where there is not binaryvariable associated to the no<strong>de</strong>) we have shown that many newfacets appear that can not be obtained from facets of the convexhull of W. Some results established here can easily be generalizedfor the case with different capacities on the arcs. Currentlywe are following this direction of research as well as investigatingthe new facet-<strong>de</strong>fining inequalities that might occur for theset with constant lower bounds whose polyhedral <strong>de</strong>scription wasstudied by Constantino [2] and the set with constant lower and upperbounds whose polyhedral <strong>de</strong>scription was given by Agra andConstantino[1].□ALIO-EURO <strong>2011</strong> – 58


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>4. REFERENCES[1] A. Agra and M. Constantino, "Polyhedral <strong>de</strong>scription of theinteger single no<strong>de</strong> flow set with constant bounds", MathematicalProgramming, vol. 105, no. 2-3, pp. 345-364, 2006.[2] M. Constantino, "Lower Bounds in Lot-sizing Mo<strong>de</strong>ls: aPolyhedral Study", Mathematics of Operations Research,vol. 23, no. 1, pp. 101-118, 1998.[3] M. X. Goemans, "Valid Inequalities and Separation forMixed 0-1 Constraints with Variable Upper Bounds", OperationsResearch Letters, vol. 8, pp. 315-322, 1989.[4] M. W. Padberg and T. J. Van Roy and L. A. Wolsey, "ValidLinear Inequalities for Fixed Charge Problems", OperationsResearch, vol. 22, no. 4, pp. 842-861, 1985.[5] Y. Zhao and D. Klabjan, "A Polyhedral Study of Lot-sizingwith Supplier Selection", to appear in Discrete Optimization.ALIO-EURO <strong>2011</strong> – 59


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Multi-Objective Economic Lot-Sizing Mo<strong>de</strong>lsWilco van <strong>de</strong>n Heuvel ∗ H. Edwin Romeijn † Dolores Romero Morales ‡Albert P.M. Wagelmans ∗∗ Econometric Institute, Erasmus University Rotter<strong>da</strong>mP.O. Box 1738, 3000 DR Rotter<strong>da</strong>m, The Netherlandswvan<strong>de</strong>nheuvel@ese.eur.nl,wagelmans@few.eur.nl† Department of Industrial and Operations Engineering, University of Michigan1205 Beal Avenue, Ann Arbor, Michigan 48109-2117, USAromeijn@umich.edu‡ Saïd Business School, University of OxfordPark End Street, Oxford OX1 1HP, United Kingdomdolores.romero-morales@sbs.ox.ac.ukABSTRACTNowa<strong>da</strong>ys, companies are forced to think about their environmentalimpact and their levels of pollution. In the production setting,pollution stems from the setup of the machinery, the functioningof the machinery during production as well as from holding inventory.Bearing in mind this environmental awareness, the choice ofa production plan can be mo<strong>de</strong>led as a Multi-Objective EconomicLot-Sizing problem, in which we aim at minimizing the total lotsizingcosts including production and inventory holding costs, aswell as minimizing the total production and inventory emissioncosts. Different multi-objective optimization mo<strong>de</strong>ls can be obtained<strong>de</strong>pending on time horizon in which the emissions are minimized.We can minimize the emission costs for the whole planninghorizon, yielding a bi-objective mo<strong>de</strong>l (BOLS), or we canminimize the emission costs in each period of the planning horizonyielding a truly multi-objective optimization mo<strong>de</strong>l (MOLS).In this talk, we aim at <strong>de</strong>scribing Pareto efficient solutions for both(BOLS) and (MOLS). We first show that, in general, this task isNP-complete. We then present classes of problem instances forwhich these Pareto efficient solutions can be found in polynomialtime.Keywords: Lot-sizing, Pollution, Pareto efficient solutions1. INTRODUCTIONNowa<strong>da</strong>ys, companies are forced to think about their environmentalimpact and their levels of pollution. In the production setting,pollution stems from the setup of the machinery, the functioningof the machinery during production as well as from holding inventory.Bearing in mind this environmental awareness, the choice ofa production plan can be mo<strong>de</strong>led as a Multi-Objective EconomicLot-Sizing problem. This is a generalization of the Economic Lot-Sizing Problem (ELSP) in which we aim at minimizing the totallot-sizing costs including production and inventory holding costs,as well as minimizing the total production and inventory emissioncosts.Consi<strong>de</strong>r a planning horizon of length T . For period t, let f t bethe setup lot-sizing cost, c t the unit production lot-sizing cost, h tthe unit inventory holding lot-sizing cost, and d t the <strong>de</strong>mand. Similarly,for period t, let ˆf t be the setup emission cost, ĉ t the unitproduction emission cost and ĥ t the unit inventory emission holdingcost. Let M be a constant such that M ≥ ∑ T t=1 d t.Let us consi<strong>de</strong>r the following biobjective economic lot-sizing mo<strong>de</strong>l(BOLS):( Tminimize ∑ [ f t y t + c t x t + h t I t ],t=1subject toT∑t=1)[ ˆf t y t + ĉ t x t + ĥ t I t ](BOLS)x t + I t−1 = d t + I t t = 1,...,T (1)x t ≤ My t t = 1,...,T (2)I 0 = 0 (3)y t ∈ {0,1} t = 1,...,Tx t ≥ 0 t = 1,...,TI t ≥ 0 t = 1,...,Twhere y t indicates whether a setup has been placed in period t,x t <strong>de</strong>notes the quantity produced in period t, and I t <strong>de</strong>notes theinventory level at the end of period t. In the following, we willrefer to a production period as a period in which production occurs,i.e., x t > 0. The first objective in (BOLS) mo<strong>de</strong>ls the usuallot-sizing costs, i.e., the production and inventory holding costsover the whole planning horizon. The second objective mo<strong>de</strong>lsthe total emission of pollution across the whole planning horizon.Constraints (1) mo<strong>de</strong>l the balance between production, storage and<strong>de</strong>mand in period t. Constraints (2) impose that production levelis equal to zero if no setup is placed in period t. Constraints (3)impose that the inventory level is equal to zero at the beginning ofthe planning horizon. The last three constraints <strong>de</strong>fine the range inwhich the variables are <strong>de</strong>fined.Alternatively, we can <strong>de</strong>fine a (truly) multi-objective economic lotsizingmo<strong>de</strong>l, where the emission of pollution is minimized in eachperiod of the planning horizon. The mo<strong>de</strong>l reads as follows:( Tminimize ∑ [ f t y t + c t x t + h t I t ],t=1ˆf 1 y 1 + ĉ 1 x 1 + ĥ 1 I 1 ,..., ˆf T y T + ĉ T x T + ĥ T I T)ALIO-EURO <strong>2011</strong> – 60


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>subject tox t + I t−1 = d t + I t t = 1,...,Tx t ≤ My t t = 1,...,TI 0 = 0y t ∈ {0,1} t = 1,...,Tx t ≥ 0 t = 1,...,TI t ≥ 0 t = 1,...,T.(MOLS)(P( ˆB)) and (P((ˆb t ))) reduce to an ELSP and, therefore, are polynomiallysolvable. Also, it is not difficult to see that the CapacitatedLot-Sizing problem (CLSP) is a particular case of Problem(P((ˆb t ))). Propositions 1 and 2 show that, in general, both(P( ˆB)) and (P((ˆb t ))) are N P-complete.Proposition 1. Problem (P( ˆB)) is N P-complete.Proposition 2. Problem (P((ˆb t ))) is N P-complete.When the lot-sizing cost function is concave, the classical ELSPis solvable in polynomial time in T , see [12]. More efficient algorithmsfor special cases have been <strong>de</strong>veloped in [1, 4, 11]. Inthis paper, we aim at <strong>de</strong>scribing Pareto efficient solutions for both(BOLS) and (MOLS). In Section 2, we show that, in general, thistask is NP-complete. Therefore, in Sections 3 and 4, we proposeclasses of problem instances for which this task can be performedin polynomial time. We conclu<strong>de</strong> the paper with Section 5.2. PARETO OPTIMAL SOLUTIONSWhen more than one objective function is optimized, Pareto efficientsolutions are sought. These can be found by minimizing oneof the objective functions, for instance, the lot-sizing costs, whileconstraining the rest of objectives.Given ˆB ∈ R + , the following problem <strong>de</strong>fines a Pareto efficientsolution for (BOLS):subject toTminimize ∑[ f t y t + c t x t + h t I t ]t=1x t + I t−1 = d t + I t t = 1,...,Tx t ≤ My t t = 1,...,TI 0 = 0y t ∈ {0,1} t = 1,...,Tx t ≥ 0 t = 1,...,TI t ≥ 0 t = 1,...,T(P( ˆB))T∑[ ˆf t y t + ĉ t x t + ĥ t I t ] ≤ ˆB. (4)t=1Given (ˆb t ) ∈ R T +, the following problem <strong>de</strong>fines a Pareto efficientsolution for (MOLS):Tminimize ∑[ f t y t + c t x t + h t I t ]t=1subject to (P((ˆb t )))x t + I t−1 = d t + I t t = 1,...,Tx t ≤ My t t = 1,...,TI 0 = 0y t ∈ {0,1} t = 1,...,Tx t ≥ 0 t = 1,...,TI t ≥ 0 t = 1,...,Tˆf t y t + ĉ t x t + ĥ t I t ≤ ˆb t t = 1,...,T. (5)Both mo<strong>de</strong>ls, (P( ˆB)) and (P((ˆb t ))), can be found in [2]. Wemay observe that if the emission constraints are not binding, both3. POLYNOMIALLY SOLVABLE SCENARIOS FOR(P( ˆB))In the following we discuss several scenarios for which (P( ˆB))can be solved in polynomial time.Recall that, for a given ˆB, (P( ˆB)) yields a Pareto efficient solutionof (BOLS). When possible we also discuss the running timeof a procedure that <strong>de</strong>scribes the whole efficient frontier, i.e., therunning time of solving the parametric problem (P( ˆB)), for allˆB ≥ 0.3.1. Setup emissionsIf ĥ t = 0, for all t, and ˆf t and ĉ t are stationary, then (P( ˆB)) is polynomiallysolvable. First note that ∑ T t=1 x t = ∑ T t=1 d t. Therefore, ifthe production emissions are stationary, then ∑ T t=1 ĉtx t = ĉ∑ T t=1 d t.Now (4) can be written asT∑ y t ≤ ⌊ ˜B⌋,t=1where ˜B = 1ˆf ( ˆB − ĉ∑ T t=1 d t). Thus, the problem can be written asan ELSP with a bound on the number of production periods. LetF n (t) be the optimal cost of the subproblem consisting of periods1,...,t with n production periods. Clearly, we can solve the lotsizingproblem with a bound on the number of production periodsif we have at hand the values F n (T ) for n = 1,...,T .The values F n (t) can be found by the following dynamic programmingrecursionF n (t) = mini=n,...,t {F n−1(i − 1) +C(i,t)},where C(i,t) is the total lot-sizing cost incurred for satisfying the<strong>de</strong>mand in interval [i,t] by production in period i. Note that thereare n − 1 production periods in the interval [1,i − 1] and there is1 production period in the interval [i,t]. This recursion is initializedby F 0 (0) = 0 and F 0 (t) = ∞ for t = 1,...,T . Clearly, thisDynamic Programming (DP) algorithm runs in O(T 3 ) time. Asimilar recursion can be found in [7]. In [9], it is shown that allvalues F n (t) can be found in O(T 2 ) time when the lot-sizing costsare such that there are no speculative motives to hold inventory.The same running time is shown in [3] in case of stationary setupcosts.Back to (P( ˆB)), its optimal solution value is equal tomin F n (T ),n≤⌊ ˜B⌋which can be found in O(T 3 ) time. (Savings can be achieved bynoting that the maximum number of production periods is ˜B, yieldingan algorithm that runs in O(T 2 ˜B) time.) If the lot-sizing costsare such that there are no speculative motives to hold inventory,(P( ˆB)) can be solved in O(T 2 ) time.The following proposition shows that if ˆf t are general, (P( ˆB)) isN P-complete.ALIO-EURO <strong>2011</strong> – 61


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Proposition 3. If ĉ t = ĥ t = 0, for all t, Problem (P( ˆB)) is N P-complete.For the class of problem instances in this section, the efficient frontierof (BOLS) can <strong>de</strong> <strong>de</strong>scribed in polynomial time too, since weonly need to solve (P( ˆB)) for T possible values to ˆB, namelyˆB = n ˆf , where n = 1,...,T . Thus, the efficient frontier can befound in O(T 3 ) time, while for the special cases mentioned above,it can be found in O(T 2 ) time.3.2. Production emissionsIf ˆf t = ĥ t = 0, for all t, and ĉ t are stationary, it is trivial to seethat (P( ˆB)) is polynomially solvable. This can be easily seen bynoticing that the problem is feasible ifT∑ d t ≤ ˜B.t=1where ˜B = ˆBĉ . If the problem is feasible, then constraint (4) isredun<strong>da</strong>nt and the problem reduces to an ELSP.If ĉ t are general, the complexity of problem (P( ˆB)) is unknown.In this case, constraint (4) readsT∑ ĉ t x t ≤ ˆB,t=1i.e., it imposes an upper bound on the weighted productions.For the class of problem instances in this section, the efficient frontierof (BOLS) can clearly be <strong>de</strong>scribed in polynomial time for thisscenario.3.3. Inventory emissionsSuppose that for the lot-sizing costs we have f t = f and c t = c,while for the emissions we have ˆf t = ˆf , ĉ t = ĉ and ĥ t = α h t .We will show that in this case (P( ˆB)) is solvable in polynomialtime by fixing the number of production periods. Note that such aproblem instance satisfies the Zero Inventory Or<strong>de</strong>r property, i.e.,x t I t−1 = 0 for all t, because of the non-speculative motives assumption(both in the emission and lot-sizing cost).Two observations are given before we present the procedure to findthe optimal solution. First, for a production plan with n productionperiods, constraint (4) can be written asTTα ∑ h t I t ≤ ˆB − ˆf n − ĉ ∑ d t . (6)t=1t=1Second, because both the setup and the unit production lot-sizingcosts are stationary, the objective function of (P( ˆB)) boils downtoTT T∑( f t y t + c t x t + h t I t ) = f n + c ∑ d t + ∑ h t I t .t=1t=1 t=1Thus, when the number of production periods is fixed, minimizingthe total lot-sizing costs is equivalent to minimizing the total inventorycost. Moreover, the objective function also minimizes theleft hand si<strong>de</strong> of (6).The following procedure solves the problem to optimality. Givenn = 1,...,T , solve the ELSP with n production periods, this can bedone in polynomial time, as already shown in Section 3.1. If theinventory levels of the optimal solution satisfy (6), this solution iskept. After evaluating all possible values of n, we will have at mostT solutions, from which we choose the solution having the lowestlot-sizing costs.Notice that if ĥ t are general, the complexity of problem (P( ˆB)) isunknown. In this case, constraint (4) can be rewritten asTT t∑ c t x t ≤ ˆB − ∑ ĥ t ∑ d t ,t=1t=1 τ=1where c t = ∑ T τ=t ĥt. Therefore, this reduces to a problem of theform given in Section 3.2, and thus its complexity is unknown.Again, for the class of problem instances in this section, we can<strong>de</strong>scribe the whole efficient frontier in polynomial time. Fromabove, it is clear that the only possible Pareto efficient solutionswill be the ones returned by the ELSP with n production periods,n = 1,...,T . Also, it is clear that the total inventory levels of thesesolutions will become the breakpoints of ˜B in the Pareto efficientfrontier.4. POLYNOMIALLY SOLVABLE SCENARIOS FOR(P((ˆb T )))In the following, we discuss several scenarios for which (P((ˆb t )))can be solved in polynomial time.4.1. Setup emissionsIn this section, we show that (P((ˆb t ))) is polynomially solvableif ĉ t = ĥ t = 0. In this case, constraint (5) implies y t = 0 if ˆf t > ˆb t ,and otherwise it is redun<strong>da</strong>nt. This can be easily incorporated intothe dynamic programming approach that solves the ELSP in polynomialtime, without increasing the running time, and thereforeremaining polynomial.4.2. Production emissionsIn this section, we show that (P((ˆb t ))) is polynomially solvableif ˆf t = ĥ t = 0. In this case, constraint (5) can be written as a constrainton x t . The new capacity constraints are stationary and thereforethe problem can be solved in polynomial time, [5] and [8].4.3. Inventory emissionsIn this section, we show that (P((ˆb t ))) is polynomially solvableif ˆf t = ĉ t = 0. In this case, constraint (5) can be written as a constrainton I t . This problem was shown to be polynomially solvablein [10].4.4. Setup, production and inventory emissionsIn this section, we show that (P((ˆb t ))) is polynomially solvableun<strong>de</strong>r the following assumptions. With respect to the lot-sizingcosts, we assume that the setup costs are non-increasing and thereare no speculative motives to hold inventory. With respect to theemissions, we assume that all parameters are stationary.Definition 4. We will say that period t is a tight period ifˆf t y t + ĉ t x t + ĥ t I t = ˆb t .As usual in the literature, we will refer to a regeneration period asa period in which the inventory level at the end of the period isequal to zero, i.e., I t = 0. We will refer to a subplan as the subproblem<strong>de</strong>fined by two consecutive regeneration points. Withoutloss of optimality, we can assume that the inventory levels withina subplan must all be positive. We will <strong>de</strong>compose the problemALIO-EURO <strong>2011</strong> – 62


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>into subplans using the regeneration periods, and <strong>de</strong>fine a straightforwardDynamic Programming algorithm to solve (P((ˆb t ))). Inor<strong>de</strong>r to show that the problem is polynomially solvable we needto show that the costs of a subplan can be calculated in polynomialtime. Let us therefore focus on a given subplan, and its optimalcosts.Proposition 5. There is at most one non-tight production periodin a subplan.Proposition 6. Without loss of optimality, the only possible nontightproduction period in a subplan is the first period.Proposition 7. There exists an optimal solution satisfying I t−1 0,• Ī t−1<strong>de</strong>f= I t − ¯x t + d t > 0.Then period t is a tight production period in the subplan with productionquantity ¯x t .We can now use Proposition 8 to construct an optimal solution toany non-<strong>de</strong>generate subplan [u,v] (i.e., it does not <strong>de</strong>compose intomultiple subplans) in a backward way. Assume that we arrive insome period t > u, that I t is known (note that I v = 0 in the initializationof the procedure) and we want to <strong>de</strong>termine x t and I t−1 . Weconsi<strong>de</strong>r the following cases:• ¯x t ≤ 0: The subplan is infeasible, since constraint (5) is violatedfor period t or some period before t. Note that ¯x t isthe maximum production quantity in period t without violatingthe emission constraint. It follows from the proof ofProposition 8 that any feasible production quantity in periods (s < t) is at most equal to ¯x t . In other words, anyperiod with a positive production amount before period twill violate the emission constraint.• ¯x t > 0 and Ī t−1 ≤ 0: In this case period t cannot be a tightproduction period, since production would be too much.Therefore, we set x t = 0 and I t−1 = I t + d t . Note that thesubplan would be <strong>de</strong>generate in case Ī t−1 = 0.• ¯x t > 0 and Ī t−1 > 0: By Proposition 8 period t is tight.Hence, we set x t = ¯x t and I t−1 = Ī t−1 .This procedure is applied until we arrive at period u. If 0 < d u +I u+1 ≤ ¯x u , then subplan [u,v] is feasible and non-<strong>de</strong>generate witha production quantity equal to x u = d u + I u+1 .For given periods u and v, the cost of subplan [u,v] can be <strong>de</strong>terminedin linear time. Hence, a straightforward implementationwould lead to an O(T 3 ) time algorithm. However, note thatwhen <strong>de</strong>termining subplan [1,v], we also find subplans [u,v] foru = 1,...,v. This means that all subplans can be found in O(T 2 )time, and so the optimal solution to the problem.5. CONCLUSIONSIn this paper, we have studied lot-sizing mo<strong>de</strong>ls incorporating pollutionemissions, and mo<strong>de</strong>led them as multi-objective problems.We have shown that finding Pareto efficient solutions to this problemsis, in general, an NP-complete task. We have presented classesof problem instances for which these Pareto efficient solutions canbe found in polynomial time.6. REFERENCES[1] A. Aggarwal and J.K. Park. Improved algorithms for economiclot-size problems. Operations Research, 41(3):549–571, 1993.[2] S. Benjaafar, Y. Li, and M. Daskin. Carbon footprint and themanagement of supply chains: Insights from simple mo<strong>de</strong>ls.Research report, 2010.[3] A. Fe<strong>de</strong>rgruen and J. Meissner. Competition un<strong>de</strong>r timevarying<strong>de</strong>mands and dynamic lot sizing costs. Naval ResearchLogistics, 56(1):57–73, 2009.[4] A. Fe<strong>de</strong>rgruen and M. Tzur. A simple forward algorithm tosolve general dynamic lot sizing mo<strong>de</strong>ls with n periods inO(nlogn) or O(n). Management Science, 37:909–925, 1991.[5] M. Florian and M. Klein. Deterministic production planningwith concave costs and capacity constraints. Management Science,18:12–20, 1971.[6] M.R. Garey and D.S. Johnson. Computers and intractability:a gui<strong>de</strong> to the theory of NP-completeness. W.H. Freeman andcompany, New York, 1979.[7] S.M. Gilbert. Coordination of pricing and multi-period productionfor constant priced goods. European Journal of OperationalResearch, 114(2):330–337, 1999.[8] C.P.M. van Hoesel and A.P.M. Wagelmans. An O(T 3 ) algorithmfor the economic lot-sizing problem with constant capacities.Management Science, 42(1):142–150, 1996.[9] C.P.M. van Hoesel and A.P.M. Wagelmans. Parametric analysisof setup cost in the economic lot-sizing mo<strong>de</strong>l withoutspeculative motives. International Journal of Production Economics,66:13–22, 2000.[10] S.F. Love. Boun<strong>de</strong>d production and inventory mo<strong>de</strong>ls withpiecewise concave costs. Management Science, 20(3):313–318, 1973.[11] A. Wagelmans, S. van Hoesel, and A. Kolen. Economic lotsizing: An O(nlogn) algorithm that runs in linear time in theWagner-Whitin case. Operations Research, 40(1):S145–S156,1992.[12] H.M. Wagner. A postscript to dynamic problems of the theoryof the firm. Naval Research Logistics Quarterly, 7:7–12,1960.ALIO-EURO <strong>2011</strong> – 63


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>An Optimization Mo<strong>de</strong>l for the Traveling Salesman Problem withThree-dimensional Loading ConstraintsLeonardo Junqueira ∗ José Fernando Oliveira † Maria Antónia Carravilla †Reinaldo Morabito ∗∗ Departamento <strong>de</strong> <strong>Engenharia</strong> <strong>de</strong> Produção, Universi<strong>da</strong><strong>de</strong> Fe<strong>de</strong>ral <strong>de</strong> São CarlosRodovia Washington Luís, km 235 - SP-310, 13565-905, São Carlos - São Paulo - Brasilleo_junqueira@yahoo.com, morabito@ufscar.br† <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> <strong>Engenharia</strong>, Universi<strong>da</strong><strong>de</strong> do PortoRua Dr. Roberto Frias s/n, 4200-465, Porto, Portugal{jfo, mac}@fe.up.ptABSTRACTIn this paper, we present a mixed integer linear programming mo<strong>de</strong>lfor the traveling salesman problem that consi<strong>de</strong>rs three-dimensionalloading constraints. Computational tests with the proposed mo<strong>de</strong>lwere performed with randomly generated instances using an optimizationsolver embed<strong>de</strong>d into a mo<strong>de</strong>ling language. The resultsvali<strong>da</strong>te the mo<strong>de</strong>l and show that it is able to handle only problemsof a mo<strong>de</strong>rate size. However, the mo<strong>de</strong>l can be useful tomotivate future research to solve larger problems, especially whenthis problem appears as a sub-problem of another problem, as wellas mo<strong>de</strong>ling the more general vehicle routing problem with threedimensionalloading constraints.Keywords: Traveling salesman problem, Three-dimensional loading,Combinatorial optimization, Mathematical mo<strong>de</strong>ling1. PROBLEM DESCRIPTIONThe vehicle routing literature has been recently merged with thecontainer loading literature to treat cases where the goods requiredby the customers are wrapped up in discrete items, such as boxes.This effort arises from the attempt to avoid expressing the <strong>de</strong>mandsof the customers simply as their weights or volumes. In otherwords, if the <strong>de</strong>mand constraints are seen in one-dimensional pointof view, it is assumed that each <strong>de</strong>mand fills one certain section ofthe vehicle or that the cargo shapes up smoothly according to thevehicle shape. However, when <strong>de</strong>aling with rigid discrete items,their geometry may lead to losses of space or even to infeasiblesolutions if the vehicle has not enough capacity. If other practicalconstraints are also consi<strong>de</strong>red ([1]), the coupling of the routingand loading structures becomes even more complex. Constraintssuch as vertical and horizontal stability of the cargo, load bearingstrength and fragility of the cargo, grouping or separation ofitems insi<strong>de</strong> a container, multi-drop situations, complete shipmentof certain item groups, container weight limit, weight distributionwithin a container, among others, are common in the containerloading literature and can also be embed<strong>de</strong>d into vehicle routingproblems.One of these combined problems, the 3L-CVRP (e.g., [2], [3],[4]), consi<strong>de</strong>rs a fleet of i<strong>de</strong>ntical vehicles that must run minimumcost routes to <strong>de</strong>liver boxes to a set of customers, <strong>de</strong>parting fromand returning to a <strong>de</strong>pot. Besi<strong>de</strong>s the non-overlap of the threedimensionalboxes, the constraints that have been usually consi<strong>de</strong>re<strong>da</strong>re the vertical stability of the cargo, the load bearing strengthof the boxes and the multi-dropping of the boxes. The 2L-CVRP(e.g., [5], [6], [7]) is a particular case of the above problem wherethe boxes are too heavy for being stacked and only the floor of thevehicle is consi<strong>de</strong>red for the boxes’ placement. The approachesused to solve these problems have been mainly heuristic.In this paper, we address another variant of these combined problems,named the 3L-TSP. In this problem, a set of customers makerequests of goods, that are packed into boxes, and the objectiveis to find a minimum cost <strong>de</strong>livery route for a single vehicle that,<strong>de</strong>parting from a <strong>de</strong>pot, visits all customers only once and returnsto the <strong>de</strong>pot, while consi<strong>de</strong>ring some three-dimensional loadingconstraints. Apart the constraints that ensure that the boxes donot overlap each other, the vertical stability of the cargo, the loadbearing strength of the boxes (including fragility), and the multidroppingof the boxes are also taken into account. It is assumedthat the boxes and the vehicle are of rectangular shape, and that thecargo completely fits insi<strong>de</strong> the vehicle. We present a mixed integerlinear programming mo<strong>de</strong>l for the problem, aiming to show theimpact of the loading constraints. We are not aware of other papersthat have presented mathematical formulations for the 3L-TSP andwhich explicitly <strong>de</strong>al with such constraints.2. THREE-DIMENSIONAL LOADING CONSTRAINTSIn a recent study, [8] and [9] presented mathematical formulationsfor the container loading problem with cargo stability, load bearingstrength and multi-drop constraints. Cargo stability refers tothe support of the bottom faces of boxes, in the case of verticalstability (i.e., the boxes must have their bottom faces supportedby other box top faces or the container floor), and the support ofthe lateral faces of boxes, in the case of horizontal stability. Loadbearing strength refers to the maximum number of boxes that canbe stacked one above each other, or more generally, to the maximumpressure that can be applied over the top face of a box, soas to avoid <strong>da</strong>maging the box. We note that fragility is a particularcase of load bearing where boxes cannot be placed above afragile box, since its top face does not bear any kind of pressure.Multi-drop constraints refer to cases where boxes that are <strong>de</strong>liveredto the same customer (<strong>de</strong>stination) must be placed close toeach other in the vehicle, and the loading pattern must take intoaccount the <strong>de</strong>livery route of the vehicle and the sequence in whichthe boxes are unloa<strong>de</strong>d. The practical importance of incorporatingthese constraints to the problem is to avoid loading patterns whereboxes are “floating in mid-air” insi<strong>de</strong> the vehicle, where productsare <strong>da</strong>maged due to <strong>de</strong>formation of the boxes that contain them, orwhere an unnecessary additional handling is incurred when eachdrop-off point of the route is reached. In the present study, weALIO-EURO <strong>2011</strong> – 64


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>have exten<strong>de</strong>d these i<strong>de</strong>as in the context of the traveling salesmanproblem.3. CONCLUSIONSComputational tests with the proposed mo<strong>de</strong>l were performed withrandomly generated instances using an optimization solver embed<strong>de</strong>dinto a mo<strong>de</strong>ling language. The results vali<strong>da</strong>te the mo<strong>de</strong>l andshow that it is able to handle only problems of a mo<strong>de</strong>rate size.However, the mo<strong>de</strong>l can be useful to motivate future research tosolve larger problems, especially when the 3L-TSP appears as asub-problem of another problem, as well as mo<strong>de</strong>ling the moregeneral vehicle routing problem with three-dimensional loadingconstraints.4. ACKNOWLEDGEMENTSThis research was partially supported by FAPESP (Grant 09/07423-9) and CAPES (Grant BEX 3187/10-1).5. REFERENCES[1] E. E. Bischoff and M. S. W. Ratcliff, “Issues in the <strong>de</strong>velopmentof approaches to container loading,” Omega, vol. 23,no. 4, pp. 377–390, 1995.[2] M. Gendreau, M. Iori, G. Laporte, and S. Martello, “A tabusearch algorithm for a routing and container loading problem,”Transportation Science, vol. 40, no. 3, pp. 342–350, 2006.[3] C. D. Tarantilis, E. E. Zachariadis, and C. T. Kiranoudis, “Ahybrid metaheuristic algorithm for the integrated vehicle routingand three-dimensional container-loading problem,” IEEETransactions on Intelligent Transportation Systems, vol. 10,no. 2, pp. 255–271, 2009.[4] G. Fuellerer, K. F. Doerner, H. F. Hartl, and M. Iori, “Metaheuristicsfor vehicle routing problems with three-dimensionalloading constraints,” European Journal of Operational Research,vol. 201, no. 3, pp. 751–759, 2010.[5] M. Gendreau, M. Iori, G. Laporte, and S. Martello, “A tabusearch heuristic for the vehicle routing problem with twodimensionalloading constraints,” Networks, vol. 51, no. 1, pp.4–18, 2008.[6] G. Fuellerer, K. F. Doerner, R. F. Hartl, and M. Iori, “Antcolony optimization for the two-dimensional loading vehiclerouting problem,” Computers & Operations Research, vol. 36,no. 3, pp. 655–673, 2009.[7] E. E. Zachariadis, C. D. Tarantilis, and C. T. Kiranoudis, “Agui<strong>de</strong>d tabu search for the vehicle routing problem with twodimensionalloading constraints,” European Journal of OperationalResearch, vol. 195, no. 3, pp. 729–743, 2009.[8] L. Junqueira, R. Morabito, and D. S. Yamashita, “Threedimensionalcontainer loading mo<strong>de</strong>ls with cargo stability andload bearing constraints,” (to appear in Computers & OperationsResearch, doi:10.1016/j.cor.2010.07.017).[9] ——, “Mip-based approaches for the container loading problemwith multi-drop constraints,” (submitted to Annals of OperationsResearch).ALIO-EURO <strong>2011</strong> – 65


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Rect–TOPOS: A constructive heuristic for the rectilinear packing areaminimization problemMarisa Oliveira ∗ Eduar<strong>da</strong> Pinto Ferreira ∗ † A. Miguel Gomes ‡∗ ISEP – Instituto Superior <strong>de</strong> <strong>Engenharia</strong> do PortoDr. António Bernardino <strong>de</strong> Almei<strong>da</strong>, 431 4200-072 Porto Portugal{mjo, epf}@isep.ipp.pt† GECAD – Knowledge Engineering and Decision Support Research CenterDr. António Bernardino <strong>de</strong> Almei<strong>da</strong>, 431 4200-072 Porto Portugal‡ INESC Porto, <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> <strong>Engenharia</strong>, Universi<strong>da</strong><strong>de</strong> do PortoRua Dr. Roberto Frias, s/n 4200-465 Porto Portugalagomes@fe.up.ptABSTRACTIn this paper we propose a constructive heuristic, Rect–TOPOS,to solve the problem of minimizing the enclosing rectangular areathat contains, without overlapping, a set of rectilinear pieces (e.g.,L and T shaped pieces). This is a NP-hard combinatorial optimizationproblem, which belongs to the class of cutting and packingproblems. To evaluate the Rect–TOPOS heuristic computationaltests were performed to vali<strong>da</strong>te it for the presented problem.In these tests, instances with different characteristics wereused, namely the total number of pieces, and the shaped diversityof the pieces. The results show that this is a heuristic that canquickly and easily to <strong>de</strong>al with all the rectilinear shaped pieces.Keywords: Combinatorial optimization, Cutting and packing, Constructiveheuristic, Area minimization1. INTRODUCTIONIn the rectilinear packing area minimization problem (RPAMP)one wishes to pack a set of rectilinear shaped pieces (pieces with90 or 270 interior angles) while minimizing the area of the enclosingrectangle without overlapped pieces (Figure 1). This problemarises in many industrial applications such as VLSI <strong>de</strong>sign, facilitylayout problems, newspaper layout, etc. It is NP-hard combinatorialoptimization problem [1] and belongs to the class of cuttingand packing problems (C&P), which are combinatorial problemswith a strong geometric component. Approaches to solve C&Pproblems can be based on any of the usual techniques available forsolving general combinatorial optimization problems like: mixedinteger programming, heuristics, metaheuristics, etc. Given thecombinatorial nature of these problems, the exact techniques arenot able to <strong>de</strong>al effectively with instances of large dimension andit becomes necessaryTo solve the RPAMP we propose a variant of the constructiveheuristic TOPOS. The main differences between the proposed variant,Rect–TOPOS, and TOPOS come from the shapes of the pieces,rectilinear shapes instead of irregular shapes, and the objectivefunction, area minimization instead of layout length minimization.Additionally, the criteria used to select the next piece to place, itsorientation and the best placement point nee<strong>de</strong>d to be a<strong>da</strong>pted.This paper is structured as follows: section 2 presents a <strong>de</strong>tailed<strong>de</strong>scription of the RPAMP; in section 3, the constructive heuristicproposed, Rect- TOPOS, is presented; in section 4, computationalFigure 1: Rectilinear Packing Area Minimization Problem.results are shown and, finally, in Section 5 some concluding remarksare presented.2. RECTILINEAR PACKING AREA MINIMIZATIONPROBLEMThe objective of the RPAMP is to pack, without overlapping, a setof given rectilinear shaped pieces while minimizing the area of theenclosing rectangle. The dimensions of the pieces are fixed andthey must be placed orthogonally (i.e., with si<strong>de</strong>s parallel to thehorizontal and vertical axes), though a 90 0 , 180 0 or 270 0 rotationof the pieces are allowed. This is a two-dimensional problem and,according to the typology of C&P problems proposed in [2], isclassified as an open dimension problem (ODP) since the dimensionsof the enclosing rectangle are unknown.The RPAMP arises in many real word applications such as theplacement of modules in Very Large Scale Integration (VLSI) circuits,in the <strong>de</strong>signing of facility, newspaper layouts, etc. For example,in VLSI circuits rectilinear shaped pieces appeared to facilitatethe usage of circuit area and improve the connectivity betweenthe pieces, increasing the circuit performance. Early works that appearedin the literature to solve area minimization problems only<strong>de</strong>alt with rectangles and the main concern was to find efficient<strong>da</strong>ta structures to represent layouts. These representations enco<strong>de</strong>solutions as sequences, graphs or trees. Over time, new representationsappeared, justified by improvements in the efficiencyof solution evaluation, the type of encoding schemes, the amountALIO-EURO <strong>2011</strong> – 66


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>of redun<strong>da</strong>ncy that exists in the encoding and the total number ofconfigurations. An early work by Wong et al. [3] proposed analgorithm for slicing layouts 1 using a tree structure. One importantbreakthrough is the introduction of the Sequence Pair (SP), byMurata et al. [1], for representing non-slicing layouts. This representationis based on a pair of sequences that specifies the relativepositions of the rectangles. Many other representations haveemerged after the sequence pair. The existing representations forthe rectangle packing have been a<strong>da</strong>pted to enable its applicabilityto problems with rectilinear shaped pieces.Unlike what happens in most approaches in the literature to theRPAMP the proposed approach does not <strong>de</strong>al with representationsof the layout but it works directly on the layout. The next sectionprovi<strong>de</strong>s a <strong>de</strong>scription of the proposed heuristic to solve thisproblem.3. RECT–TOPOSTo solve the RPAMP we propose a variant of the TOPOS algorithm[4] which was originally <strong>de</strong>veloped to solve problems with irregularshapes 2 . The main i<strong>de</strong>a behind it is successively adding a newpiece to a partial solution. In the TOPOS algorithm two differentlevels are used: a first one to choose the best placement point foreach piece to place in each admissible orientation (nesting strategies)and a second one to choose, from all the possible candi<strong>da</strong>tesfrom the previous level, the best overall placement (layout evaluation).Three nesting strategies which aim to evaluate the best fitof two irregular shapes (partial solution and the piece chosen) withfixed orientations have been used: minimizing the area of the enclosureof the two pieces, minimizing the length of the enclosure ofthe two pieces and maximizing the overlap between the rectangularenclosures of the two pieces. To evaluate and compare differentlayouts three different criteria have been used: the difference betweenthe area of the rectangular enclosure of the partial solutionand the area of all pieces already placed (waste), the overlap betweenthe rectangular enclosure of the piece un<strong>de</strong>r evaluation andthe rectangular enclosure of each piece already placed and, finally,the eucli<strong>de</strong>an distance between the centre of the rectangular enclosureof the piece un<strong>de</strong>r evaluation and the centre of the rectangularenclosure of the partial solution.The overall objective is to minimize the layout length since in theseproblems the layout width is fixed.In our variant, Rect–TOPOS, we follow the same general i<strong>de</strong>a,successively adding a new piece to a partial solution while minimizingthe enclosing rectangular area. We choose to use a singlelevel to select the next piece to place, its orientation and the bestplacement point simultaneously. The existence of a single levelallows choosing the best piece to place between all possibilitiesunlike what happens when there are two levels, in which there isan initial selection of the placement point for each piece to place.We used the waste and distance evaluation criteria, taken directlyfrom the criteria used in the second level of the TOPOS, and introduce<strong>da</strong> new criterion, the perimeter minimization. This newcriterion tries to minimize the perimeter between the piece un<strong>de</strong>revaluation and the current partial solution.The third criterion used in TOPOS, overlap maximization, was removedsince it is not appropriate for situations where there are alarge number of rectangles to place. In these situations, the enclosingrectangle of a rectangle is the rectangle itself, it makes nosense trying to maximize the overlap of two rectangles becausepieces are not allowed to overlap.1 A layout is said to be slicing if it can be obtained by successive horizontaland vertical cuts, from one si<strong>de</strong> to another, which divi<strong>de</strong> it into tworectangles.2 An irregular shape is a polygon with arbitrary angles.Figure 2: Construction of L and T-shaped pieces from rectangles.As in TOPOS, the iterative process needs to have an initial nonemptypartial solution, so we used another criteria to select thefirst piece of the partial solution. For this selection we chose to use3 criteria that favor the selection of the larger pieces: piece withlarger area; piece with larger perimeter or piece with larger width.4. COMPUTATIONAL RESULTSThis section presents the computational results with the heuristicRect–TOPOS. The tests were performed on a Linux workstationequipped with a Intel XEON Dual Core 5160, 3GHz. Althoughthe workstation has two CPUs, only one thread was usedin the tests. The test instances used have different characteristics,particularly in the total number of pieces, number of pieces withdifferent shapes (number of types of pieces) and in the shape ofthe pieces (rectangular and other shapes with rectangular components).To evaluate the heuristic Rect–TOPOS we used the followingfour sets of instances:• instances of the reference set MCNC (http://vlsicad.eecs.umich.edu/BK/MCNCbench/HARD/), which isa benchmark set with origins in the project of VLSI circuits,in which all the pieces have a rectangular shape andwhere the total number of pieces to place does not exceed50 (APTE, XEROX, HP, AMI33, AMI49);• instances also composed only by rectangles, however, differfrom the previous one by having higher number of pieces,from 100 to 500 ( http://www.simplex.tu-tokyo.ac.jp/imahori/packing/) (RP100, RP200, PCB146,PCB500);• instances taken from [5] (NAKATAKE1, NAKATAKE2), [6](LIN) and [7](AMI49L, AMI49LT) containing a mix of piecesthat are rectangles, L-shaped and/or T-shaped pieces andother pieces with rectangular components (U, +, H, etc.);• instances AMI33LTa and AMI49LTa were generated from instancesAMI33 e AMI49 from the MCNC reference set. Therule used to obtain these two intances was to change approximately10% of the total number of rectangles in Land/or T pieces. Each of the new L or T shaped have integerdimensions and have an area similar to the area ofthe original rectangle accordingly to the procedure shownin Figura 2.The instances chosen to test and evaluate the heuristic Rect–TOPOShave very different characteristics, namely in what concerns thetotal number of pieces, the number of different pieces types, theshape of the pieces (rectangular, L-shaped, T-shaped, etc.). Thischaracteristics are shown in Table 1.Table 2 summarizes the computational tests performed to test an<strong>de</strong>valuate the heuristic Rect–TOPOS. We tested the three criteriafor choosing the next piece to place, its orientation and placementpoint previously presented (WASTE, DISTANCE and PERIMETER),and, for each one of them, we consi<strong>de</strong>red the three possibilitiesALIO-EURO <strong>2011</strong> – 67


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong># Pieces # Rectilinear piecesInstance Total # Types # Rect. # OthersAPTE 9 3 9 —XEROX 10 10 10 —HP 11 6 11 —AMI33 33 31 33 —AMI49 49 46 49 —RP100 100 99 100 —PCB146 146 22 200 —RP200 200 99 146 —PCB500 500 417 500 —AMI49L 28 28 7 21AMI49LT 27 27 6 21NAKATAKE1 40 35 30 10NAKATAKE2 19 19 5 14LIN 29 21 22 7AMI33LTa 33 31 30 3AMI49LTa 33 46 41 5Table 1: Characteristics of the used instances.Figure 3: Layout obtained for PCB500 instance.to choose the piece to start the partial solution (AREA, PERIME-TER and DISTANCE). The values shown in the table are the areausage, measured as the ratio between the sum of the area of theplaced pieces and the area of the enclosing rectangle obtained.The bold values are the best result for each instance. Table 2 alsopresents, for each instance, the average computational time, measuredin seconds. Note that, for each instance, the computationaltimes does not show great variability. Finally we present also foreach instance, the best result found in literature, their area usage,computational time and the publication reference.From Table 2 we can see that the best results were obtained whenusing for choosing the next piece to place and the placement pointthe perimeter criterion, except for instances APTE and XEROX.These two instances are very sensitive to the choice of the firstpiece to place as they have a small number of pieces to place, 9and 10 respectively. Regarding the choice of the first piece, theresults show balance between the three criteria. When comparingthe results obtained with the best published results one should takeinto account that the Rect–TOPOS is only a constructive heuristic,while the best published results were obtained with approachesbased on local search and tree search algorithms. Thus, as expected,the results obtained with the Rect–TOPOS fall short of thepublished ones, but in return the computational times are muchlower. We note that for the PCB500 instance the result obtainedby Rect–TOPOS was better than the best result found in the literature[10]. Table 2 also allows to show the great impact that thenumber of types of pieces have in the Rect–TOPOS heuristic performance.For example, although the PCB146 instance have more46 pieces in total than the RP100 instance its running time is about10 times lower because it has only 22 different pieces types whilethe instance RP100 has 99 different types.Figure 3 shows the layout obtained for the PCB500 instance.5. CONCLUSIONSIn this article we presented a constructive heuristic, Rect–TOPOS,to the Rectilinear Packing Area Minimization Problem. Rect–TOPOS is a fast heuristic which is able to easily handle rectilinearshaped pieces. This heuristic uses several criteria to choose thenext piece to place, its orientation and the placement point. Thequality of solutions proved to be quite satisfactory because it is asimple heuristic with reduced run times. These features suggest, asfuture <strong>de</strong>velopments, the incorporation of Rect–TOPOS heuristicin an approach based on local procedure. In this approach could,at the expense of increased run time, improving the already goodresults achieved by Rect–TOPOS in situations where this was necessary.6. ACKNOWLWDGEMENTSPartially supported by Fun<strong>da</strong>ção para a Ciência e Tecnologia (FCT)Project PTDC/EME-GIN/105163/2008 - EaGLeNest, through the“Programa Operacional Temático Factores <strong>de</strong> Competitivi<strong>da</strong><strong>de</strong>(COMPETE)” of the “Quadro Comunitário <strong>de</strong> Apoio III”, partiallyfun<strong>de</strong>d by FEDER.7. REFERENCES[1] H. Murata, K. Fujiyoshi, S. Nakatake, and Y. Kajitani,“Rectangle-packing-based module placement,” in <strong>Proceedings</strong>of the 1995 IEEE/ACM international conference onComputer-ai<strong>de</strong>d <strong>de</strong>sign, ser. ICCAD ’95. Washington, DC,USA: IEEE Computer Society, 1995, pp. 472–479.[2] G. Wäscher, H. Haußner, and H. Schumann, “An improvedtypology of cutting and packing problems,” European Journalof Operational Research, vol. 183, no. 3, pp. 1109–1130,December 2007.[3] D. F. Wong and C. L. Liu, “A new algorithm for floorplan<strong>de</strong>sign,” in <strong>Proceedings</strong> of the 23rd ACM/IEEE Design AutomationConference, ser. DAC ’86. Piscataway, NJ, USA:IEEE Press, 1986, pp. 101–107.[4] J. F. Oliveira, A. M. Gomes, and J. S. Ferreira, “TOPOS: Anew constructive algorithm for nesting problems,” OR Spectrum,vol. 22, pp. 263–284, 2000.[5] S. Nakatake, K. Fujiyoshi, H. Murata, and Y. Kajitani, “Moduleplacement on BSG-structure and ic layout applications,”in <strong>Proceedings</strong> of the 1996 IEEE/ACM international conferenceon Computer-ai<strong>de</strong>d <strong>de</strong>sign, ser. ICCAD ’96. Washington,DC, USA: IEEE Computer Society, 1996, pp. 484–491.[6] J.-M. Lin, H.-L. Chen, and Y.-W. Chang, “Arbitrarily shapedrectilinear module placement using the transitive closuregraph representation.” IEEE Trans. VLSI Syst., pp. 886–901,2002.ALIO-EURO <strong>2011</strong> – 68


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Waste Distance Perimeter Average Best known resultInstance Area Perim. Width Area Perim. Width Area Perim. Width Time (s) (%) (s)APTE 0.917 0.917 0.917 0.893 0.893 0.893 0.894 0.894 0.894 0.01 0.992 2.38 [8]XEROX 0.801 0.801 0.801 0.804 0.804 0.804 0.788 0.788 0.788 0.09 0.977 9812 [8]HP 0.848 0.848 0.695 0.834 0.834 0.695 0.924 0.924 0.936 0.03 0.987 891 [8]AMI33 0.813 0.813 0.875 0.712 0.712 0.745 0.832 0.832 0.863 0.84 0.986 2.01 [9]AMI49 0.807 0.807 0.807 0.792 0.792 0.792 0.843 0.843 0.843 1.97 0.983 56.61 [9]RP100 0.819 0.819 0.857 0.721 0.721 0.773 0.924 0.924 0.905 9.35 0.968 200 [10]PCB146 0.622 0.622 0.622 0.786 0.786 0.786 0.881 0.881 0.881 0.95 0.977 300 [10]RP200 0.876 0.876 0.878 0.746 0.746 0.754 0.929 0.929 0.913 13.2 0.963 400 [10]PCB500 0.865 0.865 0.865 0.781 0.781 0.781 0.967 0.967 0.967 221.0 0.963 1000 [10]AMI49L 0.625 0.625 0.667 0.761 0.761 0.761 0.829 0.829 0.792 1.11 0.956 2728 [11]AMI49LT 0.731 0.731 0.663 0.787 0.787 0.753 0.793 0.793 0.823 1.08 0.951 2843 [11]NAKATAKE1 0.825 0.825 0.763 0.807 0.807 0.784 0.852 0.852 0.867 1.35 0.969 10.24 [11]NAKATAKE2 0.590 0.619 0.590 0.669 0.614 0.669 0.731 0.770 0.731 2.14 0.802 93.32 [11]LIN 0.789 0.740 0.780 0.780 0.780 0.740 0.840 0.840 0.910 0.82 1.000 2.72 [11]AMI33LTa 0.764 0.764 0.711 0.752 0.752 0.731 0.832 0.832 0.844 1.05 — — —AMI49LTa 0.875 0.875 0.875 0.761 0.761 0.761 0.881 0.881 0.881 2.41 — — —Table 2: Comparison of results obtained by the Rect–TOPOS vs literature results.[7] J. Xu, P.-n. Guo, and C.-K. Cheng, “Rectilinear block placementusing sequence-pair,” in <strong>Proceedings</strong> of the 1998 internationalsymposium on Physical <strong>de</strong>sign, ser. ISPD ’98. NewYork, NY, USA: ACM, 1998, pp. 173–178.[8] H. Chan and I. Markov, “Practical slicing and non-slicingblock-packing without simulated annealing,” in ACM/IEEEGreat Lakes Symp. on VLSI 2004, 2004, pp. 282–287.[9] M. Chen and W. Huang, “A two-level search algorithmfor 2D rectangular packing problem,” Comp. & Ind. Eng.,vol. 53, no. 1, pp. 123 – 136, 2007.[10] S. Imahori, M. Yagiura, and T. Ibaraki, “Improved localsearch algorithms for the rectangle packing problem withgeneral spatial costs,” EJOR, vol. 167, no. 1, pp. 48 – 67,2005.[11] D. Chen, J. Liu, Y. Fu, and M. Shang, “An efficient heuristicalgorithm for arbitrary shaped rectilinear block packingproblem,” Comput. Oper. Res., vol. 37, pp. 1068–1074, June2010.ALIO-EURO <strong>2011</strong> – 69


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Local search methods for leather nesting problemsPedro Brás Cláudio Alves José Valério <strong>de</strong> CarvalhoCentro ALGORITMI / Departamento <strong>de</strong> Produção e Sistemas, Universi<strong>da</strong><strong>de</strong> do Minho4710-057 Braga, Portugal{pedro.bras,claudio,vc}@dps.uminho.ptABSTRACTWe <strong>de</strong>scribe a set of new local search based algorithms for a realleather nesting problem (LNP) arising in the automotive industry.The problem consists in finding the best layouts for a set of irregularshapes within large natural leather hi<strong>de</strong>s with highly irregularcontours, and which may have holes and quality zones. Our casestudy comes from a multinational company that produces car seats.The irregular shapes that must be cut from the hi<strong>de</strong>s are pieces ofthese car seats, and they may contain holes and different qualityzones. A relevant characteristic of the problem addressed is thatthe cutting patterns are not subject to any special constraint thatmay reduce the set of feasible solutions, and hence simplify theproblem. The directionality constraints arising in the shoe industryare an example of such constraints.Very few solution methods were proposed in the literature for thisvariant of the LNP. The value of the potential savings contrast withthis very small number of contributions. Here, we intend to contributewith new solution methods that embeds a new constructiveheuristic that we proposed recently in [1]Keywords: Leather nesting, Variable neighbourhood search1. INTRODUCTIONThe leather nesting problem (LNP) consists in finding the best layoutsfor a set of irregular shapes within the boun<strong>da</strong>ries of naturalleather hi<strong>de</strong>s. The leather hi<strong>de</strong>s are natural products with irregularcontours and a very inhomogeneous surface with holes anddifferent quality levels. Here, we address the real case of a multinationalcompany that produces car seats. The irregular shapes tobe cut from the leather hi<strong>de</strong>s are pieces of these car seats. The correspondingLNP is one of the most general 2-dimensional nestingproblem. The pieces may have holes, and the surface from whichthey are cut must satisfy minimum quality requirements <strong>de</strong>fined bythe clients. These requirements translate into quality zones withinthe pieces, which in turn restrict the position of the pieces withinthe hi<strong>de</strong>s. The <strong>de</strong>tails of this LNP are introduced in Section 2.The first algorithm reported in the literature for this LNP is due toHeistermann and Lengauer [2]. These authors <strong>de</strong>veloped a greedyheuristic that starts by i<strong>de</strong>ntifying a limited and empty region ofthe hi<strong>de</strong> where to place one of the available pieces. The selectionof this region can be fixed a priori, or it may vary from one iterationto another. The placement of the pieces in this region is evaluatedusing different criteria such as the area of the piece and the distancebetween its contour, the bor<strong>de</strong>rs of the hi<strong>de</strong> and the current partiallayout. To repair the eventually infeasible layouts that are built inthis way, the authors resort to compaction. The authors argue thattheir approach is competitive compared to humans. However, theypresent their results without distinguishing the type of instancesfrom which these results are obtained although this may have acritical impact on the quality of the layouts. In<strong>de</strong>ed, in the furnitureindustry, for example, the pieces tend to be much larger than in theautomotive industry, and as a consequence, these instances maylead to better layouts.More recently, Alves et al.[1] analyzed a set of constructive heuristicsfor this LNP. These heuristics rely on the computation of no-fitand inner-fit polygons to ensure feasible placements on the hi<strong>de</strong>s.The authors explored different strategies that use directly the informationprovi<strong>de</strong>d by these polygons to gui<strong>de</strong> the selection ofthe pieces and their placement. Additionally, they explored differentcriteria to evaluate the quality of a placement. An extensiveset of computational experiments on real instances are reported,which pointed to the efficiency of some of the original heuristicsexplored.We extend the work of [1], and propose new local search basedheuristics that embed the best strategies <strong>de</strong>scribed in this paper.We propose three alternative sequence-based neighborhood structures.These structures <strong>de</strong>pend on the values provi<strong>de</strong>d by the evaluationfunction used to assess the quality of the placement points.The different neighborhoods are obtained by varying the size of thesets of pieces in the sequence from which a piece can be removed.The pieces that are removed are replaced by another piece. Thenumber of candi<strong>da</strong>te pieces is another parameter that <strong>de</strong>fine ourneighborhoods. These neighborhoods are explored using the variableneighborhood search metaheuristic <strong>de</strong>scribed in [3].In Section 2, we <strong>de</strong>scribe the relevant aspects of our LNP. In Section3, we introduce the constructive strategies followed in ourheuristics. In Section 4, we discuss some of the <strong>de</strong>tails of ourlocal search procedures, namely the neighborhood structures.2. THE LEATHER NESTING PROBLEMIn the LNP, we are given a set of small two-dimensional irregularshapes (the pieces of the car seats) and a larger irregular shape representingthe leather hi<strong>de</strong>s. The objective is to place the pieces onthe hi<strong>de</strong> so as to minimize the total empty space (or equivalently,maximize the yield).The contour of the leather hi<strong>de</strong>s is irregular, and their interior isusually inhomogeneous. It may have holes, <strong>de</strong>fects and regionswith different levels of quality (the quality zones). The holes and<strong>de</strong>fects of the hi<strong>de</strong>s are treated as any other piece that may be alreadyplaced on the surface of the hi<strong>de</strong>s. The quality zones aretreated differently. A piece can only be placed on a given regionof the hi<strong>de</strong> only if the quality of this region is greater or equal thanthe quality requirements of the piece. In the automotive industry,four quality zones are used (A, B, C and D). A stands for the bestquality zone. The quality <strong>de</strong>creases from A to D. Some parts atthe boun<strong>da</strong>ries of the hi<strong>de</strong>s are consi<strong>de</strong>red as waste because theirquality is too low to cut any piece.The pieces that must be placed on the hi<strong>de</strong>s are also irregular. Theymay have holes and different quality requirements. The qualityzone of a piece can never be placed on a region of the hi<strong>de</strong> witha lower quality. The characteristics of the pieces that must be cutALIO-EURO <strong>2011</strong> – 70


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>from the hi<strong>de</strong>s <strong>de</strong>pend on the application. In the shoe industry, theshapes are small compared to the size of the hi<strong>de</strong>s. In the furnitureindustry, many of the pieces are large. In the automotive industry,there is many different pieces. The area of the pieces ranges from0.1% to 6% of the area of the hi<strong>de</strong>s.A layout consists in the pieces that are placed on the hi<strong>de</strong>, andtheir corresponding position and rotation. In our case, a layout isfeasible if and only if all the pieces do not overlap, if all the piecesare placed insi<strong>de</strong> the usable area of the hi<strong>de</strong>, and if all the qualityconstraints are satisfied.3. PLACEMENT STRATEGIES BASED ON INNER-FITPOLYGONSThe no-fit polygons (NFP) are used to <strong>de</strong>termine wether two piecesoverlap or not, while the inner-fit polygons (IFP) are used to <strong>de</strong>terminewether a piece is completely contained within another, ornot. As noted in [4], the concepts of NFPs and IFPs allow the<strong>de</strong>finition of new placement approaches. In [1], we <strong>de</strong>fined newconstructive heuristics that use the information provi<strong>de</strong>d by thesepolygons to gui<strong>de</strong> the selection of the pieces, and the placement ofthese pieces into specific regions of the hi<strong>de</strong>s.The heuristics proposed in [1] can be divi<strong>de</strong>d in four stages. Thepieces are first grouped according to a given criterion (area, irregularity,value, for example). These groups are <strong>de</strong>fined such that thepieces with almost the same attributes are treated with the samepriority. Then, a piece is selected to be placed on the hi<strong>de</strong>. Oneof the criteria that we used for selecting a piece was based on thecharacteristics of the IFP of this piece with the hi<strong>de</strong>. After a piecehas been selected, we choose a region insi<strong>de</strong> the hi<strong>de</strong> where thepiece will be placed, and we evaluate the possible placement pointsinsi<strong>de</strong> that region. The point that maximizes a given criterion is selected,and the piece is placed at this point. Note that when a pieceis selected according to the characteristics of its IFP, the region ofthe hi<strong>de</strong> where this piece will be placed is inevitably this IFP.The sequence of pieces that will be used to <strong>de</strong>fine our neighborhoodstructures are obtained with the constructive procedure thatrelies on the characteristics of the IFPs. To evaluate a placementposition, we used a function based on the relative area between anoffset of the piece and the area of the polygon resulting from theintersection of this offset with the current layout and the bor<strong>de</strong>r ofthe hi<strong>de</strong>.4. VARIABLE NEIGHBORHOOD SEARCHOur algorithms are based on the variable neighbourhood search(VNS) metaheuristic. New neighbourhood structures are proposedbased on a representation of the solutions as a sequence of piecescombined with the constructive heuristic allu<strong>de</strong>d above.The selection process generates a sequence of pieces. Each pieceis placed in a given region of the hi<strong>de</strong>, which corresponds in factto a particular IFP of the piece with the hi<strong>de</strong>. For the smallestpieces, the IFP that is chosen is the smallest IFP associated to thepiece, while for the largest pieces, the IFP that is selected is thelargest one. The next step of the constructive heuristic consistsin <strong>de</strong>termining the placement position where the piece should beplaced. The possible placement positions of the hi<strong>de</strong> are evaluatedbased on the criterion <strong>de</strong>scribed above. It <strong>de</strong>pends on an offsetof the piece, and the intersection of this offset with the currentlayout and the boun<strong>da</strong>ry of the hi<strong>de</strong>. For the sake of clarity, wewill <strong>de</strong>signate this value as the fitness of the piece.Our neighborhood structures <strong>de</strong>pend on the sequence of pieces,on the values given by the evaluation function for each piece andon the value of the yield achieved after placing each one of thepieces of the sequence. Let i j <strong>de</strong>note the in<strong>de</strong>x of the piece in thesequence with a corresponding yield of j%. We explored threeneighborhood structures using the following <strong>de</strong>finitions:• for the pieces between i j1 and i j2 , let k be the piece with thelowest fitness, and g be the group of this piece. The neighborhoodsolutions consists in all the solutions obtained byremoving k, replacing it by a piece from the group g up tog − p (p is a parameter with p ≤ g), and completing thesequence by running the constructive heuristic;• for the pieces between i j1 and i j2 , we select a subsequenceof n pieces with the lowest total fitness. We replace thefirst piece of this set (k of group g) by another piece fromthe group g up to g − p. The remaining n − 1 pieces ofthe set are replaced by running the constructive heuristic.The final part of the original sequence remains unchanged.The corresponding pieces are placed on the hi<strong>de</strong> using thecriteria used in the constructive heuristic;• for the pieces between i j1 and i j2 , we select n pieces withthe lowest fitness. These pieces are replaced by other piecesfrom the corresponding group g up to the group g− p, whilethe remaining subsequences of the original sequence remainsunchanged.Note that, in the previous <strong>de</strong>finitions, j 1 , j 2 , p, and n are all parametersthat allow to configure the different neighborhoods thatwill be explored using VNS.In our first implementation, we consi<strong>de</strong>red the basic version ofVNS <strong>de</strong>scribed in [3]. The preliminary tests realized on a set ofreal instances yield promising results. Further experiments are beingconducted on an extensive set of real instances to analyze thebest set of parameters that should be applied, and also to analyzethe impact of using different constructive heuristics.5. CONCLUSIONSThe LNP with no specific constraints remains a challenge that <strong>de</strong>servesattention given the potential for savings associated to thevalue of the raw material involved. Recently, the authors proposednew constructive heuristics for this problem. In this exten<strong>de</strong>d abstract,we gave some of the <strong>de</strong>tails of a local search approach thatextends our previous work on that problem. The methods propose<strong>da</strong>re based on three different neighborhood structures that <strong>de</strong>pendson the sequence of pieces generated by the constructive procedure.6. ACKNOWLEDGEMENTSThis work was partially supported by the Algoritmi Research Centerof the University of Minho for Cláudio Alves and José Valério<strong>de</strong> Carvalho and by the Portuguese Science and Technology Foun<strong>da</strong>tionthrough the research grant SFRH/ BDE/15650/2007 for PedroBrás.7. REFERENCES[1] C. Alves, P. Brás, J. Valério <strong>de</strong> Carvalho, and T. Pinto, “Newconstructive algorithms for leather nesting in the automotiveindustry,” submitted, <strong>2011</strong>.[2] J. Heistermann and T. Lengauer, “The nesting problem inthe leather manufacturing industry.” Annals of Operations Research,vol. 57, pp. 147–173.ALIO-EURO <strong>2011</strong> – 71


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>[3] P. Hansen and N. Mla<strong>de</strong>novic, “Variable neighborhood search:principles and applications,” European Journal of OperationalResearch, vol. 130, pp. 449–467, 2001.[4] J. Bennell and J. Oliveira, “The geometry of nesting problems:a tutorial,” European Journal of Operational Research, vol.184, no. 2, pp. 397–415, 2008.ALIO-EURO <strong>2011</strong> – 72


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Nesting Problems: mixed integer formulations and valid inequalitiesAntonio Martínez Sykora ∗ Ramón Álvarez-Valdés Olaguíbel ∗José Manuel Tamarit Goerlich ∗∗ Universi<strong>da</strong>d <strong>de</strong> Valencia, Departamento <strong>de</strong> Estadística e Investigación OperativaC/Dr. Moliner, 50, 46100, Burjassot, Valencia{antonio.martinez-sykora, ramon.alvarez, jose.tamarit}@uv.esABSTRACTCutting and packing problems involving irregular shapes, usuallyknown as Nesting Problems, are common in industries rangingfrom clothing and footwear to engineering and shipbuilding. Theresearch publications on these problems are relatively scarce, comparedwith other cutting and packing problems with rectangularshapes, and have been mostly focused on heuristic approaches. Inthis paper we propose a new mixed integer formulation for theproblem and <strong>de</strong>rive some families of valid inequalities, as a firststep for <strong>de</strong>veloping an exact Branch & Cut Algorithm.Keywords: Cutting and Packing, Nesting, Integer Programming1. INTRODUCTIONNesting problems are two-dimensional cutting and packing problemsinvolving irregular shapes. These problems arise in a wi<strong>de</strong>variety of industries like garment manufacturing, sheet metal cutting,furniture making and shoe manufacturing.There are several types of nesting problems <strong>de</strong>pending on the rotationof the shapes. We can <strong>de</strong>fine three types of problems:• Without rotation: The pieces have a fixed orientation.• With specific angles of rotation: The pieces can be placedwith any of the specific angles. Usually these angles are 0 o ,90 o and 180 o .• With rotation: Pieces can be placed with any angle θ ∈[0,2π[.In this work we study the nesting problem as the problem of arranginga set of two-dimensional irregular shapes without overlappingin a rectangular stock sheet with fixed width where theobjective is to minimize the require length. We will consi<strong>de</strong>r thatpieces cannot be rotated. This problem arises, e.g, in the garmentmanufacturing, because in this industry the pattern of the fabricmust be respected. An example of a layout from the garment manufacturingindustry is provi<strong>de</strong>d in figure 1.The main difficult of nesting problems is to ensure that the pieceshave a non-overlapping configuration. This question has been studied<strong>de</strong>eply in recent years and there are several approaches which<strong>de</strong>termine when two polygons overlap. Bennell and Oliveira [2]give a tutorial of the different approaches which study the geometryof nesting problems. The problem is NP-complete and as aresult solution methodologies predominantly utilise heuristics.We consi<strong>de</strong>r the pieces approximately <strong>de</strong>scribed by polygons. Themost used tool to check if two polygons overlap is the Non FitPolygon (NFP). It can be used, along with the vector difference ofthe position of the two polygons, to <strong>de</strong>termine whether these polygonsoverlap, touch, or are separated, by conducting a simple testto i<strong>de</strong>ntify whether the resultant vector is insi<strong>de</strong> the NFP.The formulation proposed in this paper uses the Non Fit Polygonsto create inequalities for separating each pair of pieces. There aretwo different formulations using the NFPs. The first one is usedin the Simulated Annealing Algorithm proposed by Gomes andOliveira ([1]). In this formulation, they use binary variables andthe big M constant to activate and inactivate each convex regiongiven by the NFP. Fischetti and Luzzi ([3]) propose a more efficientformulation by <strong>de</strong>fining slices to have a partition of the feasibleplaces in which to arrange each pair of pieces without overlap.The slices must be disjoint but they do not specify how they buildthem. Our formulation is similar to the Fischetti and Luzzi formulation(FLF), but we consi<strong>de</strong>r horizontal slices.2. MIXED INTEGER FORMULATION FOR NESTINGPROBLEMSLet P = {p 1 ,..., p N } be the set of pieces to arrange in the strip.We consi<strong>de</strong>r that the reference point of each piece is the bottomleft corner of the enclosing rectangle. We <strong>de</strong>note by (x i ,y i ) thecoordinates of the reference point of piece p i . Let l i (w i ) be thelength (width) and let L and W be the length and width of the strip.We consi<strong>de</strong>r that the bottom left corner of the strip is placed at theorigin.!The NFP i j is the region in which the reference point of piece p jcannot be placed because it would overlap with piece p i (see figure2). The feasible zone to place p j with respect to p i is a non-convexpolygon or it could be unconnected. In the next section we presentthe Horizontal Slices, which consist of dividing this feasible zoneinto convex polygons and assigning a binary variable to each oneof these polygons.Figure 1: An example layout from garment manufacturingALIO-EURO <strong>2011</strong> – 73


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Piece iNFP ijPiece jwhere the coefficients α k fi j and β k fi j are the coefficients of the NFPinequalityf and δ k f hi j are the greatest value the left hand si<strong>de</strong> cantake when slice <strong>de</strong>fined by b i jh is turned on.Note that for each NFP i j it is necessary that one binary variableb i jk ∈ V NFPi j takes value one for separating pieces p i and p j .Then we need the following equalities in the formulation:m i j∑k=1b i jk = 1, ∀1 ≤ i ≤ j ≤ N (2)Figure 2: NFP i j . If the reference point of p j is in the NFP i j thenp j overlaps p i .2.1. Horizontal SlicesLet NFP i j := {r 1 ,...,r n } be the NFP of pieces p i and p j such thatr t , ∀t ∈ {1,...,n}, represents the vertices of the NFP in anticlockwiseor<strong>de</strong>r. In or<strong>de</strong>r to build the horizontal slices, we require theNFP i j to be convex. There are two possibilities:• The NFP i j has no concavities. We <strong>de</strong>fine one horizontalslice for each edge.• The NFP i j has concavities. We close all the concavities inor<strong>de</strong>r to obtain a convex polygon. In this case we build ahorizontal slice for each edge of the modified NFP i j andfor each created hole. If the polygon has k concavities thenwe build k holes of the NFP i j .To each slice we associate a binary variable b k which takes thevalue 1 if the reference point of piece j is in the slice and 0 otherwise.The set of all binary variables associated with a NFP i j is<strong>de</strong>noted by V NFP i j . In figure 3 we can find the set of variablesassociated to NFP i j . Variable b i j4 corresponds to the concavity ofthe NFP i j .2.3. Bounds for the position of the piecesEach piece must be placed entirely into the strip so the referencepoint must satisfy some bound constraints. The usual bound constrainsare:0 ≤ x i ≤ L − l i , ∀i = 1,...,N (3)0 ≤ y i ≤ W − w i , ∀i = 1,...,N (4)We add to the formulation more bound constraints by lifting theseinequalities. Let L i j (R i j ) and D i j (U i j ) be the subsets of binaryvariables such that piece i protru<strong>de</strong> from the left (right) or below(over), respectively, of piece j. Let λi kj (µk i j ) be the minimum quantitysuch that piece p j protru<strong>de</strong> horizontally (vertically) to piece p iwhen the slice <strong>de</strong>fined by b k ∈ V NFP i j is turned on.For each one of the inequalities (3) and (4) we build N inequalitiesby adding binary variables as follows:x i ≤ L − l i − ∑ λi kj b i jk, ∀i, j ∈ {1,...,N} (5)b i jk ∈L i jy i ≤ W − w i − ∑b i jk ∈D i jµ k i j b i jk, ∀i, j ∈ {1,...,N} (6)Inequalities (5) indicate that if any binary variable b i jk which forcespiece p j to be placed at the right of piece p i is turned on then thelength of the strip L must be greater than x i + l i + λi kj . Inequalities(6) have the same i<strong>de</strong>a in a vertical direction.We use a similar i<strong>de</strong>a to lift the inequalities on the left of (below)the strip. In (8) and (9) of the formulation we can see these boundconstraints.2.4. Mixed Integer FormulationFigure 3: Horizontal SlicesThe Horizontal Slices Formulation (HSF) is the following one:2.2. NFP constraintsFor each pair of pieces (p i , p k ), we use the NFP i j to build thenecessary constraints to place this pair of pieces without overlap.Let us consi<strong>de</strong>r the binary variables b i j ∈ V NFP i j <strong>de</strong>fined above.Let us <strong>de</strong>note by m i j the number of binary variables consi<strong>de</strong>red inV NFP i j . Each slice is <strong>de</strong>scribed by several inequalities. The slicesare limited by L sup , an upper bound for the length of the strip.We use the constraints proposed by Fischetti and Luzzi (FLF) [3],a<strong>da</strong>pting them to our horizontal slices and closed concavities:α k fi j (x j − x i ) + β k fm i ji j (y j − y i ) ≤ ∑h=1δ k f hi j b i jh (1)Objective Function: minL (7)s.t.∑ λi kj b i jk ≤ x i ≤ L − l i − λi kj b i jk, (8)b i jk ∈R i j b i jk ∈L i j∀i, j ∈ {1,...,N}∑ µ i k j b i jk ≤ y i ≤ W − w i − ∑ µ i k j b i jk, (9)b i jk ∈U i j b i jk ∈D i j∀i, j ∈ {1,...,N}α k fi j (x j − x i ) + β k f∑m i ji j (y j − y i ) ≤ ∑h=1δ k f hi j b i jh , (10)ALIO-EURO <strong>2011</strong> – 74


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>∀1 ≤ i ≤ j ≤ N, ∀k = 1,...,m i j (11)m i j∑k=1b i jk = 1, ∀1 ≤ i ≤ j ≤ N (12)b i jk ∈ {0,1}, ∀1 ≤ i ≤ j ≤ N (13)The objective function minimizes the required strip length (7).Constraints (8) constraints (9) are the bound constraints of thepieces. Inequalities (10) are the corresponding NFP inequalitiesand constraints (12) indicate that one slice of each NFP must beturned on (inequalities (2)).3. VALID INEQUALITIES FOR NESTING PROBLEMSIn this section we present some valid inequalities for the HSF.When we relax the integer conditions of the Mixed Integer Formulationwe usually obtain a non integer solution. The inequalitiespresented here can be very useful if we want to cut some nonvalid solutions. The first kind of inequalities are the LU covers.These inequalities ensure that the columns of pieces fit into thestrip. The same i<strong>de</strong>a is used in the second inequalities, the cliquesand covers. The third inequalities are the Transitivity Constraintsin which the i<strong>de</strong>a is to place a set of pieces consistently, and do notturn on variables which are incompatible. Finally, we introducethe impenetrability constraints relating binary variables with thevariables associated to the reference points of the pieces.3.1. LU-cover inequalitiesLet us consi<strong>de</strong>r the polygon of the NFP i j where the referencepoint of piece p i is placed at (0,0). Let us <strong>de</strong>note by Y i j (Y i j )the maximum (minimum) value of the NFP i j on the Y − axis andlet y i jk (y i jk) be the maximum (minimum) value of the slice on theY − axis.Let us consi<strong>de</strong>r that variable b i jk is turned on. If we want to knowhow much the piece p j protru<strong>de</strong>s from the piece p i (or viceversa)in a vertical way we need to calculate Y i j − y i jk(if y i jk> 0) or(−1)Y i j − (−1)y i jk (if y i jk < 0). This difference can be viewe<strong>da</strong>s the quantity of width that the pieces share. Then we comparethis difference with the minimum width of the pieces p i andp j (min i, j {w i ,w j }). If the difference is lower than the minimumwidth, there is a part of piece p j which protru<strong>de</strong>s from piece p i . Incase that y i jk< 0 and y i jk > 0 the slice allows to place the referencepoint of the two pieces on the same y-coordinate, and in thiscase the pieces do not pile up.Let p i y p j be two pieces. We <strong>de</strong>note by U ∗ ij (D∗ ij ) the subsets ofbinary variables which <strong>de</strong>fine slices of the NFP i j such that, whenthey are turned on, they put p j above p i (p j below p i ):U ∗ ij := {b i jk | Y i j − y i jk< w i j }D ∗ ij := {b i jk | (−1)Y i j − (−1)y i jk < w i j }where w i j := min{w i ,w j }.Let C = {p 1 ,..., p r }, 1 < r ≤ N, be a subset of r pieces, and letU ′ st ⊆ U ∗ st, U ′ st ≠ /0 and D ′ st ⊆ D ∗ st, D ′ st ≠ /0, ∀1 ≤ s < t ≤ r. We<strong>de</strong>note by UD ′ st := U ′ st ∪ D ′ st. Note that U ′ st = D ′ ts ∀p s , p t ∈ C.Proposition:Letr−1δ := max { ∑τ∈π{C} t=1∑l∈U ′ τ(t)τ(t+1)q τ(t)τ(t+1)l b τ(t)τ(t+1)l }and let q τ(t)τ(t+1)l be the amount of overlapping along the Y-axisbetween piece τ(t + 1) and τ(t) when b τ(t)τ(t+1)l is turned on.π{C} is the set of all the permutations of the pieces in C. Therefore,δ is the maximum overlap between the pieces of C in anyor<strong>de</strong>r.If inequality (14) is satisfied, then constraint (15) is a valid inequalityfor the Nesting problem. We say that constraint (15) is aLU-cover inequality.r∑ w s − δ > W (14)s=1r−1 r∑ ∑ ∑ b slks=1 l=s+1 k∈UD ′ sl3.2. Cliques and covers≤r−1∑ (r − s) − 1. (15)s=1These constraints are based on the same i<strong>de</strong>a of the LU covers inequalitiesbut in this case we consi<strong>de</strong>r a fixed permutation of the rpieces, e.g {p 1 ,..., p r }, and we have to check whether condition(14) is satisfied by the given permutation. In this case we onlyconsi<strong>de</strong>r the NFPs that separate adjacent pieces in the or<strong>de</strong>r givenby the permutation. That implies that inequality (15) has fewervariables.We present only the case of three pieces, but it could be generalizedto r pieces. The case of the three pieces, called cliques, has aright hand si<strong>de</strong> of 1, and the case of r (r > 3) pieces, called covers,has a right hand si<strong>de</strong> of r − 2.Proposition:Let us consi<strong>de</strong>r three pieces, p i , p j and p k . If there are two subsetsU ′ 1 ⊆ U jk, U ′ 2 ⊆ U kl,U ′ 2 ≠ /0, such that ∀s ∈ U′ 1 and ∀t ∈ U′ 2y s i j i k+ y t i k i l> W − w l is satisfied, then inequality (16) is valid.∑ b jks + ∑ b kls ≤ 1. (16)s∈U 1′ s∈U 2′These inequalities could also be <strong>de</strong>fine in a horizontal sense.3.3. Transitivity InequalitiesThese constraints are <strong>de</strong>signed to forbid incompatible slices beingturned on. In other words, if two slices separating pieces 1 − 2and 1 − 3 are turned on, the relative position of pieces 2 − 3 canbe limited and there could exist slices from NFP 23 such that areincompatible with the previous ones.In this section we present only the transitivity inequalities involvingthree pieces. This i<strong>de</strong>a could be generalized consi<strong>de</strong>ring npieces, but it would be more complicated with more computationaleffort.Proposition:Let us consi<strong>de</strong>r 3 pieces, i, j y k. Let b i j1 , b ik1 and b jk1 be threevariables <strong>de</strong>fining, respectively, one slice of the NFP i j , NFP ik andNFP jk . If b i j1 = b ik1 = 1 they <strong>de</strong>fine a region for the relative positionof p k with respect to p j . If the slice <strong>de</strong>fined by b jk1 does notintersect this region then these three variables cannot be equal to 1simultaneously and the corresponding transitivity constraint is:b i j1 + b ik1 + b jk1 ≤ 2 (17)If there are other variables of NFP i j incompatibles with b ik1 andb jk1 then can be ad<strong>de</strong>d to the right hand si<strong>de</strong> of (17).ALIO-EURO <strong>2011</strong> – 75


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>3.4. Impenetrability InequalitiesThe Impenetrability inequalities are based on the study of the sumof the coordinates of the pieces. If we relax the integer conditionsof the variables and we solve the problem, then it is usual to findthat all the pieces have been placed close to the origin. The i<strong>de</strong>a ofthese inequalities is to move the pieces beyond the origin, <strong>de</strong>pendingon which binary variables are positive.Let p i and p j be two pieces, 1 ≤ i < j ≤ N. Then, we study howmuch the value of the sum S := x i +x j +y i +y j could be improvedusing the binary variables. The i<strong>de</strong>a is to minimize S in each oneof the slices <strong>de</strong>fined by the NFP i j . An Impenetrability constrainthas the following form:m i j∑S ≥ ωi k j b i jk, (18)k=1where the coefficients ωi k j are the solutions of the linear problemwhich consist of minimizing S subject to the constraints that <strong>de</strong>finethe slice b i jk . These inequalities are valid by construction.It would be interesting to add to the inequality other variablescorresponding to other NFPs. Let us consi<strong>de</strong>r p r and a variableb irl ∈ V NFP ir . If we want to inclu<strong>de</strong> this variable to the right handsi<strong>de</strong> of (18), we have to study in which way the coefficients ω k i jhave to be modified. This study requires to check all the coefficientsevery time we want to inclu<strong>de</strong> a new variable.4. CONCLUSIONSIn this paper we have proposed a new Mixed Integer Formulationfor the Nesting Problem. The HS formulation modifies the FLformulation in two ways. On the one hand, the <strong>de</strong>finition of horizontalslices, which restrict the vertical position of the pieces. Onthe other hand, the lifted bound constraints. The use of horizontalslices allows us to fix many binary variables to 0. We have alsointroduced some new valid inequalities, which have been foundstudying the linear relaxation of the formulation. Again, the horizontalslices are very useful for <strong>de</strong>fining strong valid inequalities.In these two aspects, the proposed formulation seems to improvethe previous ones, as a preliminary computational experience hasshown.This work can be consi<strong>de</strong>red the first part of a study about thisproblem that will lead us to the <strong>de</strong>sign and implementation of exactand heuristic procedures. More concretely, in the second phaseof our work we are <strong>de</strong>veloping a Branch-and-Cut algorithm. Theformulation and the valid inequalities presented in this paper arethe basic components of the algorithm, but other important questionshave to be addressed, such as the branching strategy and the<strong>de</strong>velopment of efficient separation algorithms for i<strong>de</strong>ntifying violatedinequalities.5. ACKNOWLEDGEMENTSThis study has been partially supported by the Ministerio <strong>de</strong> Cienciae Innovación of Spain through project DPI2008-02700, co financedby FEDER funds.6. REFERENCES[1] A.M.Gomes and J.F.Oliveira, “Solving irregular strip packingproblems by hybridising simulated annealing and linear programming,”European Journal of Operational Research, vol.171, pp. 811–829, Oct. 2006.[2] J.A.Bennell and J.F.Oliveira, “A typology of cutting and packingproblems,” European Journal of Operational Research,vol. 184, pp. 397–415, Nov. 2008.[3] M.Fischetti and I.Luzzi, “Exact and heuristic mip mo<strong>de</strong>ls fornesting problems,” 2003.ALIO-EURO <strong>2011</strong> – 76


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Matheuristics for Traffic Counter LocationMarco A. Boschetti ∗ Vittorio Maniezzo † Matteo Roffilli †Antonio José Bolufé Röhler ‡∗ Dept. MathematicsUniversity of Bologna, Cesena, Italymarco.boschetti@unibo.it† Dept. Computer ScienceUniversity of Bologna, Cesena, Italyvittorio.maniezzo@unibo.it‡ Dept. Artificial Intelligence and Computer SystemsUniversity of Habana, Habana, Cubabolufe@matcom.uh.cuABSTRACTMatheuristic algorithms have begun to <strong>de</strong>monstrate that they canbe the state of the art for some optimization problems. This paperputs forth that they can represent a viable option also in an applicativecontext. The possibility to get a solution quality vali<strong>da</strong>tionor a mo<strong>de</strong>l groun<strong>de</strong>d construction may become a significantcompetitive advantage against alternative approaches. This viewis substantiated in this work by an application on the problem of<strong>de</strong>termining the best set of locations for a constrained number oftraffic counters, to the end of estimating a traffic origin / <strong>de</strong>stinationmatrix. We implemented a Lagrangean heuristic and testedit on instances of different size. A real world use case is also reported.Keywords: Matheuristics, Traffic counters, Location problems,Real world applications1. INTRODUCTIONMatheuristic algorithms are the state of the art for some optimizationproblems [1, 2, 3] and, besi<strong>de</strong>s their theoretical involvement,they can represent a viable option also in an applicative context. Infact, the possibility to get an online vali<strong>da</strong>tion of the solution quality,for example by means of a bound, or a mo<strong>de</strong>l groun<strong>de</strong>d constructionwhich justifies construction choices, may be a significantcompetitive advantage against alternative approaches. In spite ofthe relative youth of this application field, several works have infact reported about the possibility to use matheuristics techniquesfor implementing applications targeted for real-world <strong>de</strong>ployment.This possibility is substantiated also in this work by an applicationon the problem of <strong>de</strong>termining the best locations for a givennumber of traffic counters, to the end of estimating a traffic Origin- Destination matrix (OD matrix) of traffic flows. The applicationsupports a planner in inferring the OD matrix by <strong>de</strong>terminingwhere to locate counters in such a way that the chosen positionswill be the most informative for the specific estimation algorithmthat shall be used.The problem is already known in the literature, where it was presentedun<strong>de</strong>r the name of Network Count Location Problem (NCLP).A problem closely related to the NCLP is the Link Count LocationProblem (LCLP), which asks to <strong>de</strong>termine the best position of acounter along a link. In this work we are only interested in theNCLP, possibly leaving the LCLP as a further study.The most relevant literature contributions for the NCLP inclu<strong>de</strong>the work of Ehlert et al. [4], where they propose a MIP-basedtool which was put to actual use on a road network of 1414 directedlinks, divi<strong>de</strong>d into 23 O/D zones. This approach is relatedto the one we put forth here, while different approaches were usedby Yang and Zhou [5], who used selection rules, and by Bell andGrosso [6, 7], who used path flows estimations. Overviews arealso available for this problem, for recent ones see Cascetta andPastorino [8] and Wang et al. [9].2. PROBLEM SPECIFICATIONThe general context in which the problem arises is that of inferringan OD matrix of traffic flows. Within this framework, the NCLPasks to <strong>de</strong>termine which is the best positioning for a set C of trafficcounters, that is, the positions which provi<strong>de</strong> most informationto a subsequent OD estimation algorithm. This should take intoaccount also the possibility of having pre-installed fixed counterswhich cannot be moved and whose information must be consi<strong>de</strong>redfor the subsequent OD estimation.One possible formulation of the problem is the following.Given a road network N represented by a multigraph G = (V,A),with V = V s ∪V c and A = A s ∪ A c where A s is the subset of actualroad network arcs, representing the different lanes of the roads ofinterests (or the carriageways in case of motorways), V s the subsetof its endpoints (crossways of the road network), V c is a subset ofdummy no<strong>de</strong>s, each of which is associated with an origin or with a<strong>de</strong>stination and A c is a subset of dummy arcs, which connect eachorigin or <strong>de</strong>stination no<strong>de</strong> to all no<strong>de</strong>s in V s belonging to the zonemo<strong>de</strong>led by that origin or <strong>de</strong>stination.We want to <strong>de</strong>termine the arcs where the counters of set C are to bemost conveniently located. That is, we want to <strong>de</strong>termine the arcsubset Ā, Ā ⊆ A s , on whose arcs a traffic count ¯f i j will be obtained.An obvious precondition is the ability to <strong>de</strong>termine an estimate ofthe traffic flow f i j on each arc (i, j) ∈ A. Details on a possibleprocedure for this can be found in Gabrielli et al. [10, 11]. Anactual traffic count, ¯f i j , could also be already available for the arcsof a subset of A s .The OD matrix is mo<strong>de</strong>led as an in<strong>de</strong>x set Λ = [l] of OD pairs, eachof them with an associated <strong>de</strong>mand ω l ∈ Ω. Demands will even-ALIO-EURO <strong>2011</strong> – 77


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>tually distribute over traffic flows ϕ p , running on directed pathsp, p ∈ Π l , where Π l is the in<strong>de</strong>x set of paths for OD pair l andΠ = ⋃ l∈Λ Π l . The objective asks to minimize an additive generalizedcost, which can be computed for each arc (i, j) in relationto the time nee<strong>de</strong>d for traveling through the arc, in accor<strong>da</strong>nce toWardrop’s principle, and it is a function c i j ( f i j ) of the flow throughit. The basic traffic assignment problem is thus as follows:(TAP) min ∑ c i j ( f i j ) (1)(i, j)∈As.t. ∑ ϕ p = b l ,p∈Π ll ∈ Λ (2)f i j = ∑ δ pi j ϕ p ≤ u i j (i, j) ∈ A (3)p∈ΠHere b l represents the origin to <strong>de</strong>stination <strong>de</strong>mand for OD pairl, δ pi j is a constant equal to 1 if arc (i, j) belongs to path p, 0otherwise, and u i j is the theoretical capacity of arc (i, j).A significant problem to be faced in this kind of applications is theinherent unreliability of the OD matrix. The matrix is usually obtainedfrom interviews and/or inductions from geographic and economical<strong>da</strong>ta, and it is therefore intrinsically approximated. Moreover,OD <strong>da</strong>ta is possibly obsolete. This motivated substantial researchaimed at up<strong>da</strong>ting OD matrices, including several methodsbased on actual traffic counts on road arcs.The OD matrix estimation problem was mo<strong>de</strong>led as a constrainedquadratic optimization problem. Input <strong>da</strong>ta are the flows ϕ p oneach path p ∈ Π, the old OD matrix, Ω = ¯[ ¯ω, l] the set ¯F = { f¯i j }of the sampled flows for each arc in Ā, a lower bound L l and anupper bound U l for each OD pair l ∈ Λ.The new OD matrix is computed as a tra<strong>de</strong>-off between the objectiveof minimizing the quadratic difference from ¯Ω and that ofminimizing the difference of the flows f i j induced in each arc inĀ with ¯f i j , where the ¯f i j are measured by actual traffic counters,un<strong>de</strong>r constraints on L l and U l . To compute it, we need the usageratio of each arc (i, j) for each pair l, which is computed asρi l j = ∑p∈Π δ pl i j ϕ p∑ p∈Πl ϕ p, where Π l is the in<strong>de</strong>x set of all paths for ODpair l as computed by the assignment. The formulation of the ODrefinement problem becomes as follows:()(ODP) min ∑ (ω l − ¯ω l ) 2 + γ ∑ ∑ ω l ρi l j − ¯f i jl∈λ(i, j)∈A l∈Λs.t.L l ≤ ω l ≤ U i j(4)l ∈ Λ(5)where γ is a user-<strong>de</strong>fined parameters which biases the result towardhaving an OD matrix structurally close to the old one and awayfrom having assignments close to the sampled ones, or vice-versa.To <strong>de</strong>termine subset Ā we propose to use the following mo<strong>de</strong>l. Themo<strong>de</strong>l is based on an operational assumption: each counter, whenplaced on a two way road, is able to provi<strong>de</strong> <strong>da</strong>ta for both drivingdirections. Therefore, one counter will provi<strong>de</strong> <strong>da</strong>ta for two arcs inA, when the two correspond to the driving directions of a two-wayroad. We need anyhow to have counting <strong>da</strong>ta associated to arcs inor<strong>de</strong>r to provi<strong>de</strong> the nee<strong>de</strong>d input to the OD estimator.In the mo<strong>de</strong>l, we associate a binary variable x i j to each arc (i, j) ofthe road network N. Each network arc (i, j) ∈ N could correspondto one arc (i, j) ∈ A or to a pair of arcs (i, j) ∈ A, ( j,i) ∈ A, <strong>de</strong>pendingon whether it is a one-way or a two-way road. The x i j variableis equal to 1 iff the arc will be chosen for hosting a counter. Furthermore,we associate a binary variable ξ p to each possible path pbetween origins and <strong>de</strong>stinations in N (i.e., between no<strong>de</strong>s in V c ).The mo<strong>de</strong>l tries to minimize the number of OD pairs (i.e., the numberof paths) which won’t be sampled by any counter. Variables ξact as slacks in the covering constraints, permitting to cover a pathwith an expensive slack variable if no counter can be used. Theprice c p of each ξ p variable could also be a function of prior ODvalues, when available. The problem asks then to solve the followingSet Covering problem with an additional knapsack constraint:(TCL) min ∑ c p ξ p (6)p∈Πs.t. ∑ a p i j x i j + ξ p ≥ 1, p ∈ Π (7)(i j)∈N∑(i j)∈Nx i j ≤ n, (8)x i j ,ξ p ∈ {0,1} (i, j) ∈ N, p ∈ Π (9)where n is the cardinality of C and a p i j is a coefficient equal to 1 ifarc (i, j) enters path p, 0 otherwise. Notice that x variables can befixed to trivially account for pre-installed counters.3. A LAGRANGEAN SOLUTIONFormulation TCL can be effectively solved for small to mid sizedproblem instances. This is already enough for a number a of realworld applications, thus a direct use of a MIP solver is an optionto consi<strong>de</strong>r when facing an actual case. However, instances couldbecome too big to be solved to optimality within a required timelimit. In these cases heuristics are in or<strong>de</strong>r. We propose a Lagrangeanapproach for <strong>de</strong>signing a metaheuristic able to effectivelycope with big TCL instances.3.1. Lagrangean relaxationFormulation TCL can be simplified by relaxing the covering constraints7, or the knapsack constraint 8 or both. After some preliminarytesting, we went for option one and we relaxed the coveringconstraints, keeping the knapsack. The relaxed formulation becomesthe following.(LTCL) min ∑ (c p − λ p )ξ p − ∑p∈Πs.t.∑p∈Π (i j)∈Nλ p a p i j x i j + ∑ λ pp∈Π(10)∑ x i j ≤ n, (11)(i j)∈Nx i j ,ξ p ∈ {0,1}, (i, j) ∈ N, p ∈ Π (12)λ p ≥ 0 p ∈ Π (13)The <strong>de</strong>riving subproblem, with given penalties, can be easily solvedby inspection, by setting to 1 all ξ variables with negative coefficientand by choosing the n variables of type x i j with greater coefficients.3.2. Lagrangean MetaheuristicsFormulation LTCL can be used both for obtaining a bound on theoptimal solution cost and a feasible, high quality solution. Wewent along, implementing a Lagrangean metaheuristic [12] for theTCLP, based on a subgradient solution of the Lagrangean dual ofALIO-EURO <strong>2011</strong> – 78


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>formulation LTCL. The general structure of the algorithm is as inBoschetti, Maniezzo [13]:LAGRHEURISTIC()1: i<strong>de</strong>ntify an "easy" subproblem LR(λ)2: repeat3: solve subproblem LR(λ) obtaining solution x4: check for unsatisfied constraints5: up<strong>da</strong>te penalties λ6: construct problem solution using x and λ7: until (end_condition)where subproblem LR corresponds to LTCL, and penalty up<strong>da</strong>tesis implemented as an a<strong>da</strong>ptive subgradient algorithm, as specifiedin Boschetti et al. [12].In our case, each iteration of the subgradient algorithm directlyprovi<strong>de</strong>s also a feasible problem solution, as the inspection ofLTCL variable costs permits to <strong>de</strong>termine a subset of n arcs, whichwill be those suggested for locating the traffic counters. A simplelocal search is used (and nee<strong>de</strong>d) to fine-tune the solutions.4. USE CASESWe implemented an operational solution, coding the above algorithmin c# un<strong>de</strong>r .Net framework 4. The solution comprises alsoan IP optimization of formulation TCL, empowered by CoinMP(for which a c# wrapper is freely available [14]). Data was importe<strong>da</strong>nd exported from ESRI ArcGis [15] and preprocessed inPostGIS [16]. We had the possibility to test our approach on threereal-world instances, <strong>de</strong>fined on <strong>da</strong>ta of three municipalities innorthern Italy.The main characteristics of the instances are summarized in Table1, where the columns show:• id: an i<strong>de</strong>ntifier of the instance• Surf: the surface of the municipality, in square Km• Inh: the number of inhabitants of the municipality• Dens: the resi<strong>de</strong>nt population <strong>de</strong>nsity of the municipality• No<strong>de</strong>s: the number of no<strong>de</strong>s of the road graph• Arcs: the number of arcs of the road graph• Zones: the number of zones for which the OD movementsare to be estimated• Count: the number of counters to locateIn all instances the number of counters to locate is to be inten<strong>de</strong><strong>da</strong>s a number in addition to those already installed in the territory.MunicipalityRoad graphid Surf Inh Dens No<strong>de</strong>s Arcs Zones CountA 56.89 10651 187 795 1898 14 25B 45.13 25375 562 1904 5210 12 24C 7.58 10275 1355 3469 8136 13 28Table 1: Real world instances.Notwithstanding with the relative small scale of the tested instances- which is anyway aligned with that of the biggest instances so farpresented in the literature - the results were of interest. Each instancecould be solved in less than 10 seconds on a 3 GHz PentiumDuo machine with 2 Gb of RAM, providing solutions which wereof interest for the final user.Figures 1 present input <strong>da</strong>ta (top) and final solution (bottom, counte<strong>da</strong>rcs in red) for instance A, the smallest of the three. A noteworthyFigure 1: Instance A: OD zones and transfer paths (top), counte<strong>da</strong>rcs (bottom).characteristic of the solution was that the counting locations wereset on nonintuitive arcs. In several cases in fact it is suggested tocount traffic flows composed by many paths, which can be disambiguatedconsi<strong>de</strong>ring the whole set of observations.Figure 2 present a wi<strong>de</strong> area view of the territory of interest for instanceB, as several zones were <strong>de</strong>fined outsi<strong>de</strong> of the municipalityof interest because significant flows were originated far from themunicipality. It was requested to also <strong>de</strong>termine the flows specificallyoriginated from the (relatively) far origins. In fact, some arcscorrespond to highway tracts. The different zones internal to themunicipality are here con<strong>de</strong>nsed in the central cluster. Again, thesolution was able to provi<strong>de</strong> a feasible scenario of interest for theoperator.Finally, figure 3 presents a wi<strong>de</strong> area view of instance C, where thesmallest roads are not drawn. The same consi<strong>de</strong>rations ma<strong>de</strong> forinstance B can be applied also here.In conclusion, we like to point out how the proposed procedureproved effective in the operational contexts where it was tested. Astrong point we like to make is that the procedure was used in anoperational process, <strong>de</strong>aling with real-world <strong>da</strong>ta and constraintsand operating on a legacy field system, thus providing an endorsementfor the use of matheuristics in real-world applications.We are now consi<strong>de</strong>ring bigger size instances. We are confi<strong>de</strong>ntthat the procedure can be used also for bigger municipalities asits primary use is for the location of additional counters, and thealready located ones do not increase the instance complexity.ALIO-EURO <strong>2011</strong> – 79


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Figure 2: Instance B: OD zones and transfer paths.Figure 3: Instance C: OD zones and transfer paths.5. REFERENCES[1] P. Hansen, V. Maniezzo, and S. Voss, “Special issue on mathematicalcontributions to metaheuristics editorial,” Journal ofHeuristics, vol. 15, no. 3, pp. 197–199, 2009.[2] M. A. Boschetti, V. Maniezzo, M. Roffilli, and A. B. Röhler,“Matheuristics: Optimization, simulation and control,”in Hybrid Metaheuristics, 2009, pp. 171–177.[3] V. Maniezzo, T. Stützle, and S. Voss, Eds., Matheuristics:Hybridizing Metaheuristics and Mathematical Programming,1st ed., ser. Annals of Information Systems. NewYork: Springer, 2010, no. 10, iSBN: 978-1-4419-1305-0.[4] A. Ehlert, M. G. H. Bell, and S. Grosso, “The optimisationof traffic count locations in road networks,” TransportationResearch Part B: Methodological, vol. 40, no. 6, pp. 460–479, 2006.[5] H. Yang and J. Zhou, “Optimal traffic counting locationsfor origin-<strong>de</strong>stination matrix estimation,” Transportation ResearchPart B: Methodological, vol. 32, no. 2, pp. 109 – 126,1998.[6] M. Bell and S. Grosso, “The path flow estimator as a networkobserver,” Traffic Engineering and Control, vol. 39, no. 10,pp. 540–550, 1998.[7] ——, “Estimating path flows from traffic counts,” in Trafficand Mobility, H. Wallentowitzm, Ed. Berlin, Germany:Springer Verlag, 1999, pp. 85?–105.[8] E. Cascetta and M. Postorino, “Fixed point approaches tothe estimation of o/d matrices using traffic counts on congestednetworks,” Transportation Science, vol. 35, pp. 134–147, 2001.[9] H. Wang, K. Li, J. Sun, and Y. Liu, “Framework on hierarchicaloptimization of traffic count location for city traffic system,”Power Electronics and Intelligent Transportation System,Workshop on, vol. 0, pp. 419–422, 2008.[10] R. Gabrielli, A. Gui<strong>da</strong>zzi, M. A. Boschetti, V. Maniezzo, andM. Roffilli, “Practical origin-<strong>de</strong>stination traffic flow estimation,”in Proc. ODYSSEUS 2006, Third International Workshopon Freight Transportation and Logistics, Altea (Spain),2006.[11] ——, “A<strong>da</strong>ptive traffic flow estimation,” in LION 2007 WorkingPapers, Learning and Intelligent OptimizatioN, An<strong>da</strong>lo(Trento) - Italy, 2007.[12] M. A. Boschetti, V. Maniezzo, and M. Roffilli, “A fully distributedlagrangean solution for a p2p overlay network <strong>de</strong>signproblem,” INFORMS Journal on Computing, <strong>2011</strong>, publishedonline in Articles in Advance.[13] M. A. Boschetti and V. Maniezzo, “Ben<strong>de</strong>rs <strong>de</strong>composition,lagrangean relaxation and metaheuristic <strong>de</strong>sign,” Journal ofHeuristics, vol. 15, no. 3, pp. 283–312, 2009.[14] V. Maniezzo, “A c# wrapper for coinmp,” January <strong>2011</strong>, http://astarte.csr.unibo.it/coinORwrapper/coinORwrapper.htm.[15] ESRI, “Arcgis,” January <strong>2011</strong>, http://www.esri.com/software/arcgis/in<strong>de</strong>x.html.[16] “Postgis,” January <strong>2011</strong>, http://postgis.refractions.net/.ALIO-EURO <strong>2011</strong> – 80


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>A Matheuristic Algorithm for Auto-Carrier TransportationMauro Dell’Amico ∗ Simone Falavigna ∗ Manuel Iori ∗∗ DISMI, University of Mo<strong>de</strong>na and Reggio EmiliaVia Amendola 2, 42122 Reggio Emilia, Italy{mauro.<strong>de</strong>llamico, simone.falavigna, manuel.iori}@unimore.itABSTRACTWe study a real-world distribution problem arising in the automotivefield, in which cars and other vehicles have to be loa<strong>de</strong>d onauto-carriers and then <strong>de</strong>livered to <strong>de</strong>alers. The solution of theproblem involves both the computation of the routing of the autocarriersalong the road network and the <strong>de</strong>termination of a feasibleloading for each auto-carrier. We solve the problem by means ofa heuristic algorithm that makes use of simple greedy and localsearch strategies for the routing part, and more complex mathematicalmo<strong>de</strong>ling and branch-and-bound techniques for the loadingpart. Preliminary computational results show that good savingson the total routing distance can be obtained within small computationalefforts.Keywords: Vehicle routing, Matheuristics, Auto-carrier transportation1. INTRODUCTIONThe automotive industry represents a very important sector of mo<strong>de</strong>rneconomies, as confirmed by the weight of turnover in GDP(3.5% in Europe in 2009) and on the number of vehicles that circulateon roads (224 million vehicles in Europe in 2009). Oneof the main logistic issues in this sector concerns the <strong>de</strong>livery ofvehicles (e.g., cars, vans or trucks) to <strong>de</strong>alers.Usually vehicle manufacturers do not <strong>de</strong>liver their products directly,but rely on special logistic companies. These companiesreceive the vehicles from the manufacturers, stock them in storageareas and <strong>de</strong>liver them to the <strong>de</strong>alers when or<strong>de</strong>red. The <strong>de</strong>liveriesare provi<strong>de</strong>d by special trucks, called auto-carriers, composed by atractor and perhaps a trailer, both usually equipped with upper andlower loading planes. An example of a typical auto-carrier is <strong>de</strong>pictedin Figure 1. The <strong>de</strong>picted loading is composed by i<strong>de</strong>nticalvehicles, but, in most of the cases, loadings involving heterogeneousvehicles occur.The loading capacity of an auto-carrier strongly <strong>de</strong>pends on thevehicles dimensions and shapes. To increase such capacity autocarriersare usually equipped with particular loading equipments.For example, vehicles may be partially rotated and the upper loadingplanes may be translated vertically and/or rotated, see againFigure 1. Both upper and lower planes can also be exten<strong>de</strong>d toincrease their lengths. Additional loading constraints come fromtransportation laws, that impose maximum height, length and weightof the cargo. Note that the width is negligible, because vehiclescannot be transported si<strong>de</strong>-by-si<strong>de</strong> on the auto-carriers.The <strong>de</strong>alers are spread out over large areas, and it is infrequent thata single <strong>de</strong>aler or<strong>de</strong>r can fill exactly the capacity of one or moreauto-carriers. For this reason the companies are forced to loaddifferent or<strong>de</strong>rs from different <strong>de</strong>alers into the same auto-carriers.Note also that the auto-carriers are rear-loa<strong>de</strong>d and the loadingsmust preserve a Last In First Out (LIFO) policy: it must alwaysbe possible to unload a vehicle at a <strong>de</strong>aler without moving othervehicles directed to following <strong>de</strong>alers.This work is <strong>de</strong>voted to the study of a real-world case <strong>de</strong>rived fromthe every<strong>da</strong>y activity of one of these logistic companies. The company<strong>de</strong>livers vehicles all over Italy through a large fleet of heterogeneousauto-carriers. Their activity involves multiple <strong>da</strong>ys,multiple <strong>de</strong>pots, pickups-and-<strong>de</strong>liveries, not to mention the uncertaintiesthat typically arise in routing problems. In this work welimit the study to one <strong>da</strong>y (i.e., <strong>de</strong>liveries cannot be postponed)and one <strong>de</strong>pot (the main <strong>de</strong>pot of the company), and focus on theminimization of the number of kilometers traveled.Despite these assumptions, the resulting combinatorial problemis very complex, as it requires not only the solution of a twodimensionalnon-convex loading problem for each auto-carrier, butalso the routing of the auto-carriers along the road network. Boththese two sub-problems are NP-hard. Moreover, the size of theproblems we address is very large: on average 800 vehicles are<strong>de</strong>livered every<strong>da</strong>y to about 200 <strong>de</strong>alers in the instances that wereprovi<strong>de</strong>d to us. It is thus natural to focus on heuristic techniques.We <strong>de</strong>veloped a constructive heuristic and some local search techniquesbased on classical i<strong>de</strong>as from the vehicle routing literature.Any time one of these techniques has to <strong>de</strong>termine the feasibilityof the loading associated to a route, it invokes a given loadingalgorithm. Such algorithm is based on an approximation of theoriginal non-convex two-dimensional loading problem, which issolved by means of 1) an integer linear mo<strong>de</strong>l or 2) a combinatorialbranch-and-bound technique. Our approach can be seen as a particularmatheuristic algorithm, see Maniezzo et al. [1], because itintegrates heuristic search techniques (for the routing) with mathematicalmo<strong>de</strong>ling and exact techniques (for the loading).The remaining of the paper is structured as follows. In Section 2we formally <strong>de</strong>scribe the problem and briefly review the relevantliterature. In Section 3 we present the approach we <strong>de</strong>veloped, andin Section 4 we finally present some preliminary computationalresults.2. PROBLEM DESCRIPTION AND LITERATUREREVIEWFigure 1: An example of an auto-carrier with four loading planes,carrying nine vehicles.In the following we use the term vehicle to <strong>de</strong>note a transporteditem (e.g., a car, a truck, a van), and the term auto-carrier to <strong>de</strong>noteALIO-EURO <strong>2011</strong> – 81


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>a truck that transports vehicles. We are given an heterogeneousfleet of auto-carriers. More in <strong>de</strong>tails, we are given T auto-carriertypes. Each auto-carrier type t has a maximum weight capacityW t and is formed by P t loading planes. There are K t auto-carriersavailable for each type t (t = 1,...,T ).We are also given a complete graph G = (N,E), where N = {0,1,...,n} is the set of vertices and E the set of edges connecting eachvertex pair. Vertex 0 corresponds to the <strong>de</strong>pot, whereas vertices{1,...,n} correspond to the n <strong>de</strong>alers to be served. The edge connecting<strong>de</strong>alers i and j is <strong>de</strong>noted by (i, j) and has an associatedrouting cost c i j (i, j = 0,...,n). The cost matrix is symmetric andsatisfies the triangular inequality.The <strong>de</strong>mand of <strong>de</strong>aler i consists of a set of m i vehicles. Each vehiclek <strong>de</strong>man<strong>de</strong>d by <strong>de</strong>aler i has weight w ik (i = 1,...,n; k =1,...,m i ), and a particular two-dimensional shape, whose <strong>de</strong>tailswill be discussed in Section 3.1. The <strong>de</strong>mand of a <strong>de</strong>aler has tobe completely fulfilled. This can be done by using one or moreauto-carriers (i.e., split <strong>de</strong>liveries are allowed). Let M <strong>de</strong>note thetotal number of vehicles to be transported.We <strong>de</strong>note a route by the triplet (S,τ,φ), where S ⊆ {1,...,M} isa set of vehicles to be transported, τ is an auto-carrier type, andφ : S → N is a function that gives the or<strong>de</strong>r in which a vehicle is<strong>de</strong>livered along the route. In particular all vehicles k <strong>de</strong>man<strong>de</strong>d bythe first <strong>de</strong>aler visited in the route have φ(k) = 1, those <strong>de</strong>man<strong>de</strong>dby the second <strong>de</strong>aler visited in the route have φ(k) = 2, and so on(k = 1,...,|S|). A route (S,τ,φ) is said to be load-feasible if(i) the sum of the weights of the vehicles in S does not exceedthe weight capacity of auto-carrier τ;(ii) there exists a feasible loading of the vehicles in S on the P τplatforms of auto-carrier τ;(iii) when visiting the <strong>de</strong>aler in position ˜ι in the route, all vehiclesk having φ(k) = ˜ι can be downloa<strong>de</strong>d directly from theauto-carrier, without moving vehicles directed to <strong>de</strong>alers tobe visited later on along the route.Checking Condition (i) is easy, whereas checking Conditions (ii)and (iii) involves the solution of a complex two-dimensional nonconvexloading problem, whose <strong>de</strong>tails are shown in Section 3.1.The Auto-Carrier Transportation Problem (A-CTP) calls for the<strong>de</strong>termination of a set of routes such that each route is load-feasible,the <strong>de</strong>mands of the <strong>de</strong>alers are completely fulfilled and the totalcost is minimum.The (A-CTP) belongs to the class of integrated loading and routingproblems. It can be seen as a (particularly difficult) variant ofthe Capacitated Vehicle Routing Problem with Two-dimensionalLoading Constraints (2L-CVRP), see Iori et al. [2]. In the 2L-CVRP the <strong>de</strong>mands are sets of two-dimensional rectangular itemsand have to be loa<strong>de</strong>d into two-dimensional rectangular loadingspaces. Apart from the A-CTP, other variants of the 2L-CVRPthat mo<strong>de</strong>l real-world distribution problems have been studied byGendreau et al. [3] (furniture distribution) and Doerner et al. [4](timber distribution). We refer the rea<strong>de</strong>r to Iori and Martello [5]for a recent survey on routing problems involving loading constraints.For what concerns vehicle routing in general, we refer tothe books by Toth and Vigo [6] and Gol<strong>de</strong>n et al. [7]. The latteralso contains a comprehensive survey (Archetti and Speranza [8])on routing problems involving split <strong>de</strong>liveries.Other auto-carrier problems have been addressed in the literature.Agbegha et al. [9] focused their attention on the loading problem,and mo<strong>de</strong>led it by dividing the auto-carrier into slots and assigningvehicles to slots. Incompatibilities arise as some vehicles cannotbe assigned to adjacent slots. Ta<strong>de</strong>i et al. [10] studied a large autocarrierproblem by consi<strong>de</strong>ring both routing and loading aspects.They solved the loading problem by using the concept of equivalentlength (in practice the length occupied on a plane by a vehicleafter an possible rotation). They consi<strong>de</strong>red the case of <strong>de</strong>liveriesoccurring in multiple <strong>da</strong>ys and solved it through a heuristic basedon an integer programming formulation.3. A SOLUTION APPROACHWe <strong>de</strong>veloped simple heuristic algorithms based on classical strategiesfor the capacitated vehicle routing problem. We start with arandomized closest neighbor heuristic. We initialize a route by selectinga random vehicle among those to be <strong>de</strong>livered and a randomauto-carrier among the available ones. We then extend the route byselecting the vehicle to be <strong>de</strong>livered whose <strong>de</strong>aler is closest to thatof the last loa<strong>de</strong>d vehicle. At any iteration we invoke the algorithmto be <strong>de</strong>scribed below in Section 3.1 to check the feasibility of theloading. We continue extending the current route as long as theloading remains feasible. We then re-iterate by initializing a newroute, until all vehicles are loa<strong>de</strong>d.The solution obtained by the above heuristic is optimized by usingthree simple local search procedures. The first one, <strong>de</strong>noted move1-0, attempts to move all the vehicles assigned to a <strong>de</strong>aler in oneroute to another route. If the loading is feasible and the total costof the involved routes is reduced, then the move is performed. Thelocal search re-iterates, in a first-improvement fashion, until nofurther cost reduction is possible. The two other local search algorithmsoperate in a similar manner but have larger complexities.Local search swap 1-1, resp. swap 2-1, attempts to exchange allthe vehicles <strong>de</strong>man<strong>de</strong>d by a <strong>de</strong>aler, resp. two <strong>de</strong>alers, in one routewith all the vehicles <strong>de</strong>man<strong>de</strong>d by another <strong>de</strong>aler in another route.Also the local search procedures invoke the algorithm of Section3.1 whenever they need to check the feasibility of a loading.3.1. Solution of the loading problemIn this section we present an algorithm to <strong>de</strong>termine if a given route(S,τ,φ) is load-feasible or not. As mentioned before, the exact solutionof the two-dimensional non-convex loading problem is NPhar<strong>da</strong>nd particularly complex in practice. Hence we content uswith an approximate mo<strong>de</strong>l of such problem. The reliability of theapproximate mo<strong>de</strong>ling was tested together with the logistic company,by using their historical <strong>de</strong>livery <strong>da</strong>tabase. Out of 20,335auto-carrier loadings performed by the company (hence feasible),the mo<strong>de</strong>l reported the correct answer for 20,210 cases, proving tobe 99% accurate. Similar results were obtained for loadings thatwere known to be infeasible. In the following we <strong>de</strong>note homogeneousa loading that involves i<strong>de</strong>ntical vehicles, and heterogeneousone that involves different vehicles.The first easy check that our algorithm performs is based on thevehicles weights: if their sum is greater than the auto-carrier capacity,then the load is infeasible. Otherwise we perform a secondquick check. For each type of vehicle and auto-carrier, the logisticcompany provi<strong>de</strong>d us what they <strong>de</strong>fine the load-in<strong>de</strong>x, i.e.,the maximum number of such vehicles that can be loa<strong>de</strong>d on suchauto-carrier. For example, the load-in<strong>de</strong>x is nine for the vehicleand auto-carrier <strong>de</strong>picted in Figure 1. We use d ikτ to <strong>de</strong>note theload-in<strong>de</strong>x, i.e., d ikτ stands for the maximum number of vehicleshaving the same shape of vehicle k <strong>de</strong>man<strong>de</strong>d by <strong>de</strong>aler i that canbe loa<strong>de</strong>d into auto-carrier τ.Let i(k) <strong>de</strong>note the <strong>de</strong>aler <strong>de</strong>manding vehicle k. We compute ˜d =∑ k∈S 1/d i(k)kτ and consi<strong>de</strong>r feasible a loading having ˜d ≤ 1. Notethat the load-in<strong>de</strong>x is a very approximate information and heterogeneousloadings may be feasible also when ˜d > 1. For this reason,whenever 1 < ˜d ≤ 1.2 and the loading is heterogeneous we invokean integer linear program (ILP) to <strong>de</strong>termine the feasibility. Weconsi<strong>de</strong>r infeasible homogeneous loadings with ˜d > 1 and hetero-ALIO-EURO <strong>2011</strong> – 82


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>geneous loadings with ˜d > 1.2.To <strong>de</strong>scribe the ILP we need some quite tedious but necessary notation.Each loading plane p of auto-carrier τ has length L pτ an<strong>da</strong> possible maximum extension A pτ . Given a plane p, let h(p) <strong>de</strong>notethe plane placed si<strong>de</strong> by si<strong>de</strong> horizontally with p, if any (forexample the two lower planes in Figure 1). The total extension ofplanes p and h(p) is limited to be at most à ph(p)τ . A vehicle has acertain length, and, whenever loa<strong>de</strong>d on a plane, can be rotated bya certain <strong>de</strong>gree. We <strong>de</strong>note l kpτ the equivalent length that takesvehicle k when loa<strong>de</strong>d on plane p of auto-carrier τ.Similarly to what done for h(p), let us <strong>de</strong>note v(p) the plane placedvertically above/below plane p, if any (for example the upper andlower planes of the trailer <strong>de</strong>picted in Figure 1). A vehicle beingparticularly high when loa<strong>de</strong>d on p may have a si<strong>de</strong> effect onv(p). For example we might be forced to lower down completelyan upper plane, hence using also completely the length of the lowerplane below, or we might be forced to rotate consistently the upperplane, losing in this way a portion of the lower plane length.To express this constraint we <strong>de</strong>fine λ kv(p)τ the equivalent lengthon plane p used by vehicle k when loa<strong>de</strong>d on plane v(p) of autocarrierτ.We finally <strong>de</strong>fine a prece<strong>de</strong>nce matrix among planes: let b pq takevalue one if loading a vehicle on plane p forbids unloading a vehicleloa<strong>de</strong>d on plane q, 0 otherwise. When b pq = 1 we say thatp prece<strong>de</strong>s q. For example, the right lower plane of Figure 1 prece<strong>de</strong>sall other planes, whereas the right upper plane prece<strong>de</strong>s onlythe left upper plane.To mo<strong>de</strong>l the loading problem as an ILP we <strong>de</strong>fine x kp = 1 if vehiclek is assigned to plane p, 0 otherwise, for k ∈ S, p = 1,...,P τ .We also <strong>de</strong>fine a p = length extension of plane p, for p = 1,...,P τ .We obtain:P τ∑ x kp = 1 k ∈ S (1)p=1∑(l kpτ x kp + λ kv(p)τ x kv(p) ) ≤ L pτ + a p p = 1,...,P τ (2)k∈Sx kp + x lq ≤ 1 p,q = 1,...,P τ : b pq = 1;k,l ∈ S : φ(k) > φ(l) (3)a p + a h(p) ≤ à ph(p)τ p = 1,...,P τ (4)0 ≤ a p ≤ A p,τ p = 1,...,P τ (5)x kp ∈ {0,1} p = 1,...,P τ ;k ∈ S (6)Constraints (1) impose that each vehicle is loa<strong>de</strong>d on a plane. Constraints(2) mo<strong>de</strong>l the maximum length of a plane, but also takinginto account vertical effects. Constraints (3) impose the LIFO policy.Note that we suppose that vehicles having different or<strong>de</strong>r ofvisit and being assigned to the same plane can be loa<strong>de</strong>d in sucha way that the LIFO policy be satisfied. Constraints (4) mo<strong>de</strong>lthe limit on the maximum extension of two planes placed si<strong>de</strong> bysi<strong>de</strong>, and constraint (5) give the appropriate range to the planes extensions.If mo<strong>de</strong>l (1)–(6) produces a feasible solution, then weconsi<strong>de</strong>r the route load-feasible.We also <strong>de</strong>veloped an alternative strategy to the above mo<strong>de</strong>l basedon an enumeration tree. At each level of the tree we create a no<strong>de</strong>by loading any still unloa<strong>de</strong>d vehicle in any plane. For any planewe keep in memory the available residual lengths. For any <strong>de</strong>alerwe keep in memory both the length that still has to be loa<strong>de</strong>d,and the total residual available length in the auto-carrier that canbe used by this <strong>de</strong>aler. When loading a vehicle in a plane, i.e.,when creating a no<strong>de</strong>, we up<strong>da</strong>te all residual lengths by consi<strong>de</strong>ringLIFO policy, horizontal and vertical relations among platforms,if any, and maximum extensions. Whenever the residualavailable length exceeds the length that still has to be loa<strong>de</strong>d for a<strong>de</strong>aler, we fathom the no<strong>de</strong>. The tree is explored in a <strong>de</strong>pth-firstfashion. In Section 4 we compare the performance of this algorithm,<strong>de</strong>noted branch-and-bound, with that of the ILP mo<strong>de</strong>l.4. PRELIMINARY COMPUTATIONAL RESULTSWe co<strong>de</strong>d our algorithms in C++ and run them on a Pentium Dual-Core, with 2.70 Ghz and 1.96 GB RAM, running un<strong>de</strong>r WindowsXP. We tested the algorithms on instances <strong>de</strong>rived from the realworldproblem. We consi<strong>de</strong>red the <strong>da</strong>ily distributions operated bythe logistic company in the month of July 2009, obtaining in total23 instances, one for each working <strong>da</strong>y. We filled the cost matrixby computing the distances of the shortest paths, in kilometers,using a GIS-based software. The fleet we consi<strong>de</strong>r is ma<strong>de</strong> by twotypes of auto-carriers, one with two loading planes and the otherwith four.The results we obtained are reported in Table 1. In the left partof the table, columns n and M report, respectively, the number of<strong>de</strong>alers and the number of vehicles to be <strong>de</strong>livered. The smallestinstance has 96 <strong>de</strong>alers requests, for a total of 272 vehicles tobe <strong>de</strong>livered. The largest instance requires instead the <strong>de</strong>livery of1139 vehicles.We run our algorithms by making use of the two options that we<strong>de</strong>veloped for the solution of the loading problem (see Section3.1). The results that we obtained using the branch-and-bound arereported in the middle part of the table. For the starting heuristicalgorithm and for the following local search methods, we presentthe objective function value of the best solution obtained, in columnkm, and the CPU seconds required by the algorithm, in columnsec. The algorithms are run in sequence, starting from theclosest neighbor heuristic and ending with the Swap (2-1). Eachalgorithm starts from the best solution obtained by the previousone. In the overall columns we report the total CPU time requiredby all algorithms (sec tot ) and the time spent by the loading procedure(sec load ). Note that sec load is a portion of sec tot . The resultsthat we obtained using the mathematical mo<strong>de</strong>l are reported in theright part of the table. We only report, for comparison sake, sec totand sec load . The mo<strong>de</strong>l has been solved using Cplex 11.All algorithms using the branch-and-bound option are very fast.Their execution requires 1.5 seconds, on average, and about 7 secondsin the worst case. About 70% of the cpu time used by thealgorithms is spent in the execution of the loading procedure. Inthis case too, as in other routing and loading problems, the loadingproblem has a crucial effect on the overall problem. The threelocal search procedures are effective in reducing the number ofkilometers traveled. The percentage reduction in the number ofkilometers traveled is consistent for move 1-0 (3.11% with respectto the solution provi<strong>de</strong>d by the greedy) and for swap 1-1 (3.92%with respect to the solution provi<strong>de</strong>d by move 1-0), but less significativefor swap 2-1 (just 0.64% with respect to swap 1-1). Theuse of mo<strong>de</strong>l (1)–(6) instead of the branch-and-bound leads to aconsistent increase in the CPU times. The seconds <strong>de</strong>dicated tothe computation of the loadings raise from 1.06 to 15.32, on average.We can conclu<strong>de</strong> that the branch-and-bound is a more suitablesolution method for these instances.The results show that good savings on the number of kilometerstraveled can be obtained within limited computational effort. Onaverage we are able to reduce by 7.4% the number of kilometersthat were traveled in the routes carried out by the company in July2009. We believe further improvement is possible, and for futureresearch we intend to embed the above local search techniques,and maybe new ones, into a metaheuristic framework.ALIO-EURO <strong>2011</strong> – 83


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>branch-and-bound mo<strong>de</strong>l (1)–(6)instance greedy move (1-0) swap (1-1) swap (2-1) overall overall<strong>da</strong>y n M km sec km sec km sec km sec sec tot sec load sec tot sec load01-Jul 228 832 57,132 0.05 56,184 0.16 54,179 0.92 53,347 0.06 1.19 0.59 14.06 13.3502-Jul 221 1139 69,999 0.02 68,087 0.50 66,676 0.55 66,550 0.19 1.25 0.59 12.27 11.6503-Jul 195 737 46,463 0.03 44,540 0.64 43,160 0.28 43,002 0.08 1.03 0.75 7.95 7.5506-Jul 243 1063 69,135 0.05 65,565 0.47 61,262 1.30 60,968 0.17 1.98 0.94 25.95 24.5807-Jul 165 629 33,469 0.02 31,362 0.14 30,249 0.30 30,179 0.05 0.50 0.28 7.86 7.5508-Jul 206 810 52,028 0.05 48,444 0.38 46,417 0.98 46,066 0.13 1.53 0.98 19.91 19.3309-Jul 200 941 57,682 0.05 56,522 0.77 54,866 1.80 54,538 0.42 3.03 2.57 29.20 28.5310-Jul 199 803 47,632 0.03 45,187 0.69 44,097 0.25 43,884 0.08 1.05 0.80 10.42 10.0813-Jul 244 1030 63,989 0.03 62,724 0.72 60,075 1.44 59,906 0.09 2.28 1.30 34.34 33.2414-Jul 227 826 48,729 0.03 48,281 0.20 46,729 1.26 46,649 0.11 1.61 0.75 20.92 20.2215-Jul 211 729 53,214 0.03 51,464 1.75 48,830 0.56 47,689 0.22 2.56 2.05 22.11 21.5216-Jul 206 833 51,402 0.06 50,068 0.28 47,426 1.17 46,988 0.09 1.61 1.16 18.89 18.2317-Jul 200 801 52,972 0.14 51,517 0.36 48,993 0.36 48,873 0.11 0.97 0.72 6.27 5.9220-Jul 198 707 37,734 0.03 36,862 0.41 36,195 0.48 35,939 0.08 1.00 0.58 16.28 15.9421-Jul 209 940 69,137 0.14 68,084 4.78 65,110 1.86 64,906 0.14 6.92 6.07 18.94 17.8022-Jul 189 614 41,558 0.05 40,661 0.26 39,424 0.39 39,324 0.02 0.72 0.41 7.33 6.9723-Jul 251 875 58,995 0.02 56,465 0.41 54,628 2.06 54,526 0.13 2.61 1.91 34.37 33.3024-Jul 198 811 50,619 0.05 49,699 0.24 47,946 0.51 47,651 0.08 0.88 0.31 10.00 9.6527-Jul 162 552 28,910


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>A New MIP Heuristic Based on Randomized Neighborhood SearchDavi<strong>de</strong> Anghinolfi ∗Massimo Paolucci ∗∗ Department of Communication, Computer and Systems SciencesVia Opera Pia 13, Genova{anghinolfi, paolucci}@dist.unige.itABSTRACTA new simple MIP heuristic, called Randomized NeighborhoodSearch (RANS) is proposed, whose purpose is to produce withinshort time bounds high quality solutions especially for large sizeMIP problems as the ones characterizing real industrial applications.Starting from a feasible incumbent solution, RANS exploresa neighborhood randomly <strong>de</strong>fined by calling a MIP solver as ablack box tool. RANS rationale is similar to the one of other MIPheuristics recently appeared in literature but, differently, it exploitsonly a randomization mechanism to gui<strong>de</strong> the MIP solver. RANShas some self-tuning rules so that it needs as single input parameterthe maximum computation time. This paper also presents a procedurefor generating a first feasible solution based on the samerandomization concepts, that can be used as an initialization alternativefor particularly hard instances. RANS effectiveness isshown by an experimental comparison with other MIP heuristics.Keywords: Mixed Integer Programming, MIP heuristics, Neighborhoodsearch1. INTRODUCTIONMixed integer programming (MIP) is a flexible method for mo<strong>de</strong>lingcomplex optimization problems, as the ones emerging frommany application contexts. A general MIP mo<strong>de</strong>l (P) can be <strong>de</strong>fine<strong>da</strong>s finding z = min{ f (x) : Ax = b, x ∈ S}, i.e., minimizing alinear objective function f : S → R subject to a set of linear constraints,where the set of <strong>de</strong>cision variables is partitioned in generalas S = B∪I ∪C, being B, I and C respectively the sets of binary, integerand real variables. In addition, let <strong>de</strong>note G the set of generalinteger variables, i.e. G = B ∪ I.MIP belongs to the class of NP-hard problems and many researchand practical MIP problems are still very difficult to solve. Therefore,complex combinatorial optimization problems from both aca<strong>de</strong>micresearch and real world applications have been tackled byspecialized heuristics or metaheuristics. However, recently, a numberof approaches, called matheuristics, have been proposed tomelt or to associate i<strong>de</strong>as from metaheuristics with MIP solver algorithms(e.g., [1, 2, 3, 4, 5]).In this paper a new simple but effective heuristic approach is proposed,which is able to face complex MIP problems exploiting aMIP solver for finding the solution to a sequence of smaller subproblems.The method, called RAndomized Neighborhood Search(RANS), iteratively performs local search steps seeking for an improvedincumbent solution by calling a MIP solver as a black boxexploring <strong>de</strong>vice. RANS adopts concepts similar to the IteratedGreedy (IG) algorithm proposed in [6] for scheduling problems:IG is a simple algorithm which starts from a feasible incumbentsolution and iterates a <strong>de</strong>struction step followed by a constructionstep in or<strong>de</strong>r to seek for an improved solution. RANS hasa self-tuning mechanism to settle the dimension of the MIP subproblems,so that they should be neither too much trivial nor hardto solve. Experimental tests show that this very simple randomstrategy that uses only hard fixing is quite effective in tackling verytough problems, in particular being able to provi<strong>de</strong> quite good results(i.e., with a reduced gap from the best known solution) inshort computation times.This paper also presents a heuristic method, called RElaxed RAndomizedNeighborhood Search (RERANS), to find an initial feasiblesolution for MIP problems that exploits randomization similarlyto RANS. The i<strong>de</strong>a is to progressively build the solution solvinga sequence of partially relaxed MIP problems where only asubset of randomly chosen variables from G are left integer constrained,whereas the remaining ones are continuous relaxed. Actually,since RERANS needs solving a series of sub-problems, thismethod is not competitive with respect to other state-of-art generalpurpose algorithms for fast generating an initial solution, asfor example the Feasibility Pump (FP) [7]; however, it may bespecifically useful whenever MIP solvers or other initialization approachesneed a very large time to succeed.2. LITERATURE REVIEWMIP heuristic methods recently appeared in literature are LocalBranching (LB) [1], Relaxation Induced Neighborhood Search(RINS) [2], Evolutionary Algorithm for Polishing (Polishing) [3]and Variable Neighborhood Decomposition Search (VNDS) [4].These methods generally inclu<strong>de</strong> a high level component guidingthe solution space exploration through the <strong>de</strong>finition of neighborhoodsof the incumbent solution, and a low level component responsibleof the local search (LS), consisting of the <strong>de</strong>finition ofa MIP sub-problem solved by a MIP solver called as a black boxmodule. All the methods need an initial feasible incumbent solution,usually provi<strong>de</strong>d as the first feasible solution produced by theMIP solver, and adopt as termination condition the maximum timelimit.LB, originally proposed in [1], is a strategic external branchingframework exploiting a MIP solver as black box tactical solutiontool. LB was applied to mixed 0-1 integer programming, and suggestionsabout its extension to general MIP are provi<strong>de</strong>d in [2].The method performs soft variable fixing by means of the so-calledlocal branching constraints that impose a bound k (the neighborhoodradius) on the maximum Hamming distance of the binaryvariables from the incumbent x c , so <strong>de</strong>fining the neighborhood ofx c . Whenever the MIP solver improves the incumbent, the localbranching constraint is reversed and the neighborhood of thenew incumbent is explored. The method, which is exact in principle,is practically transformed in a LS heuristic having impose<strong>da</strong> time limit for the execution of MIP solver; it starts with a givenvalue for the maximum allowed distance k and it both reduces itwhenever the MIP solver does not improve the incumbent and increasesit during a diversification step. LB was successively reimplementedin [2] as a heuristic to improve the incumbent thatis called within the stan<strong>da</strong>rd branching exploration framework of aALIO-EURO <strong>2011</strong> – 85


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>MIP solver whenever a new incumbent is found. The authors in [2]show that the proposed reimplementation outperforms the originalmethod.RINS [2] is a heuristic to seek for improved incumbent that iscalled at no<strong>de</strong>s of a stan<strong>da</strong>rd branching scheme. The method <strong>de</strong>finesthe neighborhood to be explored by performing a set of hardvariables fixing, in particular fixing the integer and binary variablesthat have the same values in the incumbent x c and in x r ,which is the solution of the linear relaxation of the consi<strong>de</strong>redno<strong>de</strong>. When invoked at a no<strong>de</strong> of the branching scheme, RINSdoes not consi<strong>de</strong>r any branching cuts introduced but only globalbounds and cuts. An advantage of RINS is its simplicity: it isembed<strong>de</strong>d in MIP solvers so that diversification is implicitly provi<strong>de</strong>dby stan<strong>da</strong>rd branching; it has no distinction between generalinteger and binary variables; it has no control on the neighborhooddimension. Therefore, being RINS potentially very time consuming,a frequency parameter is used to limit number of no<strong>de</strong>s wherethe method is called.Polishing [3] is a solution improving heuristic that, similarly toRINS, is called at no<strong>de</strong>s of the MIP solver branch-and-cut explorationtree, but it operates exploiting evolutionary algorithm concepts.Polishing maintains a fixed size population of the best P solutionsfound so far and when invoked it first generates M mutatedsolutions and then it performs C solution combinations. Mutationis used to increase both the diversity and the number of the solutionsin the population: it is performed first randomly selectinga seed solution and then solving a MIP sub-problem having hardfixed a subset of randomly selected integer variables to the seedvalues. The fraction of variables to be fixed is initialized to 50% ofthe total number of variables and successively a<strong>da</strong>pted (increasedby 20% if the MIP sub-problem has no solution or no improvementis found; <strong>de</strong>creased by 25% if only the seed solution is found; unchangedif a new incumbent is found). Combination is performe<strong>de</strong>xtending the hard fixing mechanism of RINS: two solutions (orall the solutions) are selected from the population as parents, and aMIP sub-problem is solved having hard fixed the variables whosevalues agree in the parents. The new solution found is ad<strong>de</strong>d tothe population if better than the worst solution currently inclu<strong>de</strong>d.Similarly to RINS, a no<strong>de</strong> limit L is imposed for sub-problem solution.Other algorithm parameters are the population dimensionP, the number M of mutations and the number C of combinationperformed.VNDS is a method very recently introduced in [4] that can be consi<strong>de</strong>re<strong>da</strong>n evolution of Variable Neighborhood Search Branching(VNSB) [8]. Both algorithms differ from the LB and RINS approachesas they do not adopt a branching scheme as high levelcomponent but a Variable Neighborhood Descent (VND) searchstrategy which performs a local search by changing the neighborhoodstructure to avoid to be trapped in local optima. VNDS is atwo-level VND scheme. At first level the absolute distances betweenincumbent and linear relaxation solution components, δ j =∣∣x c j − xr j∣ for j∈B (only binary variables were consi<strong>de</strong>red in [4]),are computed and sorted in not <strong>de</strong>creasing way. Then, at secondlevel, the k variables with smaller δ j are fixed and the consequentsub-problem is solved by a MIP solver. If this improves the incumbent,a VND-MIP step is started, otherwise k is reduced andthe process is iterated. The VND-MIP implements a VND whereneighborhoods are obtained by LB constraints whose r.h.s. is increasedwhen no improvement is found. VNDS adopts a mix ofhard and soft fixing and needs to set a wi<strong>de</strong> number of parameters.Therefore, the method appears more complicated than theones above outlined also for the need of an appropriate parametertuning.3. THE RANS HEURISTICThe RANS heuristic is a simple iterative search strategy that operatessimilarly to an iterated local search. The RANS algorithmstarts from a first feasible solution x c for the original MIP problem(P) and iterates the following main steps until the maximum timelimit is reached:1. Solution <strong>de</strong>struction. A subset F ⊆ G of binary and integervariables is randomly selected such that |F| = k, where kis a parameter initialized as k = 0.1 · |G| and automaticallytuned at each iteration. Then, a partially fixed MIP subproblem(S) is <strong>de</strong>fined, having fixed the variables x j = x c jfor j ∈ G \ F to their value in the incumbent solution.2. Solution construction (local search). Sub-problem (S) issolved by calling a MIP solver with the current upper boundf (x c ) and the maximum allowed time for solving sub-problemst mip as input parameters. Also the parameter t mip isautomatically <strong>de</strong>termined by the algorithm as a function ofthe time nee<strong>de</strong>d to solve the linear relaxation of the originalproblem (P). If a new best solution is found, the incumbentfor the next iteration is up<strong>da</strong>ted.3. Parameter adjustment and iteration. If the sub-problem (S)is optimally solved within the available time, then k is increase<strong>da</strong>s k = k · 1.1; otherwise k is reduced as k = k · 0.9and a new iteration is started. In this simple way the algorithmadjusts the parameter k, which controls the dimensionof the explored neighborhood (that is the number of binary/integervariables in (S)), <strong>de</strong>pending on the experienceddifficulty in solving sub-problems. Hence, the choice of theinitial value of k is not critical.It must be noted that, differently from RINS and LB, the proposedmethod does not operate within any branching framework, but athigher level can be viewed as an iterated LS. In fact, the solutionperturbation, that in iterated LS produces a new starting solution,here consists in the <strong>de</strong>finition of a partial solution obtained froma random <strong>de</strong>struction; then the LS, that here is the resolution of asub-problem, re-constructs a complete solution. RANS neighborhood<strong>de</strong>finition is based only on hard fixing. The neighborhood ofthe incumbent is randomly <strong>de</strong>fined and its dimension is controlledby k so that the exploration is terminated in reasonable short time.The maximum time for solving sub-problems t mip is <strong>de</strong>termined(in seconds) as max{T min ,3 ·t rel }, where t rel is the time nee<strong>de</strong>d tosolve the linear relaxation of (P) and T min is the minimum time allottedto the MIP solver, which can be fixed once for all taking intoaccount of the performances of the used computer and MIP solver.Actually the choice of T min is not critical due to the self-tuningmechanism used for parameter k: anyway T min should be chosenin or<strong>de</strong>r to let the MIP solver a sensible minimum time for exploringthe branching tree also for problems whose linear relaxationis solved in few seconds. Note that setting a maximum time limitt mip for solving sub-problems is not critical also in case of hugeinstances, because the auto-tuning of parameter k always allowsreducing the neighborhood size so that sub-problems can be optimallysolved. After few tests it was fixed T min = 30s taking intoaccount the behavior of Cplex solver on some “easy” instances.Note that the self-tuning of k controlling the sub-problem difficultyis similar to the a<strong>da</strong>ptation of the fraction of variables to behard fixed in Polishing mutation.Besi<strong>de</strong>s the basic behavior <strong>de</strong>scribed in the above three main steps,a differentiation mechanism is introduced in RANS to reduce therisk of stagnation, that is to remain blocked in a local optimum.It must be observed that, when an incumbent solution is not improve<strong>da</strong>fter several iterations, an advantage of the random hardfixing is that it is quite unlikely cycling over the same sub-problems.However, this implicit differentiation may not always be sufficient.ALIO-EURO <strong>2011</strong> – 86


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Hence a simple mechanism is <strong>de</strong>vised based on maintaining a poolof solutions, corresponding to the set of last discovered incumbents,and to randomly backtrack to one of them whenever a maximumnumber of not improving iterations is reached. In particular,the last 10 incumbent solutions are recor<strong>de</strong>d in the pool and themaximum number of not improving iterations is fixed equal to 30(these latter values were chosen after a few tests). As for RINS in[2], it must be observed that the purpose of the proposed methodis to face very difficult MIP problems, finding good solutions incomputation times that are acceptable for real world applications.On the other hand, RANS may not be competitive on problemssolved without difficulty by stan<strong>da</strong>rd MIP solvers neither it can beused to prove optimality.3.1. The initialization methodThe RERANS is a method that can be activated to find an initialfeasible solution in the cases where the MIP solver or other initializationheuristics are not able to succeed within the allowed timelimit. The algorithm iterates the resolution of partially relaxed (R)problems <strong>de</strong>termined from (P) by linearly relaxing all the binaryand integer variables in G with the exclusion of a subset T ⊆ Gof variables that remain binary/integer constrained (T is initiallyempty). At each iteration, c variables randomly chosen among therelaxed ones are ad<strong>de</strong>d to T (c is initialized equal to 0.1 · |G|) andr binary/integer constrained variables in T are relaxed (r is initialize<strong>de</strong>qual to 0 and it is set to a positive value whenever the MIPsolver is not able to find a solution to a sub-problem). The MIPsolver is called to provi<strong>de</strong> within t mip the first feasible solution x 0for problem (R). If it succeeds, then a new partially relaxed problemis <strong>de</strong>fined: first, for each binary/integer constrained variableone <strong>de</strong>viational constraintx j − δ + j + δ − j = x 0 j , j ∈ T (1)is ad<strong>de</strong>d (or possibly up<strong>da</strong>ted if already present in the relaxed problemsolved in the previous iteration), penalizing the <strong>de</strong>viationalvariables δ + j and δ − j in the objective function with a large penaltycost. Then, the value of c is up<strong>da</strong>ted as c = 1.2 · c and r is reset to0. When instead the MIP solver is not able to provi<strong>de</strong> a feasiblesolution for (R) in the given time limit, the algorithm performs arollback of the previous choices: the last c variables ad<strong>de</strong>d to Tare removed from T and the last r variables removed from T arereinserted in T. Then, the value of c is reduced as c = 0.8 · c andthe value of r is set equal to r = min{c,0.2(|T | − c)}, so that thenumber of removed variables is upper boun<strong>de</strong>d by the number ofvariables binary/integer constrained at the next iteration. The introductionof <strong>de</strong>viational constraints at an iteration h correspondsto soft fixing the variables that were in T at iteration h-1 so thatthey are driven towards the values of the feasible solution foun<strong>da</strong>t iteration h-1. Differently to hard fixing, this is a mechanism tomemorize the feasible integer values found at an iteration for variablesin T, without preventing the possibility that the same variablesassume different values in the feasible solution generated atthe next iteration (and consequently up<strong>da</strong>ting the <strong>de</strong>viational constraints).Similarly to RANS, parameter c is self-tuned in or<strong>de</strong>r toadjust the number of variables in T to control the difficulty (i.e.,the time nee<strong>de</strong>d) to solve the partially relaxed problems. Finally,we adopt in RERANS a random backtracking strategy that is activatedwhenever no feasible solution is found for a partially relaxedproblem within the given time limit. In these cases problem (R)is consi<strong>de</strong>red too difficult to solve and then a subset of r variablesare removed from T, i.e., are linearly relaxed. Since a well-knowndifficulty of backtracking in hard fixing is choosing the right variablesto unfix, also in this case we believe that a random choicecan be a simple and effective general purpose strategy.4. COMPUTATIONAL RESULTSThe performance of RANS was tested on a collection of 56 benchmarkinstances which inclu<strong>de</strong>s the ones referred to in [2] and in[4], plus other instances from MIPLIB [9] selected among theones optimally solved in more than one hour or still not optimallysolved by a commercial solver. The RANS algorithm wasimplemented in C++ and the tests were performed on a 2.4GHzIntel Core 2 Duo E6600 computer with 4GB RAM, using Cplex12.2 (configured to use only 1 thread) as general purpose MIPsolver. The co<strong>de</strong> of the implemented algorithm can be found athttp://www.discovery.dist.unige.it/rans.cpp.As the purpose is to evaluate the effectiveness of the comparedmethods in producing quality solutions within reasonable shorttime bounds (so verifying their suitability for industrial applications),a maximum time limit of one hour was fixed. RANS wascompared with Cplex and other four methods: LB, RINS, Polishingand VNDS. Similarly to [4] only pure methods were consi<strong>de</strong>red,in particular LB, RINS and Polishing implementations directlyincorporated within the Cplex branch-and-cut framework(note that for LB this choice corresponds to the re-implementationproposed in [2]). Therefore, the Cplex parameters were set in or<strong>de</strong>rto fix the no<strong>de</strong> limit for sub-MIPs to 1000 for LB and RINS, andthe RINS frequency to 100. These are the same settings adoptedin [2] and [4]. As Polishing is consi<strong>de</strong>red a more time-intensiveheuristic than the others, in Cplex it is not called throughout branchand cut like other heuristics but invoked only once after at leastone feasible solution is available. Therefore, the Cplex parameterswere set so that Polishing is invoked after the first feasible solutionis found, so imposing operational conditions similar to the ones ofRANS and leaving the Polishing evolutionary algorithm exploit atbest the available time. The original VNDS co<strong>de</strong>, kindly ma<strong>de</strong>available by Authors in [4], was used and two slightly differentconfigurations were tested. The first, labeled VNDS1, correspondsto the second one adopted in [4] (there <strong>de</strong>noted as “VNDS 2”), andimposes the maximum time for solving sub-problems (t sub ) and forthe VND-MIP procedure (t vnd ) as t sub = t vnd = 1200s. The secondconfiguration, labeled VNDS2, was instead characterized byt sub = t vnd = 300s.Being a randomized algorithms, 5 runs were executed for RANSand Polishing for each instance, then computing the average objectivevalue. Similarly to [2], the used performance in<strong>de</strong>x was theratio between the objective value obtained by the different methodsand the best known solution, when available, or the best resultobtained during these tests. Then, as in [2], the geometric mean(which is less sensitive to outliers) was adopted to perform an aggregateevaluation of the results. Note that for the sake of brevityonly aggregate results are here shown. The results were aggregate<strong>da</strong>ccording to the total number of binary and integer variables, as reportedin Table 1. From this table RANS appears the most effectivemethod for the Global group that inclu<strong>de</strong>s all the instances. Table1 highlights the aggregate results separating the instances of verysmall dimension from the others, and further subdividing this lattersubset into medium (from 100 to 10.000 binary/integer variables)and large size (more that 10.000 binary/integer variables). Apartfor the very small size instance group, in which a <strong>de</strong>pth branchingis required to find the optimal solution, the performances of RANSare always the best ones.The overall behavior of the compared methods is shown in Figure1 where is <strong>de</strong>picted the evolution of the geometric mean of objectiveratios averaged over the whole benchmark set. Again Figure1 highlights the effective behavior of RANS in finding good solutionswithin short time.Finally, note that only for 3 instances the Cplex solver was notable of finding the initial solution within the t mip bound. In thesecases the the starting solution was generated by the RERANS pro-ALIO-EURO <strong>2011</strong> – 87


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Num. int. var.(Num. inst.)RANSCplexRINSLBVNDS1VNDS2PolishingGlobal (56) 1.45 3.00 2.93 1.51 4.05 3.82 2.03100 (53) 1.17 2.84 2.83 1.32 3.92 3.66 1.24100-10000 (36) 1.15 1.21 1.19 1.19 1.61 1.57 1.21>10000 (17) 1.20 6.28 6.31 1.58 8.67 7.95 1.30Geometric mean of objective ratio2,221,81,61,41,2Table 1: Aggregated average results10 300 600 900 1200 1500 1800 2100 2400 2700 3000 3300 3600Time (sec.)RANSCPLEXLBRINSVNSD1VNSD2PolishingFigure 1: The evolutions of geometric means of objective ratios.cedure. In Table 2 the comparisons between RERANS with Cplexand Cplex with the incorporated FP for the three benchmark instances,i.e. momentum2 (m2), rdrplusc21 (rd21), and van, initializedby RERANS are reported. For this simple test a 3600stime limit was fixed and the algorithm was stopped when the firstfeasible solution is found. The table shows both time ratios (timefor first feasible solution/shortest time among the three methodsfor first feasible solution) and objective ratios for each instanceand method. It can be observed that the time performances ofRERANS for these challenging instances were quite good.Time ratioObjective ratiom2 rd21 van m2 rd21 vanCplex 7.358 1.000 11.98 1.000 1.000 11.39Cplex+FP - 2.425 8.000 - 1.094 11.39RERANS 1.000 1.096 1.000 1.046 1.027 1.000Table 2: RERANS performance results5. CONCLUSIONSThis paper proposes RANS, a new heuristic approach to find inreasonably short time high quality solutions to difficult MIP problems.Perhaps the most relevant advantage of RANS is in its conceptualsimplicity: the paper shows that the randomization strategyused in RANS is effective with respect to other methods, some ofthem quite complicated, as highlighted by the comparative experimentalcampaign performed on a benchmark ma<strong>de</strong> of wi<strong>de</strong>ly referencedinstances. Another advantage is that RANS does not nee<strong>da</strong>ny parameter setting or tuning apart from choosing the maximumavailable time; this feature is mainly due to the adopted parameterself-tuning mechanism that a<strong>da</strong>pts the neighborhood dimensionaccording to the experimented difficulty in solving the partiallyfixed MIP problems in the maximum time available.6. REFERENCES[1] M. Fischetti and A. Lodi, “Local branching,” MathematicalProgramming, vol. 98, no. 1, pp. 23–47, 2003.[2] E. Danna, E. Rothberg, and C. L. Pape, “Exploring relaxationinduced neighborhoods to improve MIP solutions,” MathematicalProgramming, vol. 102, no. 1, pp. 71–90, 2005.[3] E. Rothberg, “An evolutionary algorithm for polishing mixedinteger programming solutions,” INFORMS J. on Computing,vol. 19, pp. 534–541, 2007.[4] J. Lazić, S. Hanafi, N. Mla<strong>de</strong>nović, and D. Urošević, “Variableneighbourhood <strong>de</strong>composition search for 0-1 mixed integerprograms,” Computers & Operations Research, vol. 37,no. 6, pp. 1055 – 1067, 2010.[5] V. Maniezzo, T. Stützle, and S. Voß, Matheuristics: HybridizingMetaheuristics and Mathematical Programming.Springer Publishing Company, 2009, vol. 10.[6] R. Ruiz and T. Stützle, “A simple and effective iterated greedyalgorithm for the permutation flowshop scheduling problem,”European Journal of Operational Research, vol. 177, no. 3,pp. 2033–2049, 2007.[7] M. Fischetti, F. Glover, and A. Lodi, “The feasibility pump,”Mathematical Programming, vol. 104, pp. 91–104, 2005.[8] P. Hansen, N. Mla<strong>de</strong>nović, and D. Urošević, “Variable neighborhoodsearch and local branching,” Computers & OperationsResearch, vol. 33, no. 10, pp. 3034 – 3045, 2006.[9] A. Martin, T. Achterberg, T. Koch, and G. Gamrath, “Miplib2003,” 2010. [Online]. Available: http://miplib.zib.<strong>de</strong>/ALIO-EURO <strong>2011</strong> – 88


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Towards an Ant Colony Optimization algorithm for the Two-Stage KnapsackproblemStefanie Kosuch ∗∗ Institutionen för <strong>da</strong>tavetenskap (IDA)Linköpings Universitet, Swe<strong>de</strong>nstefanie.kosuch@liu.seABSTRACTWe propose an Ant-Colony-Optimization algorithm for the Two-Stage Knapsack problem (TSKP) with discretely distributedweights. Three heuristic utility measures are proposed and compared.We argue why for the proposed measures it is more efficientto place pheromone on arcs instead of vertices or edges of the completesearch graph. Numerical tests show that the algorithm is ableto find near optimal or even optimal solutions after a relativelysmall number of generated solutions.Keywords: Two-stage mo<strong>de</strong>l, Knapsack problem, Ant-Colony optimization,Meta-heuristic, Utility ratio1. INTRODUCTIONThe knapsack problem is a wi<strong>de</strong>ly studied combinatorial optimizationproblem. Special interest arises from numerous real life applicationsfor example in logistics, network optimization and scheduling.The basic problem consists in choosing a subset out of a givenset of items such that the total weight (or size) of the subset doesnot exceed a given limit (the capacity of the knapsack) and thetotal benefit of the subset is maximized. However, most real lifeproblems are non-<strong>de</strong>terministic in the sense that some of the parametersare not (exactly) known at the moment when the <strong>de</strong>cisionhas to be ma<strong>de</strong>. If randomness occurs in the capacity constraint,the main question that has to be answered is if a violation of thecapacity constraint (i.e. an overload) could be acceptable. If anoverload cannot be permitted in any case, the mo<strong>de</strong>l maker has twopossibilities: Either to force the feasible solutions of the resultingproblem to satisfy the capacity constraint in any case. This generallyleads to very conservative <strong>de</strong>cisions and the resulting problemmight even be infeasible or only have trivial feasible solutions. Orto allow for later corrective <strong>de</strong>cisions at, naturally, additional costs.This latter mo<strong>de</strong>l is called a multi-stage <strong>de</strong>cision mo<strong>de</strong>l in the literature(for an introduction to stochastic programming mo<strong>de</strong>ls seee.g. [1]).In this paper we allow the item weights to be random and studya two-stage variant of the knapsack problem, <strong>de</strong>noted T SKP inthe remain<strong>de</strong>r. We assume the weight vector to be discretely distributed,i.e. to only admit a finite number of realizations with nonzeroprobability. In fact, in [2] it has been shown that a stochasticcombinatorial optimization problem can, un<strong>de</strong>r some mild assumptions,be approximated to any <strong>de</strong>sired precision by replacingthe un<strong>de</strong>rlying distribution by a finite random sample.It is well known that in the case of finite weight distributions theT SKP can be equivalently reformulated as a <strong>de</strong>terministic linearprogramming problem with binary <strong>de</strong>cision variables (see e.g. [3]).However, the set of constraints and binary <strong>de</strong>cision variables inthe reformulation grows with both the number of items as wellas the number of scenarios. It is thus typically very large, oreven exponential in the number of items. Consequently, solvingthe <strong>de</strong>terministic equivalent reformulation of the T SKP to optimalityis only possible in very restricted cases. Instead, metaheuristicsshould be consi<strong>de</strong>red in or<strong>de</strong>r to obtain near optimalor even optimal solutions in shorter computing time. The aim ofthis paper is therefore to study some variants of an Ant-Colony-Optimization (ACO) algorithm for the T SKP (for an introductionto ACO-algorithms and stan<strong>da</strong>rd procedures see [4]).In the last <strong>de</strong>ca<strong>de</strong>, several metaheuristics for Stochastic CombinatorialOptimization and Integer Programming problems (in the following<strong>de</strong>noted SIP) have been presented. There are two aspectswhy metaheuristics are important tools to solve SIPs: the size ofSIPs (especially in the case of in<strong>de</strong>pen<strong>de</strong>ntly discretely distributedparameters or simply a high number of possible scenarios) and thequestion of how to evaluate the objective function. In fact, in mostcases evaluating the objective function of an SIP is NP-hard. Inother cases, no <strong>de</strong>terministic equivalent reformulation is knownand only approximate values can be obtained (e.g. using SampleAverage Approximation). Both difficulties can be tackled by applyingappropriate metaheuristics (see e.g. [5]).To the best of our knowledge, no special purpose metaheuristicfor the T SKP has yet been proposed. Our work is, however, inspiredby previous works on ACO-algorithms for the related MultiplyConstrained Knapsack problem MCKP (see e.g. [6],[7]). Wethink that an ACO-algorithm is a good choice to solve the T SKPdue to the possibility to effectively use utility measures. Moreover,ants are building (new) solutions without needing to evaluate theobjective function, which, in the case of the T SKP, is an NP-hardproblem itself. Thus, evaluation needs only to be done in or<strong>de</strong>r tocompare solutions.2. MATHEMATICAL FORMULATION, PROPERTIESAND AN APPLICATIONWe consi<strong>de</strong>r a stochastic knapsack problem of the following form:Given a knapsack with fix weight capacity c > 0 as well as a set ofn items. Each item has a weight that is not known in the first stagebut comes to be known before the second-stage <strong>de</strong>cision has to bema<strong>de</strong>. Therefore, we handle the weights as random variables an<strong>da</strong>ssume that the weight-vector χ ∈ R n is discretely distributed withK possible realizations (or scenarios) χ 1 ,..., χ K . The corresponding,non-zero probabilities are <strong>de</strong>noted p 1 ,..., p K . All weights areassumed to be strictly positive.In the first stage, items can be placed in the knapsack (first-stageitems). The corresponding first-stage <strong>de</strong>cision vector is x ∈ {0,1} n .Placing item i in the knapsack in the first stage results in a rewardr i > 0. At the beginning of the second stage, the weights of allitems are revealed. First-stage items can now be removed and additionalitems be ad<strong>de</strong>d (second-stage items) in or<strong>de</strong>r to make thecapacity constraint be respected and/or increase the total gain.If item i is removed, a penalty d i has to be paid that is naturallystrictly greater than the first-stage reward r i . The removal of itemi is mo<strong>de</strong>led by the <strong>de</strong>cision variable y − i that is set to 1 if theALIO-EURO <strong>2011</strong> – 89


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>item is removed and to 0 otherwise. Similarly, we assume thatthe second-stage reward for this item r i > 0 is strictly smaller thanits first-stage reward. If an item is ad<strong>de</strong>d in the second stage we setthe corresponding binary <strong>de</strong>cision variable y + i to 1. The resultingTwo-Stage Knapsack problem with discrete weight distributionscan be formulated as follows:Two-Stage Knapsack Problem with discretely distributed weights(T SKP)n Kmaxx∈{0,1} ∑ r i x i + ∑ p k Q(x, χ k ) (1)n i=1 k=1ns.t. Q(x, χ) = maxy + ,y − ∈{0,1} ∑ r i y + nn i − ∑ d i y − i (2)i=1 i=1s.t. y + i ≤ 1 − x i , ∀i = 1,...,n, (3)y − i ≤ x i , ∀i = 1,...,n, (4)n∑(x i + y + i − y − i )χ i ≤ c. (5)i=1The T SKP is a relatively complete recourse problem, i.e. for everyfeasible first-stage <strong>de</strong>cision there exists a feasible second-stage<strong>de</strong>cision. Moreover, given a first-stage <strong>de</strong>cision and a realizationof χ, solving the second-stage problem means solving a <strong>de</strong>terministicknapsack problem. Evaluating the objective function for agiven first-stage solution is thus NP-hard.As a simplified application consi<strong>de</strong>r an (online) travel agency thataims to fill the vacant beds (the <strong>de</strong>terministic capacity) of a hotelcomplex. Clients are travel groups whose exact number of travelers(the "weight" of the group) is still unknown at the moment the<strong>de</strong>cision which groups to accept has to be ma<strong>de</strong>. This randomnesscan for example be a result of later cancellations. In or<strong>de</strong>r to maximizethe final occupancy of the beds, the travel agent might allowan overbooking. If, in the end, the number of beds is not sufficient,one or more of the groups need to be relocated in neighboring hotelswhich leads to a loss of benefit. If beds are left unoccupied,last minute offers at reduced priced might be an option to fill thesevacancies. A simple recourse version of this problem with a set ofhotel sites has been previously consi<strong>de</strong>red in [8].3. THE ACO-METAHEURISTICIn the remain<strong>de</strong>r we use the following notations:• A : set of ants• t: "time", i.e. passed number of construction steps in currentiteration (t ≤ n)• S a (t): set of items chosen by ant a after time t• τ i (t): pheromone level on vertex/arc/edge i at time t• η i : utility ratio of item i• ν i : non-utility ratio of item i• ρ ∈ (0,1): global evaporation parameter• ρ loc ∈ (0,1): local evaporation parameter• p a i j (t): transition probability = probability for ant a to gofrom vertex i to vertex j at time tThe basic structure of the ACO-algorithm for the T SKP is givenin Algorithm 3.1. Its functioning is <strong>de</strong>tailed in the following subsection.The Transition of ants step consists of the transition of theants following the transition probabilities and the up<strong>da</strong>te of S a (t).IT ← 0while IT < IT MAX doIT ← IT + 1Initializationt ← 0while t < n and (∃a ∈ A : (n+1) ∉ S a (t − 1)) dot ← t + 1Compute transition probabilityTransition of antsLocal pheromone up<strong>da</strong>teend whileGlobal pheromone up<strong>da</strong>teend whilereturn Best found solutionAlgorithm 3.1: ACO-algorithm for the T SKP3.1. The Complete Search GraphOur search graph is based on the search graph proposed for theMCKP in [6], i.e. on a complete graph whose n vertices representthe n items. Note that the ants only construct the first-stage solution(solution vector x). In or<strong>de</strong>r to mo<strong>de</strong>l the randomness of thefirst item chosen by an ant, we add an additional vertex 0 to thecomplete graph that is connected to all the other n vertices, withp a i0 (t) = 0 for all a ∈ A and t > 0. Initially, all ants are placed onthis vertex. We <strong>de</strong>note this vertex as starting vertex.In the case of the MCKP one has a natural certificate of when anant has come to an end of its solution construction: when eitherall items have been chosen or when adding any of the remainingitems would lead to the violation of at least one of the constraints.As for the T SKP even adding all items in the first stage wouldyield a feasible solution, we add a termination vertex n + 1 whichis connected to all vertices, including the starting vertex.3.2. Pheromone trails and up<strong>da</strong>te procedureSeveral choices could be ma<strong>de</strong> for the way pheromone is laid bythe ants (see [7]). In the simplest setting, the search graph is nondirecte<strong>da</strong>nd pheromone is laid on vertices, i.e. items that areinclu<strong>de</strong>d in the best solutions found so far have a high level ofpheromone. In the second variant, pheromone is placed on edgesof the non-directed search graph, or, equivalently, pairs of items.In this setting the probability that an ant chooses a specific item attime t increases with the number of (good) previously found solutionsthat contain both this specific item as well as the item the anthas chosen at time t − 1. In the third variant the graph is assumedto be a complete directed graph and pheromone is laid on arcs, i.e.directed edges. Contrary to the two former settings, this setting notonly takes into account which items (or item pairs) had been ad<strong>de</strong>dto former good solutions, but also in which or<strong>de</strong>r. In the following,when talking of an element, this refers to either a vertex, edge orarc of the search graph.We use a local as well as a global up<strong>da</strong>te procedure (see e.g. [6]).The local up<strong>da</strong>te procedure is performed after every constructionstep. The pheromone level on the elements chosen during this stepby an ant is slightly reduced, in or<strong>de</strong>r to diversify the producedsolutions. For an element i the local up<strong>da</strong>te rule is as follows:τ i ← (1 − ρ loc ) · τ i + ρ loc τ min (6)ρ loc is the local evaporation parameter: The larger ρ loc , the higherthe evaporation and thus the higher the <strong>de</strong>crease of pheromone onthe chosen elements. τ min is a lower bound for the pheromonelevel.The global up<strong>da</strong>te procedure is done once all ants have constructedtheir solutions. The evaporation of pheromone on all arcs is theALIO-EURO <strong>2011</strong> – 90


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>first part of the global up<strong>da</strong>ting:τ i ← (1 − ρ) · τ i (7)where ρ is the global evaporation parameter.In the second part of the global up<strong>da</strong>te procedure only the bestfound solutions are consi<strong>de</strong>red and the pheromone level on thesesolutions is intensified. In our setting we intensify the pheromonelevel on an element if and only if the element has been chosen ineither the best solution found so far or in one of the λ best solutionsfound in the last iteration:τ i ← ρ (8)Note that the maximum pheromone level is 1. If due to the up<strong>da</strong>teprocedures the pheromone level on an element falls below a lowerbound τ 0 , it is set to τ 0 .In the case of pheromone on arcs we additionally diversify the solutionsby storing the best solution as a set of items. The pheromoneis then increased on all arcs that lead to one of these vertices.3.3. Heuristic utility measuresAn advantage of the T SKP compared to the MCKP is that we havea clearly <strong>de</strong>fined "relevance factor" for each knapsack constraint:the probability of the corresponding scenario (see [9] for more informationon utility measures for the MCKP). Our i<strong>de</strong>a is thus tocompute the overall utility ratio of an item as an average over theutility ratios of those scenarios where the item still fits the capacity.The problem is, however, that, once adding an item would leadto a violation of the capacity in one or more scenarios, <strong>de</strong>cidingwhether it is more profitable to remove an item and add the newone, or to discard the current item, is NP-hard. We overcome thisproblem by relying on the chosen utility measure: If the utilitymeasure is chosen wisely, one might get good solutions by alwaysdiscarding the current item (in the case of an overload).While in the case of the MCKP two factors have to be consi<strong>de</strong>red(reward and used capacity), there are 2 more factors that play a rolefor the utility of an item in the two-stage setting: the second-stagereward and the second-stage penalty. This makes the <strong>de</strong>finition ofa good utility measure much more complex.The utility measure for the termination vertex should <strong>de</strong>pend onthe penalty we would have to pay in the second stage if we ad<strong>da</strong>nother item or the reward we could gain in the second-stage if wedo not add any of the remaining items. We thus compute an additional"non-utility" ratio ν i for each item i. The utility ratio of thetermination vertex is then <strong>de</strong>fined as the minimum over these ratios:If for all items the non-utility ratio is high, termination mightbe the best choice.We propose three different choices for the (non-)utility ratios. Theseare calculated with respect to the set K of scenarios where the respectiveitem still fits in the knapsack.Simple measure: Here we <strong>de</strong>fine the utility of an item to be the"average" ratio of first-stage reward and weight.ηi S = ∑ p k r ik∈K χikNote that this measure is not the exact mean of the reward-weightratios over the scenarios where the item still fits as ∑ k∈K p k < 1is possible. The exact mean would be obtained by dividing ηiS by∑ k∈K p k . The utility ratios do thus also <strong>de</strong>pend on the probabilitythat item i still fits the capacity (given by ∑ k∈K p k ).We <strong>de</strong>fine two non-utility measures. For half of the ants the firstmeasure is applied and for the other half the second. The first nonutilityratio is <strong>de</strong>fined to be the "average" ratio of second-stagepenalty and weight over the instances where the item does not fitin the knapsack any more. Contrary to the utility ratios, these first(9)non-utility ratios increase with ∑ k∉K p k . The second non-utilityratio equals the reward we would gain on average in the secondstage if we do not add the item and assume that it can be ad<strong>de</strong>d inany scenario in the second stage.νi S = ∑ p k d ik∉K χikνi S K= ∑ p k r ik=1 χik(10)Difference Measure: We compare what we would gain by addingan item in the first and not the second stage (r i − r i ) with what wewould loose if we would have to remove the item in the secondstage (d i − r i ):η D i= ∑ p k r i − r ik∈K χikν D i= ∑ p k d i − r ik∉K χikRatio measure: Instead of differences we consi<strong>de</strong>r ratios:η R i= ∑ p k r i/r ik∈K χik3.4. Transition probabilitiesν R i= ∑ p k d i/r ik∉K χik(11)(12)In this study we only consi<strong>de</strong>r the most traditional way of computingthe transition probabilities from the pheromone level and utilityratio (see e.g. [4]): For a vertex v ∈ {1,...,n + 1}, the probabilitythat an ant a currently sitting on vertex u moves to v is compute<strong>da</strong>s follows:τi(u,v) α π(u,v,S a (t − 1),τ) =(t)ηβ v (S a (t − 1))∑ n w=1 τα i(u,w) (t)ηβ w (S a (t − 1))(13)Here α and β are two parameters that control the relative importanceof pheromone level and utility ratio and i(u,v) = v (vertexpheromone) or i(u,v) = (u,v) (arc or edge pheromone). In the firstiteration we only take the utility ratio into account. As a consequence,the pheromone level on the elements is initialized duringthe first global up<strong>da</strong>te procedure.4. SUMMARY OF THE OBSERVATIONS MADE DURINGTHE NUMERICAL TESTS4.1. Comparison of the 3 different variants to lay pheromonetrailsDuring our tests we observed that, when pheromone is placed onvertices (or edges), the ants had difficulties to reproduce the bestsolution found so far and to search in its local neighborhood (evenwith λ = 0). As a consequence, the solution value of the bestsolution produced during an iteration was mostly strictly smallerthan that of the the current best solution. This caused severe problemsfor the convergence of our ACO-algorithm. In contrast, whenpheromone is laid on arcs, the quality of the best solution producedduring one single iteration generally increased monotonically(however not strictly). These observations seem to be contradictoryto what has been observed in previous studies of ACOproblemsfor the MCKP (see [6]). It can, however, be explainedby the fact that our utility measure relies on the or<strong>de</strong>r in which theitems have been ad<strong>de</strong>d. More precisely, the set of items that arestill allowed to be chosen <strong>de</strong>pend heavily on the set of previouslyad<strong>de</strong>d items.4.2. Comparison of the 3 different utility measuresFor a representative comparison of the convergence behavior ofour ACO-algorithm using the three different measures see FigureALIO-EURO <strong>2011</strong> – 91


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>In or<strong>de</strong>r to evaluate the second-stage expectation for a given foundfirst-stage solution we solved the K second-stage knapsack problemsin<strong>de</strong>pen<strong>de</strong>ntly using an optimal knapsack algorithm from theliterature. If nee<strong>de</strong>d, the CPU-time could be <strong>de</strong>creased by insteadusing an FPTAS . By increasing the performance ratio of the use<strong>da</strong>pproximation algorithm during the iterations, convergence mightonce more be achieved.Last but not least, to fully evaluate the competitiveness of an ACOapproachto solve the T SKP a comparison with other metaheuristicsis clearly nee<strong>de</strong>d.6. REFERENCESFigure 1: Representative convergence behavior using differentutility measures1 (test with pheromone on arcs). Our numerical tests on the chosentest instances showed that the difference measure seems to bebetter suited than the two other measures: Using the differencemeasure our algorithm found the optimal solution in around 16%of the tests while the other two measures were only rarely (on someinstances never) able to produce optimal solutions. Concerning theruns where the optimal solution was not found the average (maximum)relative gap was of 0.03% (0.06%) for the difference measureversus 0.09% and 0.1% (0.18% and 0.19%) for the simple andratio measure. The differences in the solution qualities are on theone hand due to the initial iteration where the ants find much bettersolutions based on the difference measure heuristic than based onone of the other two heuristics. On the other hand, the algorithmconverges much faster to near optimal solutions in the former caseand the quality of the best solution produced per iteration never<strong>de</strong>creases even when the best found solution is already close to theoptimum.5. FUTURE WORKIn case of instances with a high number of scenarios samplingshould be consi<strong>de</strong>red. This means that at each iteration a set ofscenarios is sampled whose cardinality is smaller than K. By increasingthe sample size during the iterations convergence mightbe achieved. Moreover, one obtains a natural additional diversificationof the produced solutions (see [5] for more <strong>de</strong>tails).[1] A. Shapiro, D. Dentcheva, and A. Ruszczynski, “Lectures onstochastic programming: Mo<strong>de</strong>ling and theory,” in MPS/SIAMSeries on Optimization. SIAM-Society for Industrial andApplied Mathematics, 2009, vol. 9.[2] A. J. Kleywegt, A. Shapiro, and T. Homem-<strong>de</strong>-Mello, “Thesample average approximation method for stochastic discreteoptimization,” SIAM Journal on Optimization, vol. 12, no. 2,pp. 479–502, 2002.[3] A. A. Gaivoronski, A. Lisser, R. Lopez, and X. Hu, “Knapsackproblem with probability constraints,” Journal of GlobalOptimization (Online First), 2010.[4] V. Maniezzo, L. M. Gambar<strong>de</strong>lla, and F. <strong>de</strong> Luigi, Ant ColonyOptimization. Springer Berlin / Hei<strong>de</strong>lberg, 2004, ch. 5, pp.101–117.[5] L. Bianchi, M. Dorigo, L. M. Gambar<strong>de</strong>lla, and W. J. Gutjahr,“A survey on metaheuristics for stochastic combinatorialoptimization,” Natural Computing: an international journal,vol. 8, pp. 239–287, 2009.[6] S. Fi<strong>da</strong>nova, “Ant colony optimization for multiple knapsackproblem and mo<strong>de</strong>l bias,” in Numerical Analysis and Its Applications,ser. Lecture Notes in Computer Science. SpringerBerlin / Hei<strong>de</strong>lberg, 2005, vol. 3401, pp. 280–287.[7] L. Ke, Z. Feng, Z. Ren, and X. Wei, “An ant colony optimizationapproach for the multidimensional knapsack problem,”Journal of Heuristics, vol. 16, pp. 65–83, 2010.[8] T. Benoist, E. Bourreau, and B. Rottembourg, “Towardsstochastic constraint programming: A study of online multichoiceknapsack with <strong>de</strong>adlines,” in <strong>Proceedings</strong> of the CP’01. Springer London, 2001, pp. 61–76.[9] H. Kellerer, U. Pferschy, and D. Pisinger, Knapsack Problems.Springer Berlin / Hei<strong>de</strong>lberg, 2004.ALIO-EURO <strong>2011</strong> – 92


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Optimal Parts Allocation for Structural Systems via Improved Initial SolutionGenerationYang Zhang ∗Horst Baier ∗∗ Institute of Lightweight Structures, TU MünchenMünchen, Germany{zhang, baier}@llb.mw.tum.<strong>de</strong>ABSTRACTIn a mechanical structure, it is often the case that many of the partsare nominally i<strong>de</strong>ntical. But actually they always differ slightlyin physical and geometrical properties due to variation of materialand manufacturing error. Parts allocation for a structural systemaims at optimizing performance of the manufactured structure byassigning each of these parts to a proper position in the structureduring the assembling period. In this paper, the parts allocationproblem is addressed and the formulation of it as a nonlinear assignmentproblem (NAP) is presented. A method is <strong>de</strong>veloped togenerate an initial solution for it. The technique is tested on benchmarkexamples. All the results show that it could always constructa high quality starting point from both view of objective and constraintviolation. Compared to starting with the i<strong>de</strong>ntity permutationand randomly generated ones, the stan<strong>da</strong>rd 2-exchange localsearch algorithm starting with initial solutions generated by thismethod well solves most of the test problems in the meantime witha large reduction in total number of function evaluations.Keywords: Initial solution, Nonlinear assignment problem, Localsearch, Parts allocation1. INTRODUCTIONDuring structural manufacturing, we often need to assemble partstogether to create a whole structure. Many of the parts are <strong>de</strong>signedto be i<strong>de</strong>ntical and could be swapped with each other without influenceon characteristics of the assembled structure. But due tovariation of material and manufacturing errors, parts that have beenmanufactured are always slightly different in some properties fromeach other. The parts allocation problem for a structural system isthat, we want to find out how to allocate each of the parts at handto the structure so that the assembled one could reach a best mechanicalperformance, such as minimum <strong>de</strong>flection at some pointun<strong>de</strong>r certain loads and certain constraints.There is a significant feature of this kind of problem, that eachevaluation of a solution requires normally time-consuming computation,e.g. finite element analysis. For a large scale problem,each such analysis could lasts minutes even hours. Therefore, anapplicable algorithm need not return the global optimum, but insteadit has to be able to return a good enough solution with as fewnumber of function evaluations as possible.In this paper, the parts allocation problem for structural systemsis formulated as a nonlinear assignment problem. Assignmentproblem (AP) is a type of problem in combinatorial optimization,which aims at finding a way to assign n items to n other itemsto obtain the minimum of a <strong>de</strong>fined objective. There are manypolynomial-time algorithms have been <strong>de</strong>veloped for linear assignmentproblem (LAP), such as Munkres (Hungarian) algorithm,shortest path algorithms and auction algorithms [1]. Well-knownnonlinear assignment problems are quadratic assignment problem(QAP) and 3-in<strong>de</strong>x assignment problem (3AP), which have beenshown that both are NP-hard problems [2, 3]. For even more generalNAPs, so far, heuristic algorithms are wi<strong>de</strong>ly studied and appliedto find good quality solutions [4, 5].A high quality initial solution is essential for any heuristic algorithm,which could reduce the total number of function evaluationswhile returning a same quality solution. There are several waysto construct initial solutions, for instance, by taking the i<strong>de</strong>ntitypermutation, a randomly generated permutation, or a heuristically<strong>de</strong>termined starting point [4]. For the first two methods, they don’tinclu<strong>de</strong> any consi<strong>de</strong>ration of a specific problem, so there is no reasonto take them as a good starting point.The outline of this paper is as follows: in Section 2, we present theformulation of parts allocation problem for structural systems as aNAP. In Section 3, a procedure to generate an initial solution forthe problem is <strong>de</strong>fined. We apply the technique to some benchmarkexamples and present the test results in Section 4. Finally we reachthe conclusion.2. MATHEMATICAL FORMULATION OF THEPROBLEMIn this study, we assume the properties of each part that have beenmanufactured are measurable and are known. And we take the differencein properties of area of cross-section (A), Young’s Modulus(E) and coefficient of thermal expansion (CTE) into account.Consi<strong>de</strong>r we have n exchangeable parts have been manufacture<strong>da</strong>nd are to be assembled into n different positions of a structuralsystem. The objective is to minimize the displacement at certainpoint or the maximum stress in the assembled structure un<strong>de</strong>r certainloads. We number the n positions and <strong>de</strong>note the propertiesof parts assigned to each position {A(i),E(i),CT E(i)}, i=1, 2, . . . ,n. We also number the parts at hand by 1,2,. . . n, and each witha property set {A j ,E j ,CT E j }, j=1, 2, . . . , n. To evaluate the displacementof the structure un<strong>de</strong>r certain loads, we usually needto perform a finite element analysis, which mainly solves a largesystem of linear equations as follows:KU = F (1)where K is the master stiffness matrix that is <strong>de</strong>pen<strong>de</strong>nt on propertiesA and E of parts at each position, F is the load vector which is<strong>de</strong>pen<strong>de</strong>nt on CTEs, and U is the displacement vector to be computed.We represent the assignment with a permutation matrixX = (x i j ) n×n , which satisfies following assignment constraints:n∑ x i j = 1, j = 1,2,...,n, (2)i=1ALIO-EURO <strong>2011</strong> – 93


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>andn∑ x i j = 1, i = 1,2,...,n, (3)j=1x i j ∈ {0,1}, i, j = 1,2,...,n, (4)4. EXAMPLES AND COMPUTATIONAL RESULTSTo show the quality of the initial solution generated by above method,we tested on several benchmark examples.{1 iff jx i j =th part is allocated to position i,0 otherwise.Thus the areas of cross-section at each position could be interpolatedwith following equation:(5)[A(1),A(2),...,A(n)] T = X[A 1 ,A 2 ,...,A n ] T (6)Similar interpolation schemes are performed for E and CTE. Withthese interpolation formulas, the stiffness matrix and the load vectorare both formulated as a function of entries in the permutationmatrix X. Therefore, the unknown displacement componentsare normally highly nonlinear functions of {x i j }. Further, the responseof stresses in the structure that can be <strong>de</strong>rived from U, arealso nonlinear functions of {x i j }. Finally, we formulated the partsallocation problems as a nonlinear assignment problem.4.1. 10-Bar Truss Allocation ProblemWe tested our method first with a 2D 10-bar truss structure shownin Figure 1. All the bars in the structure are <strong>de</strong>signed to have thesame length of 1000mm, the same circular cross-section of areaA = 1000mm 2 and use the same material with Young’s modulusE = 68.95GPa, CTE = 23.6 × 10 −6 / ◦ C. Thus all of them couldbe swapped with each other. Now assume we have manufacturedten bars to be allocated into the ten positions of the structure, anddue to manufacturing errors, the properties A, E and CTE of eachbar are different to <strong>de</strong>sign slightly. The objective is to find an allocationof the bars to minimize the displacement of no<strong>de</strong> 1 un<strong>de</strong>rboth a uniform thermal load of ∆T = 42.37 ◦ C on the structure an<strong>da</strong> downward force of 29.4kN at no<strong>de</strong> 1.3. PROCEDURE FOR GENERATING AN INITIALSOLUTIONThrough interpolation equation (6), it could be seen that propertiesat each position are continuous functions of X if we make a continuousrelaxation of the binary constraints on each x i j . Therefore,displacements and stresses are also <strong>de</strong>rived to be continuous functionsof X. This continuity makes it mathematically meaningful toevaluate objective at points where entries of X lies between 0 and1. Based on this fact, we <strong>de</strong>signed a 3-step <strong>de</strong>terministic way togenerate an initial solution for a parts allocation problem of size n:Step1. Construct the matrix X S = (x s i j ) n×n, where all the entriesequals to 1/n. And evaluate the objective f S = f (X S ).Step2. Compute c i j = ∂ f /∂x i j at X S , for i, j = 1,2,. . . ,n.Step3. Construct cost matrix C = (c i j ) n×n , and solve the linearassignment problem min ∑ n i, j=1 c i jx 0 i j , where X0 = (x 0 i j ) n×nsatisfies all the assignment constraints from equation (2) toequation (4).We artificially create matrix X S in Step1, which assign all the entriesthe same value so as to avoid bias of any specific possiblesolution. In Step2, we use finite difference method to evaluate thepartial <strong>de</strong>rivatives of f: set stepsize ε be a small positive value, thenc i j ≈ ( f (X S +∆ i j )− f S )/ε, where ∆ i j is a n × n matrix with all theentries equal to zero except the one in position (i,j) equals to ε. Thesolution X 0 in Step3 is just the initial solution we generated.The procedure could be seen as making a linearization of the objectivefunction around X S and then finding the point that reducethe objective most with <strong>de</strong>epest <strong>de</strong>scent method. Thus, if the problemis originally a LAP, then the initial solution we generated isexactly the optimal solution for the problem. For nonlinear assignmentproblems we could also expect to reach a good qualitysolution after Step3 if the <strong>de</strong>rivatives of objective with respect to{x i j } do not change largely at different points.The number of function evaluations we need to construct the initialpoint is n 2 +1. It could be further reduce to n 2 if we simply assumef S in Step1 is 0, which wouldn’t influence the result in Step3 butreduce number of function evaluations by one.Figure 1: 10-bar truss structure un<strong>de</strong>r loads.We tested with three different situations where all the propertiesfor each bar are manufactured with maximum error of 5%, 10%and 50% respectively. And for each error level, we randomly generated10 instances from a uniform distribution. The stepsize εused in Step2 is 10 −3 . Munkres algorithm [6] is applied to solvethe <strong>de</strong>rived LAP in Step3.For each instance, we compute relative error of objective of theinitial solution with respect to that of the global optimum, whichis found by enumerating all the possible permutations with totalnumber of 10! ≈ 3.6 × 10 6 . The average relative errors are 0.00%,0.01% and 0.98% for error level of 5%, 10% and 50% respectively.For lower error level, the properties of bars are less different.Therefore the change of the <strong>de</strong>rivatives of objective withdifferent allocations is less, which leads to higher quality initialsolutions obtained through our method.After generation of the initial solution, we use a stan<strong>da</strong>rd 2-exchangelocal search algorithm starting with it to solve the problem (LS-Our). We compared the results with other two methods: one isusing the same algorithm but starting always from the i<strong>de</strong>ntity permutation(LS-Id); the other one is using the same algorithm butstarting from a randomly generated initial solution (LS-Random).To reduce the occasionality of this method, we randomly generate100 initial points for each instance and take the average performanceto compare with others.The statistical results of the 30 instances are listed in Table 1,where we use following notations: e ini is the average relative errorof the objective of initial solutions with respect to that of the globaloptimum. e f inal is the average relative error of the objective of finalsolutions. p succ is the percentage of successful runs, in whichALIO-EURO <strong>2011</strong> – 94


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>the relative error of the final solution is less than 1%. n ite is theaverage number of iterations and n f unc is the average total numberof function evaluations.Method e ini e f inal p succ n ite n f uncLS-Id 41.4% 0.28% 93% 8.5 384LS-Random 46.9% 0.22 % 94% 8.0 363LS-Our 0.33% 0.00% 100% 3.1 242Table 1: Statistical results with different initial solution.It could be seen that our procedure could generate quite high qualityinitial solutions and increase the ability of the algorithm toachieve successful solutions. Meanwhile, the average number ofiterations and number of function evaluations is largely reducedthough it requires n 2 times function evaluations at the beginning.4.2. 25-Bar Truss Allocation Problem4.2.1. Case without constraintsIn practice, it is always the case that not all of the parts are <strong>de</strong>signedto be the same and could be swapped with each other. However,we could usually divi<strong>de</strong> all of the parts into several groups accordingto their geometry, so that parts in the same group could beexchanged. For this multiple groups problem, when constructingthe initial solution, we simply treat each group in<strong>de</strong>pen<strong>de</strong>ntly byfixing the entries in permutation matrix of other groups to be 1/n g ,where n g <strong>de</strong>note the size of the corresponding group.We tested this kind of problem with a 3D 25-bar truss structurepresented in [7]. All the 25 bars are divi<strong>de</strong>d into 8 groups, and eachgroup has 1,4,4,2,2,4,4,4 bars respectively as colored in Figure 2.Bars of the same group could be exchanged with each other andthey differ in E, CTE and A. The values of these properties are<strong>de</strong>signed to be i<strong>de</strong>ntical as in Section 4.1. Our goal is to minimizethe displacement of no<strong>de</strong> 1 un<strong>de</strong>r a uniform thermal load of ∆T =42.37 ◦ C and some mechanical forces.Load case123No<strong>de</strong>sLoadsFx/kN Fy/kN Fz/kN1 4.45 -44.5 -44.52 0 -44.5 -44.53 2.22 0 06 2.67 0 01 0 89.0 -22.22 0 -89.0 -22.21 4.45 44.5 -22.22 0 44.5 -22.23 2.22 0 06 2.22 0 0Table 2: Load cases for 25-bar truss structure.Method e ini e f inal p succ n ite n f uncLS-Id 5.25% 0.01% 100% 12.7 406LS-Random 4.93% 0.01% 100% 12.0 383LS-Our 0.00% 0.00% 100% 1.2 128Table 3: Statistical results with different initial solution.4.2.2. Case with stress constraintsExcept the goal to minimize the objective, mechanical structuresare always required to fulfil some constraints, typically like limitationof maximum stress. We further add a stress constraint toabove problem:σ max /σ A − 1 ≤ 0 (7)where σ max is the maximum stress in the structure, σ A is the allowablestress. In our problem, σ A is selected to be the maximumstress when bars are all manufactured without error. And the objectiveis still to minimize the displacement of no<strong>de</strong> 1 un<strong>de</strong>r differentloads.We use penalty method to <strong>de</strong>al with constraints. Denote t equalsto the left hand si<strong>de</strong> of the constraint equation (7), and introducefollowing penalty function to be ad<strong>de</strong>d to the objective:p(t) ={αt t > 0,0 t ≤ 0,(8)where α is a large constant so that the penalty of violation increasesquickly and large enough to dominate the objective. Statisticalresults are shown in Table 4, where vio ini is the averagevalue of positive t of initial solutions.Method e ini vio ini e f inal p succ n ite n f uncLS-Id 5.01% 1.36% 0.30% 87% 14.6 468LS-Random 4.73% 1.07% 0.30% 88 % 14.3 460LS-Our 1.82% 0.27% 0.20% 93% 6.7 305Table 4: Statistical results of case with stress constraints.Figure 2: 25-bar truss structure.We applied three different load cases onto the structure, where themechanical forces are different as listed in Table 2. We randomlygenerated 10 instances with manufacturing error of 5% for eachload case. Statistical results are presented in Table 3.The global optimum are still found by enumerating all the possiblepermutations with total number of approximately 3.2 × 10 7 . Theaverage iteration nee<strong>de</strong>d by the algorithm starting from the generatedinitial solution is close to 1, which means the procedure isable to find an initial solution very close to the global optimum.As could be seen, the procedure could return a starter with bothsmaller objective and less violation of the constraint. And the qualityof final solution is higher with a reduction in total number offunction evaluations.4.3. 72-Bar Truss Allocation ProblemFinally, we applied the procedure on a large scale problem whichcontains totally 72 bars in the structure as shown in Figure 3. Allthe bars are divi<strong>de</strong>d into 4 groups with 8,16,16,32 bars respectively.Still, the properties of bars <strong>de</strong>viate from <strong>de</strong>sign with maximumerror of 5%. We apply two load cases where the mechanicalforces are the same as presented in [7] and the uniform thermalload are i<strong>de</strong>ntical as former examples. Our goal is to minimizeALIO-EURO <strong>2011</strong> – 95


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>the displacement of no<strong>de</strong> 20 un<strong>de</strong>r loads. We randomly generate10 instances for each load case. The statistical results of caseswithout and with stress constraints are presented in Table 5 and 6respectively.In this paper, parts allocation problem for structural systems is presente<strong>da</strong>nd formulated into a nonlinear assignment problem. Procedurefor constructing an initial solution for solving this kind ofproblem is established.The procedure has been tested on a 10-bar truss, a 25-bar truss an<strong>da</strong> large-scale 72-bar truss allocation problem. The performancefor problems with stress constraints is also studied. All the resultsshow that our procedure could construct a high quality initial solutionfor parts allocation problems. A stan<strong>da</strong>rd 2-exchange localsearchalgorithm starting from this initial point is able to solvemost of our test examples with fewer total number of functionevaluations compared with starting from the i<strong>de</strong>ntity permutationor randomly generated initial solutions.6. ACKNOWLEDGEMENTSFigure 3: 72-bar truss structure.Method e ini e f inal p succ n ite n f uncLS-Id 11.8% 0.03% 100% 129 98405LS-Random 11.6% 0.03% 100% 122 93571LS-Our 0.16% 0.01% 100% 29.1 23795Table 5: Statistical results of case without constraints.Method e ini vio ini e f inal p succ n ite n f uncLS-Id 11.8% 2.62% 0.17% 100% 136 103676LS-Random 11.5% 2.62% 0.19% 96.5% 132 100546LS-Our 4.78% 0.36% 0.24% 85% 70.4 55387Table 6: Statistical results of case with stress constraints.The total number of possible combinations is 8! × 16! × 16! ×32! ≈ 4.6 × 10 66 . We have no way to find the global optimumin this case. So for each instance, we take the best solution obtainedby all the three methods as the reference solution and therelative error are calculated with respect to it.For this large scale problem, comparing to the total number ofcombinations, the number of function of evaluations we need aremuch smaller. Although the percentage of successful run is relativelow starting from our initial solution, the average final relativeerror is still of the same level. And the reduction on total numberof function evaluations is still significant.5. CONCLUSIONThe authors gratefully acknowledge DAAD (German Aca<strong>de</strong>micExchange Service) for awarding the first author DAAD Scholarshipto carry out study at Institute of Lightweight Structures, TUMünchen, Germany.7. REFERENCES[1] R. Burkard, M. Dell’Amico, and S. Martello, AssignmentProblems. Phila<strong>de</strong>lphia, PA, USA: Society for Industrial andApplied Mathematics, 2009, ch. Linear sum assignment problem,pp. 73–144.[2] S. Sahni and T. Gonzalez, “P-complete approximation problems,”Journal of the Association of Computing Machinery,vol. 23, no. 3, pp. 555–565, July 1976.[3] A. M. Frieze, “Complextiy of a 3-dimensional assignmentproblem,” European Journal of Operation Research, vol. 13,no. 2, pp. 161–164, June 1983.[4] P. M. Par<strong>da</strong>los and L. S. Pitsoulis, Nonlinear AssignmentProblems: Algorithms and Applications (Combinatorial Optimization).Secaucus, NJ, USA: Springer-Verlag New York,Inc., 2000, ch. Heuristics for Nonlinear Assignment Problems,pp. 175–215.[5] E. Aarts and J. K. Lenstra, Local Search in Combinatorial Optimization.Princeton, NJ, USA: Princeton University Press,2003, pp. 57–214.[6] J. Munkres, “Algorithms for the Assignment and TransportationProblems,” Journal of the Society for Industrial and AppliedMathematics, vol. 5, no. 1, pp. 32–38, March 1957.[7] H. A<strong>de</strong>li and O. Kamal, “Efficient optimization of spacetrusses,” Computers and Structures, vol. 24, no. 3, pp. 501–511, 1986.ALIO-EURO <strong>2011</strong> – 96


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Partitioning a service region among several vehiclesJohn Gunnar Carlsson ∗∗ Industrial and Systems Engineering, University of Minnesota111 Church St SE, Minneapolis, MN 55455jgc@isye.umn.eduABSTRACTWe consi<strong>de</strong>r an uncapacitated stochastic vehicle routing problemin which vehicle <strong>de</strong>pot locations are fixed and client locations in aservice region are unknown, but are assumed to be i.i.d. samplesfrom a given probability <strong>de</strong>nsity function. We present an algorithmfor partitioning the service region into sub-regions so as to balancethe workloads of all vehicles when the service region is simplyconnected (has no holes) and point-to-point distances follow some“natural” metric, such as any L p norm. This algorithm can also beapplied to load-balancing of other combinatorial structures, suchas minimum spanning trees and minimum matchings.For each sub-region R i , we will solve a travelling salesman problem,in which the point set consists of a <strong>de</strong>pot point plus all pointsin R i . See figure 1.(a)(b)Keywords: Location, Geometry, Algorithms, Vehicle routing1. INTRODUCTIONOptimal assignment of a workload between several agents is acommon objective that is encountered in resource allocation problems.Frequently, workloads are assigned in such a way as to minimizethe total amount of work done by all agents. In other situations,one may want an equitable assignment that balances theworkload evenly across all agents. Equitable assignment policiesare commonly encountered in queueing theory [1, 2, 3], vehiclerouting [4, 5, 6], facility location [7, 8, 9, 10], and robotics [11, 12],among others.Our motivation for this research comes from an industrial affiliatein the form of a stochastic vehicle routing problem. Our objectiveis to partition a geometric region so as to assign workloads to vehiclesin an equitable fashion. Partitioning and routing occupy twodifferent strategic tiers in the optimization hierarchy; partitioningis done at a (high) tactical management level, while routing optimizationis operational and ma<strong>de</strong> on a <strong>da</strong>y-to-<strong>da</strong>y basis. Hence, anatural strategy, especially in the presence of uncertainty, is to segmentthe service region into a collection of sub-regions and thento solve each routing sub-problem induced at the sub-regions in<strong>de</strong>pen<strong>de</strong>ntlyof the others. This approach was used, for example,by [5], who treated the problem as a two-stage optimization problem(partitioning and routing) and implemented a tabu search andmultistart heuristic to consi<strong>de</strong>r the problem of partitioning a planargraph optimally. This problem is also often consi<strong>de</strong>red in thecontext of facility location [7, 8, 10] and robotics [12].In this paper, we give an algorithm that takes as input a planar, simplyconnected (not having holes) region R, together with a probability<strong>de</strong>nsity f (·) <strong>de</strong>fined on R. Contained in R is a collection of n<strong>de</strong>pot points P = {p 1 ,..., p n }, representing the starting locationsof a fleet of vehicles. We assume (purely for expositional purposes)that each point p i corresponds to exactly one vehicle. Thevehicles must visit clients whose exact locations are unknown, butare assumed to be i.i.d. samples from the <strong>de</strong>nsity f (·). Our goalis to partition R into n disjoint sub-regions, with one vehicle assignedto each sub-region, so that the workloads in all sub-regionsare asymptotically equal when a large number of samples is drawn.(c)Figure 1: Inputs and outputs to our problem. We begin with a<strong>de</strong>pot set and a <strong>de</strong>nsity f (·) <strong>de</strong>fined on a region R (1(a)), which wethen partition (1(b)). This partition should be constructed so that,when points are sampled in<strong>de</strong>pen<strong>de</strong>ntly from f (·) (1(c)), the TSPtours of all the points in each sub-region are asymptotically equal(1(d)).Our problem turns out to be a special case of the equitable partitioningproblem, in which we are given a pair of <strong>de</strong>nsities λ (·)and µ (·) on a region R and we want to partition R into n subregionsR i with ˜R iλ (·) dA = 1 ˜n˜R λ (·) dA and R iµ (·) dA =1n˜R µ (·) dA for all i. The case where λ (·) and µ (·) are bothatomic measures consisting of gn and hn points for some positiveintegers g and h is a well-studied problem in combinatorialgeometry known as a red-blue partition [13, 14, 15], and severalfast algorithms are already known for this problem. Our problemconsists of a “mixed” case where λ (·) is an atomic measure consistingof n <strong>de</strong>pot points and µ (·) represents the TSP workloadover a sub-region when points are sampled from f (·).The outline of this paper is as follows: first, we <strong>de</strong>scribe a necessarycondition for optimality of a partition of R that follows immediatelyfrom well-known results from geometric probability. Nextwe give an algorithm that finds an optimal partition of R when Ris a simply connected polygon. Finally, we present some simulationresults that show the solution quality of our algorithm whenapplied to some simulated problems and a case study.(d)ALIO-EURO <strong>2011</strong> – 97


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>2. SUMMARY OF KEY FACTS AND FINDINGS FROMRELATED WORKIn this section we summarize the important theoretical results thatform the basis of our partitioning algorithm. We consi<strong>de</strong>r the travellingsalesman problem (TSP) in a planar region R, where thedistance between two points is Eucli<strong>de</strong>an, or any other “natural”metric such as the Manhattan or sup norm. The well-known BHHtheorem [16] says that the length of an optimal TSP tour of a setof points follows a law of large numbers:Theorem 1. Suppose that {X i } is a sequence of random pointsi.i.d. according to a probability <strong>de</strong>nsity function f (·) <strong>de</strong>fined ona compact planar region R. Then with probability one, the lengthTSP({X 1 ,...,X k }) of the optimal travelling salesman tour traversingpoints {X 1 ,...,X k } satisfiesTSP({X 1 ,...,Xlimk })√ = βk→∞ k¨R√fc (x)dA (1)where β is a constant and f c (·) represents the absolutely continuouspart of f (·).It is additionally known that 0.6250 ≤ β ≤ 0.9204 [17]. This resultwas subsequently improved in [18], which showed that a similarlaw of large numbers holds for any subadditive Eucli<strong>de</strong>an functional,such as a minimum-weight matching, minimum spanningtree, Steiner tree, or Delaunay triangulation, with different constantsβ. Applying a stan<strong>da</strong>rd coupling argument to (1) gives thefollowing result:Theorem 2. Let R be a compact planar region and let f (·) be anabsolutely continuous probability <strong>de</strong>nsity <strong>de</strong>fined on R. Let {X i } bea collection of i.i.d samples drawn from f (·). Let {R 1 ,...,R n } bea partition of R. If a partition of R into n disjoint pieces R 1 ,...,R nsatisfies ¨√¨1 √f (x)dA = f (x)dA (2)R in Rfor i ∈ {1,...,n}, then asymptotically, the lengths of the TSP toursTSP({X 1 ,...,X k } ∩ R i ) will differ by a term of or<strong>de</strong>r o( √ k), wherek is the number of points sampled. Hence, the maximum tourlength over any sub-region R i differs from the optimal solution bya term of or<strong>de</strong>r o( √ k).As a special case, we remark that when f (·) is the uniform distributionon R, if a partition of R into n disjoint pieces {R 1 ,...,R n }satisfiesArea(R i ) = Area(R)/nthen asymptotically, the lengths of the TSP tours TSP({X 1 ,...,X k }∩R i ) will differ by a term of or<strong>de</strong>r o( √ k).3. THE EQUITABLE PARTITIONING PROBLEM ON ASIMPLY CONNECTED SERVICE REGION3.1. AnalysisThe optimality condition <strong>de</strong>fined in theorem 2 is easy to achieve,in the absence of other criteria; for example, a partition might consist˜ exclusively √ of vertical ˜ √lines, with each vertical strip cutting offstrip f (x)dA = 1n R f (x)dA. For this reason, we will imposeadditional constraints on our algorithm that should, in principle,give a better solution. Recall that in our original problemstatement, we assumed that our service region R contained a setof <strong>de</strong>pot points P = {p 1 ,..., p n }. A natural constraint to imposeis that each sub-region R i should contain the <strong>de</strong>pot point that wehave assigned to it.This still leaves us with consi<strong>de</strong>rable freedom; we have not yetimposed any constraints on the shape of the sub-regions. A furtherproperty that might be <strong>de</strong>sired is that for any two points u,v ∈ R i ,the shortest path between u and v be contained in R i . When theinput region R is convex, this constraint is equivalent to requiringthat each sub-region R i also be convex. When R is not convex, theproperty that we <strong>de</strong>sire is called relative convexity [13]: each subregionR i should be convex “relative” to the input region R, so thatthe shortest path between u,v ∈ R i (which may not be a straightline) must itself be contained in R i . Our main result in this paperis the following theorem:Theorem 3. Given a simply connected region S with m vertices,a probability <strong>de</strong>nsity µ (·) <strong>de</strong>fined on S such that˜S µ (x) dA = 1,and a collection of points P = {p 1 ,..., p n } ⊂ S where the verticesof S and the points in P are all in general position, there exists apartition of S into n relatively convex sub-regions S 1 ,...,S n withdisjoint interiors, where each sub-region S i contains exactly onepoint from P and satisfies ´S iµ (x) dA = 1/n. Furthermore, wecan find such a partition in running time O (nN logN), where N =m + n.Using theorem 2, by setting µ (·) = √ f (·), the algorithm <strong>de</strong>scribedin theorem 3 partitions S into n sub-regions whose TSP tours (forpoints sampled from the <strong>de</strong>nsity f (·)) are asymptotically equalwhen a large number of points is sampled. For purposes of brevitywe will assume that Area(S) = 1 and that f (·) is the uniform distribution,so our goal is to partition S into relatively convex piecesof area 1/n, each containing a point p i . The rea<strong>de</strong>r is invited torefer to [19] for the complete generalization of our algorithm an<strong>da</strong> proof of its running time. An example of the input and output ofour algorithm is shown in figure 2. We let ∂ <strong>de</strong>note the boun<strong>da</strong>ry(a)Figure 2: Inputs S and P (2(a)) and output (2(b)) to our problem,where µ (·) is the uniform distribution on S. Note that the regionmarked S i consists of two polygons joined at a vertex, but stillsatisfies our relative convexity constraint.operator, e.g. ∂S <strong>de</strong>notes the boun<strong>da</strong>ry of S. We let |·| <strong>de</strong>note thecardinality operator, e.g. |P| = n. We begin with some <strong>de</strong>finitions:Definition 1. Let S be a compact, simply connected planar region,and let P = {p 1 ,··· , p n } ⊂ S <strong>de</strong>note a set of n points, where nis even. A partition {S 1 ,S 2 } of S into 2 (relatively) convex subregionsis said to be an equitable (relatively) convex 2-partition ifwe haveArea(S 1 )|P ∩ S 1 |= Area(S 2).|P ∩ S 2 |Definition 2. An S-geo<strong>de</strong>sic between two points u and v in a simplepolygon S, written G(u,v|S ), is the shortest path between uand v contained in S.Definition 3. A sub-region ˜S of a simple polygon S is relativelyconvex to S if, for every pair of points u,v ∈ ˜S, the S-geo<strong>de</strong>sicG(u,v|S ) lies in ˜S.Definition 4. Given two points u and v on ∂S, the left shell L (u,v|S )consists of all elements of S lying on or to the left of G(u,v|S ). If(b)ALIO-EURO <strong>2011</strong> – 98


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>u or v does not lie on ∂S, then we <strong>de</strong>fine L (u,v) = L(u ′ ,v ′) ,where u ′ and v ′ are obtained by extending the endpoints of G(u,v|S )via straight lines to ∂S (see figure 3).φ ( ¯x) = 0 and our equitable 2-partition is obtained. We can findthis by performing a binary search for i ∈ {k,...,n − k}, wherefor each i we compute the point LShell i/n (x 0 ) and the number ofpoints contained therein. The preceding argument guarantees thatwe must find an equitable 2-partition somewhere in this procedure.Case 2Figure 3: The geo<strong>de</strong>sic G(u,v|S ), its extension points u ′ and v ′ ( ),and the induced left shell L (u,v|S ) = L u ′ ,v ′ |S .Definition 5. Given a point u on ∂S and a positive integer α k, we have a left shell containing too many points (relative to itsarea) and another left shell containing too few points. Hence, there#"!! !"#$%&" (" )! !"#$%&" '"! !"#$%&$"This section consists of a proof of the following theorem:Theorem 4. Let x 0 and x 1 be two points on ∂S. If Area(L (x 0 ,x 1 |S )) =kn for some integer k ≤ n/2 and |L (x 0,x 1 |S ) ∩ P| > k, then we canfind a relatively convex equitable 2-partition of S and P in runningtime O (N logN), where N = m + n.Note that theorem 4 is more than sufficient to prove theorem 3when n = 2 j for some positive integer j and f (·) is the uniformdistribution, since we can always meet the necessary conditions oftheorem 4 with k = n/2 (by dividing S in half with any geo<strong>de</strong>sic,and counting the number of points on either si<strong>de</strong>), and then applytheorem 4 recursively to both sub-regions. This can also beused more generally for other n, although we have omitted the discussionhere for brevity (see [19] for the complete result). Theremain<strong>de</strong>r of this section consists of a sketch of a proof of thistheorem.As in the theorem, let x 0 and x 1 be two points on ∂S such thatArea(L (x 0 ,x 1 |S )) = k n for some integer k ≤ n/2 and|L (x 0 ,x 1 |S ) ∩ P| > k. Construct another point x 2 on ∂S so thatArea(L (x 2 ,x 0 |S )) = k n . Then either |L (x 2,x 0 |S ) ∩ P| < k or|L (x 2 ,x 0 |S ) ∩ P| > k (if we have equality then we are finished),and in either case we can <strong>de</strong>rive an equitable 2-partition:Case 1Suppose that |L (x 2 ,x 0 |S ) ∩ P| > k. Then |L (x 0 ,x 2 |S )∩P| < n−k and Area(L (x 0 ,x 2 |S )) = n−kn . Hence, L (x 0,x 1 |S ) containstoo many points (relative to its area) and L (x 0 ,x 2 |S ) contains toofew points. Consi<strong>de</strong>r a family of left shells L (x 0 ,x|S ), whereFigure 5: An equitable geo<strong>de</strong>sic shell exists between ¯x and ˜x withk = 4 and n = 9.must exist some pair of points ¯x, ˜x in ∂S such that ¯x ∈ ∂L (x 0 ,x 2 |S )and ˜x ∈ ∂L (x 1 ,x 0 |S ) (see figure 5), where Area(L ( ¯x, ˜x|S )) = nkand |L ( ¯x, ˜x|S ) ∩ P| = k. This is because the function LShell k/n (x)is continuous in x (for x ∈ ∂S), and the assumption that our pointslie in general position ensures that as x traverses ∂S from x 0 to x 2 ,the elements of P will enter and exit Lone.(x,LShell k/n (x)4. COMPUTATIONAL RESULTS)one byTheorem 2, our criterion for optimal partitioning, is an asymptoticresult. We are guaranteed ( that vehicle workloads will differ by√k )terms of or<strong>de</strong>r o , but we have not yet established that workloadsare in fact balanced when this algorithm is employed (e.g.,that the convergence in k may be slow in practice). In this sectionwe give some examples that suggest that vehicle workloads willin fact be balanced in a practical setting when point-to-point distancesare Eucli<strong>de</strong>an. We also present the results of a case studyin which we apply our partitioning algorithm as a pre-processingstage in a non-Eucli<strong>de</strong>an vehicle routing problem using <strong>da</strong>ta suppliedfrom an industrial affiliate. In this problem, we are given themap of a road network of a city, and we must use our fleet of vehiclesto traverse every road. This is a multi-vehicle variant of theChinese Postman Problem (CPP), a well-studied routing optimizationproblem first <strong>de</strong>scribed in [20].4.1. Simulation resultsFigure 4: A family of left shells cutting off area n k , k+1 n−kn ,..., n ,with k = 2 and n = 9.x traverses ∂S clockwise from x 1 to x 2 ; see figure 4. The functionφ (x) := Area(L (x 0 ,x|S ))− k n |L (x 0,x|S ) ∩ P| is piecewisecontinuous, increasing on each of its components, and <strong>de</strong>creasingat each discontinuity. Since φ (x 1 ) < 0 and φ (x 2 ) > 0, the intermediatevalue theorem guarantees the existence of a point ¯x whereWe first present the results of a simulation in which we constructa synthetic <strong>da</strong>ta set with n = 9 <strong>de</strong>pots where f (·) is a mixture ofthree Gaussian distributions, truncated to lie within a simple polygonS ⊂ [0,1] 2 . One of the polygons that forms the input to oursimulation is shown in figure 6. For each polygon, we generate 20scenarios, with each scenario consisting of 30 samples of k pointsin S, for k between 50 and 1500 (and hence we performed a totalof 600 simulations per polygon). TSP tours were computed usingthe Lin-Kernighan heuristic from Concor<strong>de</strong> [21]. Tour lengths fora particular scenario, and the average vehicle tour lengths over allscenarios, are shown in figure 7. As the plots show, the vehicleworkloads are well balanced by partitioning; these suggest that theALIO-EURO <strong>2011</strong> – 99


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>graph, and consequently vehicle tours may not be geographicallyseparate. In a practical setting it is <strong>de</strong>sirable to separate one vehicle’sroute from another in an obvious geographic way so as tolocalize drivers to specific areas of the city.In our partition, each sub-region contains a <strong>de</strong>pot and all subregionscontain (approximately) the same total amount of roads.(a)(b)Figure 6: The input and output to our simulation.Tour lengths21.81.61.41.210.80.6Tour lengths1.81.61.41.210.80.60.40.200 500 1000 1500Total # points(a)0.40.200 500 1000 1500Total # pointsFigure 7: Tour lengths of the 9 vehicles in a particular randomscenario, and average tour lengths over 20 scenarios (7(b)).o( √ k) term of theorem 2 may be negligible, although the variabilitybetween vehicle tours for small k is still high. This is not surprisingsince our partition is “asymptotically optimal” and makesno guarantees for the tour lengths when the number of points issmall. A second observation is that our algorithm performs wellwhen many scenarios are averaged, as suggested in figure 7(b).For a related application, figure 8 shows the result of this algo-Figure 8: An equitable partition of Hennepin County, Minnesota.All sub-regions have the same total population and each subregioncontains one post office.rithm applied to a map of Hennepin County, Minnesota, whereµ (·) is the population <strong>de</strong>nsity and P represents the 29 largest postoffices. Rather than producing equal TSP tour lengths, this partitionsso that each mail carrier services the same number of houseseach <strong>da</strong>y.4.2. Case studyAs a final example, we show in figure 9 a partition of the roadnetwork of a city that was provi<strong>de</strong>d by an industrial affiliate. Theobjective in this problem is to traverse every street segment in thecity with a fleet of vehicles originating at various <strong>de</strong>pots. Althoughheuristics for these kinds of problems are already known [22], theydo not take advantage of the fact that our road map is a planar(b)Figure 9: An equitable partition of a road network that is relativelyconvex with respect to the metric induced by the road network. Allsub-regions have the same total road mass and each sub-regioncontains a <strong>de</strong>pot.Each sub-region is “relatively convex” to the metric induced bythe road network (i.e. for any two points u,v ∈ R i , the shortestpath from u to v lies in R i ). The lengths of the total amount ofroads in each sub-region differ by a factor of at most 1.11.5. REFERENCES[1] Y. Azar, “On-line load balancing,” in Online Algorithms, ser.Lecture Notes in Computer Science, vol. 1442. SpringerBerlin / Hei<strong>de</strong>lberg, 1998, pp. 178–195.[2] Y. He and Z. Tan, “Ordinal on-line scheduling for maximizingthe minimum machine completion time,” Journal ofCombinatorial Optimization, vol. 6, no. 2, pp. 199–206, June2002.[3] H. Kellerer, V. Kotov, M. G. Speranza, and Z. Tuza, “Semion-line algorithms for the partition problem,” Operations ResearchLetters, vol. 21, no. 5, pp. 235 – 242, 1997.[4] J. G. Carlsson, D. Ge, A. Subramaniam, and Y. Ye,“Solving the min-max multi-<strong>de</strong>pot vehicle routing problem,”in <strong>Proceedings</strong> of the FIELDS Workshop on GlobalOptimization, 2007. [Online]. Available: http://www.stanford.edu/~yyye/MDVRP-JGSWY.pdf[5] D. Haugland, S. C. Ho, and G. Laporte, “Designing<strong>de</strong>livery districts for the vehicle routingproblem with stochastic <strong>de</strong>mands,” European Journalof Operational Research, vol. 180, no. 3, pp. 997– 1010, 2007. [Online]. Available: http://www.sciencedirect.com/science/article/B6VCT-4K9C5B8-5/2/d783603a3a80c1d1e6379a16d47d59ce[6] M. Pavone, N. Bisnik, E. Frazzoli, and V. Isler, “Decentralizedvehicle routing in a stochastic and dynamic environmentwith customer impatience,” in RoboComm ’07: <strong>Proceedings</strong>of the 1st international conference on Robot communicationand coordination. IEEE Press, 2007, pp. 1–8.[7] B. Aronov, P. Carmi, and M. Katz, “Minimum-costload-balancing partitions,” Algorithmica, vol. 54, no. 3,pp. 318–336, July 2009. [Online]. Available: http://www.springerlink.com/content/v42887v071p41701/[8] O. Baron, O. Berman, D. Krass, and Q. Wang, “The equitablelocation problem on the plane,” European Journal ofOperational Research, vol. 183, pp. 578–590, 2007.ALIO-EURO <strong>2011</strong> – 100


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>[9] O. Berman, Z. Drezner, A. Tamir, and G. O. Wesolowsky,“Optimal location with equitable loads,” Annals of OperationsResearch, vol. 167, no. 1, pp. 307–325, March 2009.[10] Z. Drezner and A. Suzuki, “Covering continuous <strong>de</strong>mandin the plane,” Journal of the Operational Research Society,vol. 61, no. 5, pp. 878–881, 2010.[11] M. Jäger and B. Nebel, “Dynamic <strong>de</strong>centralized area partitioningfor cooperating cleaning robots,” in in ICRA 2002,2002, pp. 3577–3582.[12] M. Pavone, A. Arsie, E. Frazzoli, and F. Bullo, “Distributedpolicies for equitable partitioning: theory and applications,”in <strong>Proceedings</strong> of the 47th IEEE Conference on Decision andControl. Piscataway, NJ, USA: IEEE Press, 2008, pp. 4191–4197.[13] S. Bereg, P. Bose, and D. Kirkpatrick, “Equitablesubdivisions within polygonal regions,” ComputationalGeometry, vol. 34, no. 1, pp. 20 – 27, 2006, special Issueon the Japan Conference on Discrete and ComputationalGeometry 2004. [Online]. Available: http://www.sciencedirect.com/science/article/B6TYS-4H877G8-3/2/635100921efef04dc3e364aa283b958b[14] S. Bespamyatnikh, D. Kirkpatrick, and J. Snoeyink,“Generalizing ham sandwich cuts to equitable subdivisions,”Discrete and Computational Geometry, vol. 24, pp. 605–622, 2000. [Online]. Available: http://dx.doi.org/10.1007/s004540010065[15] A. Kaneko and M. Kano, “Discrete geometry on red andblue points in the plane - a survey,” in in Discrete andComputational Geometry, The Goodman-Pollack Festschrift.Springer, 2003, pp. 551–570.[16] J. Beardwood, J. Halton, and J. Hammersley, “The shortestpath through many points,” <strong>Proceedings</strong> of the CambridgePhilosophical Society, vol. 55, pp. 299–327, 1959.[17] D. L. Applegate, R. E. Bixby, V. Chvatal, and W. J. Cook,The Traveling Salesman Problem: A Computational Study(Princeton Series in Applied Mathematics). Princeton, NJ,USA: Princeton University Press, 2007.[18] J. M. Steele, “Subadditive eucli<strong>de</strong>an functionals andnonlinear growth in geometric probability,” The Annals ofProbability, vol. 9, no. 3, pp. 365–376, 1981. [Online].Available: http://www.jstor.org/stable/2243524[19] J. G. Carlsson, “Equitable partitioning for multi-<strong>de</strong>potvehicle routing,” INFORMS Journal on Computing,Un<strong>de</strong>r revision, see http://www.tc.umn.edu/~jcarlsso/equitable-partitioning-IJOC-revision.pdf.[20] M. K. Kwan, “Graphic programming using odd or evenpoints,” Chinese Math., vol. 1, pp. 273–277, 1962.[21] W. Cook, “Concor<strong>de</strong> TSP Solver,” http://www.tsp.gatech.edu/concor<strong>de</strong>.html, 1997–2005.[22] G. N. Fre<strong>de</strong>rickson, “Approximation algorithms for somepostman problems,” J. ACM, vol. 26, pp. 538–554,July 1979. [Online]. Available: http://doi.acm.org/10.1145/322139.322150ALIO-EURO <strong>2011</strong> – 101


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>A Bi-Objective Approach for Selection of Sugarcane Varieties in BrazilianCompaniesMargari<strong>da</strong> Vaz Pato ∗ † Helenice <strong>de</strong> Oliveira Florentino ‡∗ Instituto Superior <strong>de</strong> Economia e Gestão, Universi<strong>da</strong><strong>de</strong> Técnica <strong>de</strong> Lisboa, Portugal, FCUPAddress: Depto. Matemática, ISEG, Rua do Quelhas, 6, 1200-781, Lisboa, Portugalmpato@iseg.utl.pt† Centro <strong>de</strong> Investigação Operacional, <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> Ciências, Universi<strong>da</strong><strong>de</strong> <strong>de</strong> Lisboa, Portugal‡ Depto. Bioestatística, Instituto <strong>de</strong> Biociências, Universi<strong>da</strong><strong>de</strong> Estadual Paulista, Botucatu, BrasilAddress: Rubião Júnior, P. O. Box 510, CEP 18618-000, Botucatu, São Paulo, Brazilhelenice@ibb.unesp.brABSTRACTThe selection of sugarcane varieties is an important problem facedby sugarcane mill companies confronted by the issue of efficiencyand the reduction of <strong>da</strong>mage to the environment. Here the authorspresent the problem of sugarcane variety selection in the light oftechnical constraints and the aim to minimize collection and transportcosts of the residue from sugarcane harvest and maximize energyobtained from the residue. This problem will be presented andformalized within bi-objective binary linear programming. Thestudy is mainly <strong>de</strong>voted to the application of a bi-objective geneticalgorithm to solve real problems addressed in the São Paulo Stateof Brazil. Results from the computational experiment un<strong>de</strong>rtakenwill be reported.Keywords: Selection of sugarcane varieties, Bi-objective geneticalgorithm1. INTRODUCTIONBrazil is the world’s largest sugarcane producer. This crop is mainlyused to obtain ethanol, sugar and energy. Currently, the big worryfor environmental and governmental organizations arises from theresidue generated when harvesting. On one hand, the commonpractice of burning the straw prior to harvest brings about seriousenvironmental <strong>da</strong>mages and will soon be prohibited. On the otherhand, the absence of burnings, leading to the additional straw accumulatingon the soil creates favourable conditions for parasitesand <strong>de</strong>lays sugarcane shooting, thus compromising the next crop.Therefore, the <strong>de</strong>stiny of this residual material in the field has beenthe subject of many studies. Of particular interest is the one <strong>de</strong>votedto the selection of sugarcane varieties <strong>de</strong>signed to cope withenvironmental and economic requirement issues, in short referredto as SSVP.A mo<strong>de</strong>l for the SSVP will be given, followed by a brief presentationof a bi-objective genetic algorithm and, finally, by computationalresults.2. MATHEMATICAL MODELThe SSVP consists of <strong>de</strong>termining which of the n varieties a<strong>da</strong>ptedto local soil and climate conditions should be planted in each ofthe k plots. They should, at the same time offer the lowest possiblefield-to-mill transfer cost and maximum energy balance for residualbiomass from the sugarcane harvest. Moreover, the solutionmust satisfy sucrose and fibre limits for sugarcane, recommen<strong>de</strong>dby the company, use the whole area set asi<strong>de</strong> for sugarcane plantationand respect the specific varieties’ area limits.To construct a bi-objective binary linear programming mo<strong>de</strong>l forthe SSVP we consi<strong>de</strong>r the <strong>de</strong>cision variables x i j = 1 if sugarcanevariety i is planted in plot j, x i j = 0, in the opposite case (for alli = 1,2,...,n; j = 1,2,...,k) and the parameters:c i j : transfer cost of the residual biomass produced from 1 ha ofsugarcane variety j on plot i;e i j : energy balance of the biomass from 1 ha of variety j on ploti;s i j : estimated sucrose production from plot j should it beplanted with variety i;Slo: minimum quantity established for the total sugar to be extractedfrom the planting area;f i j : estimated fibre content of sugarcane planted in plot j withvariety i;Flo, Fup: lower and upper bounds established for the total quantity offibre;L j : area of plot j;Lup i : maximum area for variety i.The mo<strong>de</strong>l follows:n kminimize f 1 (x) = ∑ ∑ c i j x i j (1)i=1 j=1n kmaximize f 2 (x) = ∑ ∑ e i j x i j (2)i=1 j=1subject to n k∑ ∑ s i j x i j ≥ Slo (3)i=1 j=1n kFlo ≤ ∑ ∑ f i j x i j ≤ Fup (4)i=1 j=1n∑ x i j = 1 j = 1,2,...,k (5)i=1k∑ L j x i j ≤ Lup i i = 1,2,...,n (6)j=1x i j = 0 or 1 i = 1,2,...,n; j = 1,2,...,k (7)ALIO-EURO <strong>2011</strong> – 102


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>This multi-objective optimization problem (MOP) is similar to theone presented in [1], however more complete from the practicalperspective insofar as it preserves the quality of sugarcane in termsof fiber and optimizes both cost and energy balance.The SSVP is NP-hard, hence non-exact methods are required tocope with the medium/high dimension instances of the SSVP characterizingthe most frequent real cases arising from companies inthe Mid South region of Brazil.3. BI-OBJECTIVE GENETIC ALGORITHMFrom among the many types of non-exact multi-objective methods,the genetic or evolutionary heuristics have proved to be successfulin obtaining solutions for difficult MOPs. The reason forthis is that they <strong>de</strong>al with a population of solutions with differentcharacteristics as to the optimization goals. [2] covers the actualresearch and application in the field. Genetic heuristics have beensuccessfully applied for multi-objective problems with knapsackand semi- assignment type constraints, e.g. [3],[4].Within the new bi-objective genetic algorithm we <strong>de</strong>veloped forSSVP each individual of the population is characterized by a singlechromosome that represents a solution for the SSVP. The chromosomeis enco<strong>de</strong>d through an integer valued vector whose k componentsprovi<strong>de</strong> the sugarcane varieties selected. Hence, in thisrepresentation each gene is a variety, the very one proposed for theplot. The solution may or not be feasible and, in this case, bothcost and energy are penalized. To evaluate the individual’s fitness,the simple rank concept is used, thus giving relevance to the dominancerelations, as within NSGA type algorithms [2].The dimension of the population in every generation is N=100 andthe maximum number of generations is Nmax=2000. Two differentprocesses are used to generate the individuals of the initial population:one is a constructive algorithm to produce Ng=4 individualsby enforcing the bounding constraints of the SSVP - (3) (4) and(6) and the other algorithm randomly generates the remaining N-Ng individuals.As to the operators, five basic operators are applied to the currentpopulation, to create the population of the next generation:selection, crossover, mutation, repair and elitism. The selectionoperator is a stan<strong>da</strong>rd binary tournament to build the Pool, givingpriority to an individual with a low cost and a high energy balance.The crossover is the one point procedure. When a child is not feasible,it is repaired through the action of the repair operator, theabove constructive algorithm. Afterwards, each child replaces anyone of the parents in the Pool, but only if it is fairly better than thatparent is as regards the dominance relation. Then mutation applieswith probability p m =0.05 on each gene of all the chromosomes ofthe Pool. If a gene is going to mutate, the sugarcane variety forthe respective plot is randomly chosen by giving equal probabilityto all the n varieties. Again, if the mutant is not feasible, thenthe repair operator is applied. Finally, within the elitist operator,all the potentially efficient individuals of the previous generation,here represented by S ∗ , are inclu<strong>de</strong>d in the Pool and the populationfor the next generation is <strong>de</strong>termined by eliminating the |S ∗ | lessfitted individuals from the Pool.4. COMPUTATIONAL RESULTSThe bi-objective genetic algorithm was tested along with an exactmethod with an SSVP instance corresponding to a small companyof the São Paulo State in Brazil [5], thus producing results that willbe given at the talk. This company <strong>de</strong>als with 10 sugarcane varietiesand possesses a total area of 315.81 ha. Other 80 simulatedinstances, corresponding to fields from 405 to 6075 ha, have alsobeen solved with the above algorithm.The effect of the genetic evolution on the initial population for allthe 81 test instances and the computing times will be shown. Thequality of the solutions obtained from the genetic algorithm is accessedthrough performance measures [6]]. These figures showthat, at low computing times, the spread within the non-exact frontieris high and the cardinality of this frontier is also significant.All the programs were co<strong>de</strong>d in MATLAB [7] and ran on CORE 2QUAD computers with 2.83 GHz and 2G RAM at the Departmentof Biostatistics, UNESP, Botucatu, Brazil.5. FINAL COMMENTSResults obtained from the computational experiment reveal the favorablebehavior of the bi-objective genetic heuristic specially <strong>de</strong>visedfor SSVP, both from the mathematical and the practical perspectives.Hence, this methodology will be appropriate in helping managersof sugarcane mills in the Brazilian Mid South region to plan theirproduction activities.6. ACKNOWLEDGEMENTSThanks are due to FUNDUNESP and FAPESP, Brazil (grants No.2009/14901-4, No. 2010/07585-6) and to FCT, Portugal (projectPOCTI/ISFL/152) for the financial support.7. REFERENCES[1] H. O. Florentino, E. V. Moreno, and M. M. P. Sartori, “Multiobjectiveoptimization of economic balances of sugarcaneharvest biomass,” Scientia Agricola (Brazil), vol. 65, pp. 561–564, 2008.[2] C. A. C. Coello, G. B. Lamont, and D. A. V. Velduizen,Evolutionary algorithms for solving multi-objective problems,2nd ed. New York: Springer, 2007.[3] K. Florios, G. Mavrotas, and D. Diakoulaki, “Solving multiobjective,multiconstraint knapsack problems using mathematicalprogramming and evolutionary algorithms,” EuropeanJournal of Operational Research, vol. 203, pp. 14–21, 2010.[4] P. R. Harper, V. <strong>de</strong> Senna, I. T. Vieira, and A. K. Shahani, “Agenetic algorithm for the project assignment problem,” Computers& Operations Research, vol. 32, pp. 1255–1265, 2005.[5] A. D. Lima, “Otimização do aproveitamento do palhiço <strong>da</strong>biomassa residual <strong>da</strong> colheita <strong>de</strong> cana-<strong>de</strong>-açúcar,” Ph.D. dissertation,<strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> Ciências Agronômicas, UNESP, Botucatu,Brazil, 2009.[6] Y. Collette and P. Siarry, Multiobjective optimization: Principlesand case studies. Berlin: Springer, 2003.[7] M. version 7.6.0.324 (R2008a), High performance numericcomputation and visualization software: Reference Gui<strong>de</strong>.Natick, USA: The MathWorks Inc., 2008.ALIO-EURO <strong>2011</strong> – 103


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>An Imputation Algorithm Applied the Nonresponse ProblemJose Brito ∗ Nelson Maculan † Luiz Ochi ‡ Flavio Montenegro § Luciana Brito ◦∗ ENCE, Escola Nacional <strong>de</strong> Ciências EstatísticasRua André Cavalcanti,106, sl 403, CEP:20231-050, Rio <strong>de</strong> Janeiro, Braziljose.m.brito@ibge.gov.br† COPPE, Universi<strong>da</strong><strong>de</strong> Fe<strong>de</strong>ral do Rio <strong>de</strong> JaneiroP.O. Box 68511, 21941-972 Rio <strong>de</strong> Janeiro, Brazilmaculan@cos.ufrj.br‡ UFF, Universi<strong>da</strong><strong>de</strong> Fe<strong>de</strong>ral Fluminense, Instituto <strong>de</strong> ComputaçãoRua Passo <strong>da</strong> Pátria 156, Bloco E, 3 o an<strong>da</strong>r, São Domingos, Niterói, RJ, Brazilsatoru@dcc.ic.uff.br§ IBGE, Instituto Brasileiro <strong>de</strong> Geografia e Estatística, DPE/COMEQAv.Chile, 500, 10 o An<strong>da</strong>r, Centro, Rio <strong>de</strong> Janeiro, RJ, Brazilflavio.montenegro@ibge.gov.br◦ UNIPLI, Centro Universitário Plínio LeiteAv. Viscon<strong>de</strong> do Rio Branco, 123, Centro, Niterói, RJ, Brazilluroquebrito@hotmail.comABSTRACTThis work <strong>de</strong>scribes an imputation algorithm to solve the nonresponseproblem in surveys. The nonresponse is associated the occurrenceof missing values in at least one variable of at least registryor unit of the survey. In or<strong>de</strong>r to prevent the negative effectsof nonresponse, an intense research has been produced in this areaand many procedures have been implemented. Among these, we<strong>de</strong>tach the imputation methods, that consist basically of substitutinga missing value by some suitable one, according some criterionor rule. In this work we propose a new imputation algorithmthat combines the clustering method and GRASP metaheuristic.Toevaluete its performance we present a set of computational resultsconsi<strong>de</strong>ring <strong>da</strong>ta from Brazilian Demographic Census 2000.Keywords: Nonresponse, Imputation, GRASP, Cluster Analysis,Survey1. INTRODUCTIONNonresponse is a normal but un<strong>de</strong>sirable feature of a survey [1]. Itis characterized by incomplete records of a survey <strong>da</strong>tabase, whichmay occur in the phase of <strong>da</strong>ta collection or <strong>da</strong>ta estimation. Nonresponseoccurs when, at least for one sampling unit (household,person, etc) of the population or sample [2] of the survey, thereis nonresponse to one question of a questionnaire (record) or theinformation given is not usable. Or else, when at least one itemof a questionnaire was not completed (survey variable). Incompletequestionnaires due to nonresponse are common in surveys,but <strong>de</strong>serve attention. Therefore, a consi<strong>de</strong>rable amount of moneyhas been spent in the <strong>de</strong>velopment and improvement of proceduresassociated to <strong>da</strong>ta assessment, in or<strong>de</strong>r to prevent the occurrenceof nonresponse or to minimize its negative effects. There has beenextensive research in this field, which is reported in many studies,such as [1, 3, 4, 5]. Among the procedures being <strong>de</strong>veloped arethose classified as imputation methods, which basically consist inreplacing a missing <strong>da</strong>ta with an estimated value, according to acriterion or rule [1]. With the purpose of treating the nonresponseissue, the present study introduces a method that combines an imputationrule, a technique of cluster analysis [6, 7] and GRASPmetaheuristics [8, 9] (Greedy Randomized A<strong>da</strong>ptive Search).2. NONRESPONSE AND IMPUTATIONThere are two types of nonresponse: (1) total nonresponse, whichcorresponds to the units from which no usable information wascollected, and partial nonresponse, corresponding to the units fromwhich there is at least one variable with a missing value and whichare not part of the total nonresponse set. The present study hasfocused on the treatment of partial nonresponse. Then, the conceptof nonresponse is <strong>de</strong>scribed in greater <strong>de</strong>tail, with emphasison some procedures for the treatment of nonresponse through imputationmethods. At first we may consi<strong>de</strong>r a set of p variablesassociated e.g. to the socio<strong>de</strong>mographic characteristics of a survey<strong>de</strong>fined by X 1 ,X 2 ,...,X p . Such characteristics are obtained for npersons (records), which <strong>de</strong>termines a matrix X np that has for eachinput X i j the value of the jth variable (characteristic) observed inthe ith i = 1,...,n record. If a M i j indicating variable of the observationof the corresponding <strong>da</strong>ta is associated to each X i j , we’llhave M i j = 1, If there is a value for X i j and M i j = 0, If it is otherwise.And based on this, a M matrix that <strong>de</strong>fines the pattern ofthe missing <strong>da</strong>ta is <strong>de</strong>fined. In the present article, we shall treatthe missing <strong>da</strong>ta associated to one single variable X j (UnivariateMissing Data), known as the study variable. That is, the matrix Mshall have zero elements in only one of its columns. The remainingvariables (p − 1) shall be treated as explicative variables, thatis, variables correlated with the variable of interest and that can beused to predict the values of this variable.When incomplete records are found in a given <strong>da</strong>tabase, that is,when there is missing information on one of the variables of the<strong>da</strong>tabase, <strong>da</strong>ta can be imputed. Imputation is a procedure throughwhich the missing values for one or more study variables "areALIO-EURO <strong>2011</strong> – 104


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>filled" with estimated values [1]. These "replacements" must beperformed according to a rule. The imputed values can be classifiedinto three main categories: (i) values constructed using a<strong>de</strong>vice for automatic imputation of missing values, consi<strong>de</strong>ring animputation statistical rule (ii) values observed for elements withsimilar response; (iii) values constructed by expert opinion or "bythe best possible estimate" [1]. The categories (i) and (ii) can becalled statistical rules because they use a statistical method aimedto produce a replacement value reasonably close to the originalvalue. The (i) category is frequently based on regression prediction[1]. Imputation is especially used in the treatment of partial nonresponse,which concerns the simulations presented in this article,although it can also be used in the treatment of total nonresponse.There are several methods of imputation [1, 5], such as: (1) NearestNeighbor Imputation: a function of the distance between thecomplete and incomplete records is calculated consi<strong>de</strong>ring the explicativevariables (p−1). The value of the observed unit with thesmallest distance to the non-respon<strong>de</strong>nt unit will be substituted forthe missing item. (2) Hot Deck Imputation: the variable X j associatedto an incomplete record is substituted for a value obtainedfrom a distribution estimated from the available <strong>da</strong>ta (completerecords). A complete record (donor) is selected in or<strong>de</strong>r to provi<strong>de</strong>values for the missing information in the incomplete record (recipient).This method is typically implemented in two stages: inthe first stage, a set of <strong>da</strong>ta is distributed into k groups (imputationclasses) consi<strong>de</strong>ring the explicative variables (p − 1) associated tothe study variable. Once the k groups are <strong>de</strong>fined, in the secondstage, the group of each incomplete record is i<strong>de</strong>ntified. The completerecords of a group are used to estimate the unknown valuesin the incomplete records. (3) Mean imputation: it is a simplemethod applicable to continuous variables. It substitutes the missingvalues with the general mean for the variable.3. METHODOLOGYThe present study shall treat the problem of nonresponse with thetype of imputation classes used in the Hot Deck method, expandingthe use of these classes to the cases of mean imputation (whichis then based on records associated to each one of these classes).Since the <strong>de</strong>finition of the imputation classes has direct impact onthe incomplete records, a new methodology for the <strong>de</strong>finition ofthe classes shall be proposed in this study, with the application ofthe cluster analysis, a technique wi<strong>de</strong>ly used to solve the problemof obtaining homogeneous groups (clusters) from a <strong>da</strong>tabase withspecial characteristics or attributes [7]. The clusters formed arecharacterized as follows: the objects of one cluster are very similarand the objects or different clusters are very dissimilar, consi<strong>de</strong>ringthe objective function (that aggregates the distances) shown inthe equation below.kf = ∑ ∑ d sr (1)l=1 ∀o s ,o r ∈C lThe function presented in the equation 1 consi<strong>de</strong>rs for each clusterC l ,l = 1,...,k the sum of all the objects that are part of the group.Therefore, minimizing f consists in allocating all the objects tothe clusters in such a way that the total sum of the distances (dissimilarities)between two objects from each one of the clusters isminimum.Regardless the objective function consi<strong>de</strong>red or other distance functions,this is not a simple task because of the combinatorial natureof this type of problem (see also [10, 11]). If a process of exhaustivesearch is used to obtain an optimal solution, all solutionsshall be enumerated, that is, all the possibilities of combination ofthe objects n in groups k. In general, the m number of possibilitiesgrows exponentially as a function of n [6]. Such characteristicmakes it impracticable to obtain the exact resolution of averageand large instances of these problems. A previous study on metaheuristicsapplied to cluster problems [12, 13, 14, 15] suggests thatit is a good alternative for the resolution of several clustering problems.In general, with the application of metaheuristics, feasiblesolutions of higher quality than those from heuristics (local minimums)are obtained.Consi<strong>de</strong>ring the last observation, and with the purpose of constructingthe classes used in the imputation of <strong>da</strong>ta, a cluster algorithmthat uses GRASP meta-heuristics was <strong>de</strong>veloped [9] andwhose objective function is the equation (1). The GRASP is an iterativegreedy heuristic to solve combinatorial optimization problems.Each iteration of the GRASP algorithm contains two steps:construction and local search. In the construction, a feasible solutionis built using a randomized greedy algorithm, while in the nextstep a local search heuristic is applied based on the constructed solution.3.1. Grasp AlgorihtmConstruction Procedure: Consi<strong>de</strong>ring a D set formed by objectsn (records of a <strong>da</strong>tabase) and a fixed number of clusters k, k objectsof D are selected, with each object allocated to a clusterC l ,l = 1,..,k. Then, in each construction iteration, each one ofthe (n − k) objects is allocated consi<strong>de</strong>ring their proximity to theobjects o j that are already part of each group C l . That is, in eachiteration, there is a list of candi<strong>da</strong>tes LC composed of objects o i notyet allocated to a cluster and two vectors q and g . Each positionq contains the number of the cluster where the closest object o j islocated (using the 1 equation of each object o i ). The vector g correspondsto the distance of the object o j in the <strong>da</strong>tabase located atthe shortest distance from each object o i . Based on the referred information,a LCR restricted candi<strong>da</strong>te list is constructed, which isformed by the o i objects, so that g i ≤ g min +α(g max −g min ). Beingg max and g min , respectively the maximum and minimum distancesfound in g. Then, an object LCR (element) is randomly selecte<strong>da</strong>nd allocated to one of the clusters consi<strong>de</strong>ring the informationstored in q. Every time a new object is inserted in one of the clusters,the candi<strong>da</strong>te list is up<strong>da</strong>ted. And when LC = /0 all the objectsshall be allocated to one of the clusters k.Local Search Procedure: At this step, the reallocation of objectsbetween the clusters k is sought, in or<strong>de</strong>r to reduce the value ofthe equation (1), and consequently, produce more homogeneousclusters (classes) for performing the imputation. Consi<strong>de</strong>ring thesolution obtained in the construction step, in each iteration of thisprocedure, two clusters C r and C l are selected from the clusters k<strong>de</strong>fined in the construction step. Afterwards, various (random) selectionsof an object o i ∈ C r and an object o j ∈ C l are performed,and in each selection the distances d i ,d il ,d j ,d jr are calculated.The values for d i and d j correspond respectively to the sum of thedistances from object o i to the other objects C r and the sum of thedistances from object o j to the other objects C l . And d il representsthe sum of the distances from object o i to the other objects C l . Anequal <strong>de</strong>finition is applied to d jr , though consi<strong>de</strong>ring the sum ofthe distances between the object o j and the objects C r . After thecalculation of the distances d i ,d il ,d j ,d jr , three types of reallocationsare assessed:(1) The object o i is allocated to cluster C l and the object o j isallocated to cluster C r and d = −d i + d il − d j + d jr is calculated.(2) The object o i is allocated to cluster C l and d = −d i + d il iscalculated(3) The object o j is allocated to cluster C r and d = −d j + d jr iscalculated.The reallocation that produces the greatest reduction (lowest valueALIO-EURO <strong>2011</strong> – 105


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>of d) in the objective function given by (1) shall be applied in thecurrent solution. Such reallocations are performed until the improvementsw (reductions) in the value of the objective functionare obtained, or until the number of replacement attempts is equalto a value of n Cr ∗ n Cl . Being n Cr and n Cl , respectively the numberof objects in clusters C r and C l . When at least one of the conditionsis satisfied, we get back to the main loop and select two newclusters. At the end of the local search, the new candi<strong>da</strong>te solutiongenerated is checked and compared to the best results obtained sofar, consi<strong>de</strong>ring previous GRASP iterations.3.2. Imputation AlgorithmThe imputation algorithm consi<strong>de</strong>rs, as input, a <strong>da</strong>tabase with nrecords, with complete information for the (p−1) explicative variables,X 1 ,X 2 ,...X p−1 . Besi<strong>de</strong>s, the missing information for thestudy variable X p in a given number n∗ < n of records, or else,a percentage of nonresponse. Then, the two basic steps of the algorithmare <strong>de</strong>scribed:• The algorithm GRASP is applied in the <strong>de</strong>termination of theimputation classes consi<strong>de</strong>ring the number of clusters equal to k.The objective function presented in the equation 1 and used in theGRASP consi<strong>de</strong>rs, for cluster purposes, the distances between theexplicative variables (p − 1).• Once the classes are constructed, the procedure of mean imputationis applied in each one of the incomplete records n∗ in relationto X p . This implies <strong>de</strong>termining to each class C l (l = 1,...,k) eachincomplete record i is allocated and assign a value ¯X l that correspondsto the mean (in class l) complete records in relation tovariable X p .x• Thus, ¯X l = ∑ipi∈Cl n l ∗ , being n l∗ the number of complete recordsin cluster C l and x ip the value of the variable X p in the nth completerecord that is part of the cluster C l .Figure 1: Phases of the Imputation Algorithm4. RESULTSThe present section contains a few computational results obtainedwith the application of the imputation algorithm, implemented inDelphi language (version 6.0) and run on Windows 7. All the computationalexperiments are performed in a 16 GB RAM I7 PC witha 2.93 GHz I7 processor. Prior to the presentation of the results, asmall <strong>de</strong>scription of the <strong>da</strong>ta used in the study is ma<strong>de</strong>, as well as ofthe nonresponse mechanism [1, 5, 16] consi<strong>de</strong>red for the <strong>da</strong>tabaseused in the experiments.4.1. DataIn or<strong>de</strong>r to perform the experiments, a real <strong>da</strong>tabase, more specifically,a file of the Sample of the 2000 Brazilian Demographic Census(state of Rio Gran<strong>de</strong> do Sul) was used. Based on this file, nineweighted areas (WAs) were drawn for the simulations with the imputationalgorithm. A weighted area is a small geographical areaformed by a mutually exclusive enumeration areas (cluster of censussegments), which comprise, each one of them, a set of recordsof households and people [17]. We <strong>de</strong>ci<strong>de</strong>d to work with the fileof people, where each record is related to the individual characteristicsof each inhabitant. And of the variables available in theserecords, six variables X 1 ,...,X 6 were selected to be consi<strong>de</strong>red inthe imputation, as follows: sex, relationship with the responsibleperson, age in years, highest completed level of education, schoolingyears and the gross earnings from the main occupation. Thefive first variables (all categorical) are explicative and correlated tothe earnings in reais (quantitative), which was the study variableconsi<strong>de</strong>red.4.2. Mechanisms that Lead to Missing Data and the Generationof Incomplete RecordsAs in any other study aimed to assess whether the method of imputationproduces good estimates for the imputed variable [2], thenonresponse mechanism must be consi<strong>de</strong>red. That is, since informationon a given study variable is missing, these values shall beimputed on a subset of records. In particular, concerning the earnings,it is known that the loss of information is greater for classeswith higher income, which characterizes a mechanism of nonresponsecalled Not Missing at Random (NMAR). This means thatthe probability of non-information of each input in the nth columnof X shall <strong>de</strong>pend on the values observed for the variable X p in matrixX (see section two). Such mechanism was used to perform thesimulations consi<strong>de</strong>ring a <strong>da</strong>tabase where all the records containthe information for the study variable (original records). With theapplication of the nonresponse mechanism, subsets of incompleterecords in relation to the gross earnings from the set can be generated,and consequently apply imputation to these records. Thenumber of incomplete records generated in the simulation <strong>de</strong>pendson the rate of nonresponse consi<strong>de</strong>red.One possible procedure for the generation of incomplete recordsconsists in assigning a previous value pr (0 ≤ pr ≤ 1) that correspondsto the probability of nonresponse (missing information) tothe study variable in each original record. In the present study, inparticular, such probability was obtained consi<strong>de</strong>ring the variablesrelationship with the responsible person (11 categories), highestcompleted level of education, (10 categories) and schooling years(four categories). According to the category informed for each oneof these variables, a probability pr of 0.1, 0.2 or 0.3 of the earningvalue (X 6 ) not being informed was attributed to each record. Themore the category is related to high earnings, the greater the probabilityis [16]. Once this probability is <strong>de</strong>fined, a value between 0and 1 is drawn for each record, and this value is compared to theprobability of nonresponse (pr) of the record. If the probabilityof the record is lower than the value drawn, such record shall havethe gross earning value informed at the incomplete <strong>da</strong>tabase, and,otherwise, it shall be consi<strong>de</strong>red a missing <strong>da</strong>ta on this <strong>da</strong>tabase.With the use of this procedure, r replicas can be generated fromthe complete <strong>da</strong>tabase, which correspond to the <strong>da</strong>tabase with differentincomplete records.4.3. Computational ExperimentsInitially, for the applying and vali<strong>da</strong>ting of the imputation algorithmto the records associated to the nine files of people (WAs)ALIO-EURO <strong>2011</strong> – 106


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>(see section 4.1), a rate of nonresponse of 10% was <strong>de</strong>fined andr = 100 replicas of the original <strong>da</strong>tabases were generated withdifferent subsets of incomplete records for each r replica. Applyingmean imputation to the incomplete records, we obtain foreach replica the complete records and the imputed records. Consi<strong>de</strong>ringsuch information, the values X¯rm e X¯rc were calculated,which correspond to the means associated to X p consi<strong>de</strong>ring: allthe records of each replica (complete and imputed) and only thecomplete records. It is also said that the same classes of imputation(clusters) were used in all the replicas. In this particular experiment,the algorithm GRASP was applied consi<strong>de</strong>ring the values kequal to 4, 6 and 8. Still concerning the GRASP, the number ofiterations was fixed in 50, improvements equal to 20 and the parameterα equal to 0.5.Table (1) shows the results obtained with the application of theimputation algorithm to the records of the nine instances used inthe simulations. The first column contains the number of the instanceand column two contains the records of each WA. Columnthree contains the number of constructed clusters (classes of imputation).Columns four and five contain the value of the objectivefunction (1) and the processing time (seconds) to constructthe clusters, generate the 100 replicas and apply the imputation.Columns six, seven and eight contain the values of X¯p , X¯m e X¯c thatcorrespond, respectively, to the mean of the incomes of all records(original <strong>da</strong>tabase) and the mean of the means of X¯rm and X¯rcconsi<strong>de</strong>ring the 100 replicas, that is: X¯m = ∑100 ¯ rr=1 X m ¯100 X c = ∑100 ¯ rr=1 X c100 .Finally, column nine contains the value of ρ that corresponds to therelative mean <strong>de</strong>viation between X¯p and X¯rm : ρ = ∑100 | X¯p −X¯r m |r=1 X¯r .mWA n k Time F OBJ X¯p X¯c X¯m ρ4 18 2369.3 559.1 561.5 3.51 178 6 6 1262.9 561.5 556.1 561.3 3.08 3 783.5 555.2 559.5 3.64 34 3875.4 509.3 512.2 1.62 222 6 11 2095.9 513.3 509.8 513.7 1.68 5 1359.7 508.2 512.5 1.64 77 7260.7 367.6 372.5 2.73 289 6 24 4012.4 373.6 366.7 371.9 3.18 11 2695.6 367.0 372.0 2.84 113 9268.9 349.5 354.1 1.74 334 6 36 4932.8 355.3 350.2 354.2 1.48 17 3349.6 350.2 354.8 1.34 215 12248.0 1162.9 1171.1 1.55 410 6 64 6808.8 1174.6 1161.5 1172.9 1.78 30 4359.1 1165.2 1176.1 1.64 332 17383.3 544.0 547.9 1.36 476 6 105 9326.4 547.3 541.3 546.4 1.58 49 6201.4 541.9 546.3 1.44 485 21402.2 438.3 439.2 1.17 539 6 153 11655.5 440.2 435.3 438.2 1.48 71 7591.6 437.4 440.5 1.34 764 28575.4 583.4 588.0 0.98 628 6 240 14730.3 590.9 584.4 589.4 0.98 113 9858.2 582.8 588.5 0.94 1121 38222.6 443.4 445.8 0.89 710 6 349 20743.3 446.7 442.8 445.8 0.98 160 13498.0 442.8 445.8 0.9Table 1: Results for the Imputation AlgorithmThe analysis of the results of columns 6, 7 and 8 of table (1) showsthat the application of the imputation algorithm has ma<strong>de</strong> it possibleto obtain good estimates for the mean, consi<strong>de</strong>ring the 100replicas. In particular, the values between 0.8% and 3.6% in columnnine indicate that the means in relation to the imputed recordswere reasonably close to the real mean value X¯p .Based on the results obtained, and <strong>de</strong>spite the need for a greaternumber of experiments, the combination of GRASP and clusteranalysis with an imputation method can be a good alternative tothe treatment of the problem of nonresponse and produce goodquality estimates for <strong>da</strong>tabases with incomplete records. In or<strong>de</strong>rto improve this procedure in the future, we intend to a<strong>da</strong>pt it tothe treatment of categorical variables. Also, we intend to use otherobjective functions for the construction of the clusters, as well asother metaheuristics such as ILS or Genetic Algorithms [9].5. ACKNOWLEDGEMENTSThe FAPERJ (project APQ1 E-26/111.587/2010) (http://www.faperj.br) and CNPQ (project 474051/2010-2) (http://www.cnpq.br) for the financial suport.6. REFERENCES[1] C. E. Sarn<strong>da</strong>l and S. Lundstrom, Estimation in Surveys withNonresponse. John Wiley and Sons Ltd, 2005.[2] S. L. Lohr, Sampling: Design Analysis. Brooks/Cole, CengageLearning, 2010.[3] J. G. Bethlehem and H. M. P. Kersten, “On the treatment ofnonresponse in sample surveys,” Journal of Official Statistics,vol. 1, no. 3, pp. 287–300, september 1985.[4] J. G. Bethlehem, “Reduction of nonresponse bias through regressionestimation,” Journal of Official Statistics, vol. 4, pp.251–260, <strong>de</strong>cember 1988.[5] R. J. A. Little and D. B. Rubin, Statistical Analysis with MissingData. John Wiley and Sons Ltd, 2002.[6] A. R. Johnson and D. W. Wichern, Applied Multivariate StatisticalAnalysis. Prentice Hall. Fifth Edition, 2002.[7] H. C. Romesburg, Cluster Analysis for Researchers. LuluPress, 2004.[8] T. A. Feo and M. G. C. Resen<strong>de</strong>, “Greedy randomized a<strong>da</strong>ptivesearch procedures,” Journal of Global Optimization,vol. 6, pp. 109–133, 1995.[9] F. Glover and G. Kochenberger, Handbook of Metaheuristics.Kluwer Aca<strong>de</strong>mic Publishers, 2003, pp. 219–249.[10] P. Hansen and B. Jaumard, “Cluster analysis and mathematicalprogramming,” Mathematical Programming, vol. 79, pp.191–215, 1997.[11] P. A. L. J. Hubert and J. J. Meulman, CombinatorialData Analysis: Optimization by Dynamic Programming.Phila<strong>de</strong>lphia: Society for Industrial and Applied Mathematics,2001.[12] M. C. G. Guojun and W. Jianhong, Data Clustering: Theory,Algorithms and Applications. ASA-SIAM Series onStatistics and Applied Probability, 2007.[13] M. J. Brusco and D. Steinley, “A comparison of heuristicsprocedures for minimum within-cluster sums of squares partitioning,”Psychometrika, vol. 72, pp. 583–600, 2007.[14] W. Sheng and X. Liu, “A genetic k-medoids clustering algorithm,”Journal of Heuristics, vol. 12, pp. 447–446, 2006.[15] M. C. V. Nascimento, F. M. B. Toledo, and A. C. P. L. F.Carvalho, “Investigation of a new grasp-based clustering algorithmaapplied to biological <strong>da</strong>ta,” Computers and OperationsResearch, vol. 37, pp. 1381–1388, 2010.[16] S. Albieri, “A ausência <strong>de</strong> respostas em pesquisas: Uma aplicação<strong>de</strong> métodos <strong>de</strong> imputação. dissertação impa,” 1989.[17] http:/www.censo2010.ibge.gov.br/altera_idioma.php?idioma=_EN.ALIO-EURO <strong>2011</strong> – 107


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Automatic Generation of Algorithms for the Non Guillotine Cutting ProblemJ. Alejandro Zepe<strong>da</strong> ∗ Víctor Para<strong>da</strong> ∗ Gustavo Gatica † Mauricio Sepúlve<strong>da</strong> ∗∗ Informatics Engineering Department, University of Santiago of ChileSantiago, Chile{jose.zepe<strong>da</strong>,victor.para<strong>da</strong>,mauricio.sepulve<strong>da</strong>}@usach.cl† Universi<strong>da</strong>d Andrés BelloSantiago, Chileggatica@unab.clABSTRACTThere exist several optimization problems for which an efficientsolution algorithm have not been found, they are used in <strong>de</strong>cisionmaking for a lot of production and service processes. In practice,hard problems must be solved in an operational, tactical andstrategically way insi<strong>de</strong> several organizations. Using this assumption,<strong>de</strong>veloping algorithms for finding an approximate solution or"a good solution" is encouraging.The automatic generation of optimization programs is an emergingfield of research. The construction of programs is <strong>de</strong>velopedthrough several evolving-nature hyper-heuristics or local searchmethod. We used Genetic Programming to find algorithms rewrittenas pseudo-co<strong>de</strong> and analyze them to get new knowledge.The experiment evolved individuals to solve the Non-GuillotineCutting Stock Problem, a NP-Hard Problem. We tested the populationobtained over a <strong>da</strong>ta set of instances from literature, the fittestindividual averaged 5.4% of material waste and was the object ofour analysis. We found interesting blocks of genetic co<strong>de</strong> that resembleintuitive human solutions, and we believe that crafting theterminal and functional elements to facilitate the comparison mayhelp to find interesting even human-competitive algorithms.Keywords: Genetic programming, Cutting Stock Problem, Algorithms1. INTRODUCTIONThere exist several optimization problems for which an efficientsolution algorithm have not been found [1, 2]. They are used in<strong>de</strong>cision making for a lot of production and service processes. Inpractice, hard problems must be solved in an operational, tacticaland strategically way insi<strong>de</strong> several organizations [3]. Generallythe main goal of finding the best solution is sacrificed, as either it isnot in the computational scope or the search cost is higher than thebenefits. Using this assumption, <strong>de</strong>veloping algorithms for findingan approximate solution or ä good solutionïs encouraging. Analgorithm to solve an optimization problem needs to maximize orminimize some given objective function, so the whole partial solutionset must belong to the feasible solution space.The automatic <strong>de</strong>velopment of optimization programs is a field ofintense research, having Burke as its mayor exponents [4]. Thefeasible solution is an individual, in this case a computer programthat solves a given problem, and the objective function is an evaluatorfor some characteristics to be searched, for example efficacy,simplicity, size, etc. The Genetic Programming (GP) [5, 6] can beused as a tool to generate algorithms, if some primitives are <strong>de</strong>signedto be easy to comprehend and close to some programminglanguage to establish some parallelism. GP could evolve thosestructures and find algorithms, rewritten as pseudoco<strong>de</strong> and analyzedto get new knowledge. Some related works have been publishedby [7] who solved the coloring graph, by [8] who evolved"greedy programs" to solve the Traveling Sales Problem and by[4] who have generated programs to solve the packing problem[4, 9, 10]. This research presents one algorithm generated throughGP to solve a NP-Hard Problem, the Non-Guillotine Cutting StockProblem (NGCSP) [11].2. GENERATING ALGORITHMSThe generating process of algorithms through GP is presented ina preliminary sequence of general steps <strong>de</strong>pict by [12]: The firststep is a clear <strong>de</strong>finition of the problem domain, but without anystatement about how to solve it; NGCSP was mo<strong>de</strong>led as a setof <strong>da</strong>ta structures and procedures to simulate the process of nonguillotinecutting, i.e., the sheet, the pieces, the geometric constraints,the dynamic process (to obtain a layout pattern throughsome <strong>de</strong>grees of freedom to use the entities and behaviors), and anevaluator to assess the result. In this research, we <strong>de</strong>fine a set ofterminals and functions which fulfill the Closure and Sufficiencyproperties, using the entities and their behaviors yet mentioned;Then the objective function quantify the fitness of the individualusing the mo<strong>de</strong>l’s evaluator. We selected the execution parametersof GP after being i<strong>de</strong>ntified through local search for different probabilitiesof mutation and crossover to find the ones best suited forthe evolutionary process. Finally, the evolutionary process is runand eventually the fittest individual would be found. This iterativeprocess may require the re<strong>de</strong>finition of some step, until achievingthe generation of algorithms with the performance nee<strong>de</strong>d.The NGCSP consi<strong>de</strong>rs a rectangular sheet of area A with (W,L) asdimensions, being W the width and L the length. Let R a set ofrectangular pieces of lesser dimensions (w i ,l i ), i = 1,2,...,n, an<strong>da</strong>rea a i [13]. A layout is a set of pieces cut from the sheet, minimizingthe waste of material and fulfilling some rules of geometricfeasibility. The mathematical formulation is:Min Z(x) = W · L −∑w i · l i · x i where x i 0,∀i ∈ N (1)iThere were <strong>de</strong>fined 20 operations, among terminals and functions,and a fitness function that evaluate the performance of each individual.In this case, the fitness is the used area ratio for a fitnesscase or problem instance (1), being T p the total pieces cut fromcontainer sheet, see equation (2):ALIO-EURO <strong>2011</strong> – 108


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>f =T p∑ a ii=1Furthermore, let h be the height of the tree in which it is mappe<strong>da</strong>n algorithm automatically generated using GP. Here h is set to 14and let δ be the total of no<strong>de</strong>s of a full (strictly) binary tree. Let Nbe the total of no<strong>de</strong>s of each individual generated and ParsimonyP be the ratio between N and δ. So as to simplify the analysis, it is<strong>de</strong>fined the Correctness C as the total of semantic errors shown inan individual divi<strong>de</strong>d by N. Let raw fitness RF, a fitness measuretaken directly from the domain problem, here being un<strong>de</strong>rstood asa measure of error e equals to the sum of ratios of wasted area consi<strong>de</strong>ringthe N e fitness cases or examples from the domain problem,as shown in (3).RF =AN e∑j=1(2)1 − f (3)The stan<strong>da</strong>rd fitness is calculated using the additional selectivepressures C and P, being P a penalty over P i , being i = RF, P,C all summing 1, then SF = RF ∗ p RF + P ∗ p P +C ∗ p C . To setupthe parameters used, a local search tool, ParamILS [14], resultingin the crossover probability of 90%, a mutation probability Swapand Shrink of 1%, respectively. The kernel used is GPC++ <strong>de</strong>velopedby [15], a personal computer with an Intel Core I7-9402.93Ghz processor and 8 GB RAM.Evolution provi<strong>de</strong>d a population of 1500 individuals trained tosolve the problem, evolved over a group of 44 instances [16, 17].Later the same population was tested over a <strong>da</strong>ta set of 8 instancespublished by Hopper, and selected the individual that <strong>de</strong>picted thebest pattern layouts, the smaller waste of material (see Figure 1).Its bloating zones of useless co<strong>de</strong> were cleaned, and this strippedgenetic co<strong>de</strong> was synthesized as pseudo-co<strong>de</strong>, analyzed and <strong>de</strong>scribed.The convergence of the experiment was similar to thatof a Genetic Algorithm [18], being very fast in the first generations.Annex 1 shows the best algorithms, whose average loss rateis 5.4%, also inclu<strong>de</strong>s control parameters, pseudo-co<strong>de</strong>, associate<strong>da</strong>lgorithmic complexity and layouts obtained.3. CONCLUSIONSIt was common to obtain individuals with high polynomial algorithmiccomplexity O(n 4 ), with nested looping co<strong>de</strong> apparentlyunnecessary or redun<strong>da</strong>nt and useless co<strong>de</strong> inflation, resulting ina slower execution. In analyzing the algorithms, there are geneticconstructs with intuitive procedures, where a cycle of placement ofpieces, it is reviewed if it is possible that minor available piece atthe time be used a wasted area as a result of impossibility placingthere the current minor piece available. The discovered algorithmhas a genetic fragment called "greedy" that have been appearedfrequently in the fittest individuals, with some variations in shapebut easily recognizable in the structural. Within the conditionalloop checking the existence of parts, it is inclu<strong>de</strong>d the placementof the piece achieving best fit. Thus, in each step, a <strong>de</strong>cision istaken to put the item that best fit the current situation and the restremains to be consi<strong>de</strong>red a sub-problem. The algorithm optimizesthe problem evolved since for all the test instances used a <strong>de</strong>terministicprocedure to find a solution of a certain quality (greaterthan 90%). An interesting modification to improve the current resultswould be to add to the set of primitive selectors some terminalsfor basic allocation strategies. Moreover, given the frequentpresence of similar co<strong>de</strong> fragments, the use of ADF would benefitoverall performance [5]. Based on the foregoing, we conclu<strong>de</strong>that PG is capable of evolving two-phase algorithm, a constructiveand a Local Search. The evolution found a way to solve the problem,and it is perfectly possible to enhance the results in the wayto generate new, better and human-competitive solutions [6, 19].4. ANNEX A: ALGORITHM SPECIFICATIONSNumber of Generation: 1362 Size of Population: 1500Pc, Pm, Pu: 0.95, 0.04, 0.0 Random Seed: 12470Used ADF: No Aptitu<strong>de</strong>: 1.65411Algorithm 1: ADD PIECERequire: A piece p.1: l = l + p2: lA = lA + p3: lL = lL + p4: lW = lW + pTable 1: Algorithm specificationsAlgorithm 2: REMOVE PIECERequire: A piece p.1: l = l − p2: lA = lA − p3: lL = lL − p4: lW = lW − pAlgorithm 3: PUT PIECERequire: A piece p A space e.Ensure: Boolean n.1: if PUT PIECE(p,e) then2: REMOVE PIECE(p)3: e < −− availbleSpaceBottomLeft()4: return True5: else6: return False7: end ifAlgorithm 4: PUT PIECERequire: A piece p.Ensure: Boolean n.1: e < −− availbleSpaceBottomLeft()2: if PUT PIECE(p,e) then3: REMOVE PIECE(p)4: e < −− availbleSpaceBottomLeft()5: return True6: else7: return False8: end if5. REFERENCES[1] D. J. M. Garey, Computers and intractability. A gui<strong>de</strong> to thetheory of NP-completeness. W.H. Freeman and Company,San Francisco, Calif, 1979.ALIO-EURO <strong>2011</strong> – 109


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Algorithm 5: GREEDYEnsure: Boolean b.1: loop = f alse2: ad < −− availableArea()3: while l.notEmpty() && noChange


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Algorithm 9: MAIN1: l < − − listo f re f erencestoavailablepieces.2: lA < − − listo f re f erencestoavailablepiecessortedbyarea.3: lL < − − listo f re f erencestoavailablepiecessortedbylength.4: lW < − − listo f re f erencestoavailablepiecessortedbywidth.5: e < − − container6: if SUB RUTINE 1() then7: loop = f alse8: ad1 < − − availableArea()9: while SUB RUTINE 2() && l.notEmpty() &&noChange1 < maxTryoutsWithEnhance() do10: loop = true11: ad2 < −− availableArea()12: while PUT PIECE(rotate(piece(maxWidth))) &&l.notEmpty() && noChange2


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Enhancements to the best fit heuristic for the orthogonal stock-cutting problemJannes Verstichel ∗ † Patrick De Causmaecker † Greet Van<strong>de</strong>n Berghe ∗∗ CODeS, KAHO Sint LievenGebroe<strong>de</strong>rs De Smetstraat 1, 9000 Gent, Belgium{jannes.verstichel, greet.van<strong>de</strong>nberghe}@kahosl.be† CODeS, KU Leuven Campus KortrijkEtienne Sabbelaan 53, 8500 Kortrijk, Belgiumpatrick.<strong>de</strong>causmaecker@kuleuven-kortrijk.beABSTRACTWe present several enhancements to the best fit heuristic for the orthogonalstock-cutting problem. The solution quality of the heuristicis improved by applying additional placement policies and newor<strong>de</strong>rings of the items. These additions are combined with an optimaltime implementation of the heuristic to improve the heuristic’sscalability. Experiments on a large test set from the literature showsignificantly better results in shorter calculation times compared tothe original best fit heuristic.Keywords: Orthogonal stock-cutting, Best fit heuristic1. INTRODUCTIONOver the years, extensive research has been performed in the domainof cutting and packing problems. The results have been appliedin different fields of operations research, for example, thepaper and metal industries. Several bibliographic papers exist ontypologies for cutting and packing problems [1, 2]. We focus onthe two dimensional orthogonal stock cutting problem, which wasproven to be NP hard [3]. The goal is to place a number of rectangularitems on a rectangular sheet as <strong>de</strong>nsely as possible withoutitem overlap, resulting in a minimal height of the sheet nee<strong>de</strong>d forplacing all the items. A 90 <strong>de</strong>gree rotation of the items is allowe<strong>da</strong>nd each stock sheet has a fixed width and infinite length, allowingall items to be placed on a single sheet. Several approaches existfor tackling this problem. A linear and dynamic programming approachis presented in [4], while [5] uses artificial neural networksto solve the problem. One of the best known heuristics for thisproblem is the bottom left (fill) heuristic and its variants [6, 7, 8]. Abest fit heuristic, which outperforms the bottom left based heuristicson all benchmarks with more than 50 items and most smallerinstances, is presented by Burke et al. [9]. The scalability ofthis heuristic has been strongly improved by Imahory and Yagiura[10]. They reduce the time complexity of the best fit heuristicto O(nlogn) and show that the heuristic performs very well forvery large <strong>da</strong>ta instances. Several metaheuristic approaches to theorthogonal stock cutting problem exist. These are mostly hybridisationsthat generate different input sequences for existing heuristicapproaches in or<strong>de</strong>r to improve their results [8, 11, 12]. Other approachesuse genetic algorithms [8, 11, 13, 14]. An interestingcomparison of different (meta) heuristic approaches and geneticalgorithms can be found in [12]. In [15] a metaheuristic combiningthe best fit heuristic and a simulated annealing bottom left fillhybridisation further improves on the results of [9].In this abstract, we present several enhancements to the originalbest fit heuristic. In Section 2, we introduce this a<strong>da</strong>pted best fitheuristic. Next, we improve the time complexity of the heuristicby using the <strong>da</strong>ta structures from [10] in Section 3. In Section4 the results of the heuristic, both with respect to solution qualityand computation time, are discussed. Finally, in Section 5 we drawconclusions from our research.2. THE THREE-WAY BEST FIT HEURISTICThe original best fit heuristic consists of a preprocessing step, asolution construction and a postprocessing step [9]. In the preprocessingstep, all rectangles are rotated in such a way that theirwidth turns out to be their largest dimension. Next, the rectanglesare or<strong>de</strong>red by <strong>de</strong>creasing width. When this step is finished, thesolution construction begins. In this step the lowest gap, i.e. thelowest sequence of x coordinates with an i<strong>de</strong>ntical height, is locatedusing the sheet skyline. Next the rectangle that fits the widthof this gap best, possibly after rotation, is placed in the gap usinga pre<strong>de</strong>fined placement policy, after which the sheet skyline isup<strong>da</strong>ted. If no rectangle can be found to fit the current gap, theskyline at the gap is raised so that it levels with the lowest of therectangles neighbouring the gap. This process continues until allrectangles are placed on the sheet. After the construction phase,the postprocessing part of the heuristic tries to further improve thesolution quality. This is done by checking if the topmost rectangleis placed in portrait, i.e. it has been rotated. If this is the case, thepostprocessing step tries to improve the solution by rotating therectangle by 90 <strong>de</strong>grees and placing it on the sheet at the lowestpossible level. If this leads to an improvement, the process is repeatedfor the new topmost rectangle. When this procedure doesnot lead to an improvement, or when the topmost rectangle is alreadyoriented in landscape, the postprocessing step terminates.The proposed three-way best fit heuristic adds some additionalsteps to both the preprocessing and the solution construction step.In the preprocessing step, the original best fit heuristic uses a <strong>de</strong>creasingwidth or<strong>de</strong>ring of all rectangles. Therefore, the rectanglesare always selected for placement in a width <strong>de</strong>creasing or<strong>de</strong>r. Wesuggest to add two more or<strong>de</strong>rings to the solution process: <strong>de</strong>creasingheight or<strong>de</strong>r and <strong>de</strong>creasing surface or<strong>de</strong>r. Applying each oneof these or<strong>de</strong>rings ensures a significant disruption of the rectanglesequence compared to the width or<strong>de</strong>ring. The rectangles are alwaysrotated in such a way that their width turns out to be theirlargest dimension before applying any of the three or<strong>de</strong>rings. Thesolution construction will be executed for each or<strong>de</strong>ring individually.With respect to the solution construction step, the original best fitheuristic uses three placement policies: leftmost, tallest and shortestneighbour. Depending on the length of the rectangle that isplaced and the length of the gap <strong>de</strong>fining neighbours, a placementpolicy will <strong>de</strong>ci<strong>de</strong> wether to place the rectangle at the left or theALIO-EURO <strong>2011</strong> – 112


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>right si<strong>de</strong> of the gap. We suggest the addition of three more placementpolicies: rightmost, minimal difference and maximal differenceneighbour. These policies will place the new rectangle respectivelyat the right si<strong>de</strong> of the gap, next to the neighbour withending height closest to the new rectangle and next to the neighbourwith ending height furthest from the new rectangle. An exampleof the minimal and maximal difference placement policiesis shown in Figure 1.MaxDiff policy(a) (b) (c)MinDiff policy(a) (b) (c)Figure 1: Example of the maximal difference policy (top) and minimaldifference policy (bottom).By using both the old and new placement policies and combiningthem with the <strong>de</strong>creasing width, height and surface or<strong>de</strong>rs, we createa very performant extension to the best fit heuristic. We cancall this new heuristic a three-way best fit heuristic as the rectanglesare or<strong>de</strong>red in three different ways during the search fora good solution. In fact, this heuristic solves the problem oncefor each or<strong>de</strong>ring and placement policy combination. Due to itssimple nature and efficient implementation with respect to, for example,overlap checks, the computation times are kept short. Anadvantage of the heuristic is that or<strong>de</strong>rings and placement strategiescan easily be ad<strong>de</strong>d or removed if wanted. For example, whenall shapes un<strong>de</strong>r consi<strong>de</strong>ration are square, it does not make senseto use more than one of the proposed or<strong>de</strong>rs, as they will all resultin the same initial sequence.In some cases, rectangles may have one dimension, we can say therectangle’s width without loss of generality, larger than the sheetwidth. The best fit heuristic will not prioirtise the placement ofthese rectangles, as they can only be placed after rotation. Thelarger the width/length ratio of these rectangles, the higher theirprobability of being among the last rectangles that are placed. Thisbehaviour strongly <strong>de</strong>creases the worst case performance of thebest fit heuristic. Therefore, we propose the addition of one morerule to the three-way best fit heuristic. It rotates all rectangles witha dimension larger than the sheet width, such that their height isthe largest dimension. We apply this rotation after the or<strong>de</strong>ring,such that the rectangle sequence is not changed when compared tothe heuristic without this rotation.3. AN OPTIMAL TIME THREE-WAY HEURISTICImahori and Yagiura [10] analyse the time and space complexityof the original best fit heuristic. They propose alternative <strong>da</strong>tastructures to reduce the time and space complexity, and prove thattheir implementation is optimal. By reducing the time complexityfrom O(n 2 +W) to O(nlogn), they manage to solve instances with2 20 rectangles in un<strong>de</strong>r 10 seconds. In this section, we discuss theapplicability of Imahori and Yagiura’s <strong>da</strong>ta structures to the newthree-way best fit heuristic.In the original best fit heuristic, the sheet skyline is stored in a integerarray, where each element i represents the height of the skylineat width i. The optimal time best fit heuristic stores the sheet skylineusing both a heap and a doubly linked list. This allows for(d)(d)a significant improvement with respect to time complexity whencompared to using the original <strong>da</strong>ta structures [10]. We can now<strong>de</strong>termine the location and size of the lowest available gap in constanttime, while up<strong>da</strong>ting the skyline requires only O(logn) time,which is a great improvement compared to the original approach[9].The original best fit heuristic stores the rectangles in an or<strong>de</strong>redlist, iterating the list for each placement until the best fitting rectangleis found. In the optimal time best fit heuristic, the items arestored in a balanced binary tree based on their width. Both theoriginal item and its rotated copy are placed in this tree, in or<strong>de</strong>rto allow a O(logn) complexity for finding the best fitting rectanglefor the current gap. This balanced tree is however not directly compatiblewith the previously introduced three-way best fit heuristic.This is due to the mismatch between the alternative or<strong>de</strong>rings ofthe items, based on the height or the size of the rectangles, andthe rectangle selection procedure which is based on the width ofthe gap. When using this <strong>da</strong>ta structure combined with a <strong>de</strong>creasingheight or<strong>de</strong>ring, the items will be placed with their height asthe largest dimension. As this portrait placement is not <strong>de</strong>sirablewith respect to solution quality, a more advanced <strong>de</strong>creasing heightor<strong>de</strong>ring must be implemented. This or<strong>de</strong>ring will sort the itemsbased on their height, while making a distinction between normalitems, oriented in landscape, and rotated items that are oriented inportrait. When or<strong>de</strong>ring all the items and their rotated copy usingthis advanced height or<strong>de</strong>ring, the same priority list is created aswhen ‘expanding’ the original height or<strong>de</strong>red list (i.e. adding therotated copies at the correct place in the list). A disadvantage ofthis or<strong>de</strong>ring is its inconsistency with respect to the width of theitems. Therefore it is not possible to use this advanced or<strong>de</strong>ring toobtain the best fitting rectangle in O(logn) time. Instead, the <strong>da</strong>tastructure will return a ‘good’ fitting rectangle, without the guaranteethat no better fitting rectangle is available.The main reason for using the alternative or<strong>de</strong>rings however, isthe strong disruption of the priority sequence generated comparedto using the <strong>de</strong>creasing width or<strong>de</strong>ring. While the optimal time<strong>da</strong>ta structures cause a slightly different disruption compared tousing the original <strong>da</strong>ta structure, their overall solution quality iscomparable. Furthermore, the difference in computation time forlarge problem instances will be huge, as we change from O(n 2 )to O(nlogn) time complexity. Therefore we propose the usage ofthese <strong>da</strong>tastructures in a new optimal time three-way heuristic (noticethe absence of the ‘best fit’ part). With respect to the three-waybest fit heuristic, we will use the O(logn) sheet skyline <strong>da</strong>ta structureto improve its performance, while maintaining the originalrectangle selection procedure.4. COMPUTATIONAL RESULTSWe discuss the performance of the best fit heuristic and its optimaltime variant on a set of benchmark problems from the literature(Table 1). Due to the very large computation times nee<strong>de</strong>d to solvethe i19 and i20 instances from Imahori and Yaguira (2010) for theoriginal and three way best fit heuristic, these instances were onlyused for comparing scalability. All the other experiments ignoredthese instances.Data source #Problems #RectanglesHopper (2000) 70 17 to 199Hopper and Turton (2001) 21 16 to 197Burke et al. (2004) 13 10 to 3152Beasley (1985) 12 10 to 50Imahori and Yagiura (2010) 170 2 4 to 2 20Table 1: Benchmarks from the literature.By combining the different or<strong>de</strong>ring strategies and placement poli-ALIO-EURO <strong>2011</strong> – 113


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>cies into a three-way best fit, we can improve the solution quality.Using the three-way best fit heuristic produces significantly betterresults compared to the original best fit heuristic. Statistical analysisusing a T-test showed a certainty of more than 99.9999% thatthe three-way best fit outperforms the original best fit heuristic.When looking at the optimal time variant, we find the results arenot significantly different from those of the stan<strong>da</strong>rd three way bestfit heuristic (p−value = 0.158). Especially for the larger probleminstances, we can see that both heuristics produce very similar results.This is confirmed by a statistical analysis which shows onlya 70.79% confi<strong>de</strong>nce interval that the heuristics perform significantlydifferent on the instances from Imahori and Yagiura [10].When consi<strong>de</strong>ring the largest problem sizes only, i14 to i18, thisconfi<strong>de</strong>nce interval becomes even smaller (p − value = 0.933).The test set from Imahori and Yagiura [10] contains instances withup to 2 20 rectangles, and allows for an easy comparison of the scalabilityof the different heuristics. Figure 2 shows the computationtimes for the original best fit heuristic, three-way best fit heuristicand optimal time three-way heuristic on this test set. The threewaybest fit heuristic clearly benefits from using the optimized gaplocation process, as the computation times are lower than those ofthe original implementation for all but the largest instances. Notethat the three-way best fit heuristic solves each problem 18 times,which is 6 times more than the original best fit heuristic. We canalso see that using the optimal time implementation [10] makes theheuristic significantly faster for all but the smallest test instances.For instances with 2 18 items, the optimal time three-way heuristicrequires only 1.60% of the time nee<strong>de</strong>d by the original best fitheuristic to solve the same problem, while obtaining a better result.For these instances, the computation time nee<strong>de</strong>d by the optimaltime three way heuristic is only 0.46% of the time nee<strong>de</strong>d by thethree-way best fit heuristic. Furthermore, the optimal time heuristicperforms slightly better than the three-way best fit heuristic onthese instances.Average calculation time (s)1000001000010001001010.10.010.001i4 i5 i6 i7 i8 i9 i10 i11 i12 i13 i14 i15 i16 i17 i18 i19 i20InstanceBestFit3Way3WayOptimalFigure 2: Average computation times of the original best fit, threewaybest fit and optimal time three-way heuristic, for the Imahoriand Yagiura instances.5. CONCLUSIONSIn this abstract we presented several enhancements to the best fitheuristic from Burke et al. [9]. We introduced new placement policiesand additional or<strong>de</strong>rings of the items in or<strong>de</strong>r to obtain bettersolutions for rectangular stock-cutting problem. These enhancementsallow for a significantly better performance compared to theoriginal best fit heuristic, on a large test set from the literature. Asthe addition of the new placement policies and or<strong>de</strong>rings increasedthe computation time of the heuristic, a more efficient implementationof the heuristic was used. The three-way best fit heuristic usesa more efficient way of storing and locating the gaps [10] to reduceits computational complexity. Due to this improvement, thisheuristic has smaller computation times than the original best fitheuristic for all but the largest problem instances. Next, we furtherimproved the scalability of the heuristic, by also applying the rectangleselection procedure from [10]. This resulted in an optimaltime three-way heuristic, with a slightly altered rectangle selectionthat no longer guarantees the selection of the best fitting rectanglefor a given gap. Due to this changed rectangle selection procedure,the heuristic obtains slightly, but not significantly, differentresults than the three way best fit heuristic. The optimal time threewayheuristic is, however, much faster than the three-way best fitheuristic on all but the smallest instances. For instances with 2 18items, the optimal time three-way heuristic requires only 0.46% ofthe time required by the three-way best fit heuristic. Therefore, wepropose the usage of the optimal time three-way heuristic whensmall computation times are important. When the quality of thesolutions is more important than the computation times, combinedusage of both three-way heuristics is advised when no more than2 16 items need to be placed. When more than 2 16 items need tobe placed, the optimal time three-way heuristic is recommen<strong>de</strong>d asit performs best both with respect to average solution quality andcomputation time.6. ACKNOWLEDGEMENTSResearch fun<strong>de</strong>d by a Ph.D. grant of the Agency for Innovation byScience and Technology (IWT)7. REFERENCES[1] H. Dyckhoff, “A typology of cutting and packing problems,”European Journal of Operational Research, vol. 44, no. 2,pp. 145–159, January 1990.[2] G. Wascher, H. Hausner, and H. Schumann, “An improvedtypology of cutting and packing problems,” European Journalof Operational Research, vol. 183, no. 3, pp. 1109–1130,December 2007.[3] M. R. Garey and D. S. Johnson, Computers and Intractability:A Gui<strong>de</strong> to the Theory of NP-Completeness (Series ofBooks in the Mathematical Sciences). W. H. Freeman & CoLtd, January 1979.[4] P. C. Gilmore and R. E. Gomory, “A linear programmingapproach to the cutting-stock problem,” OPERATIONS RE-SEARCH, vol. 9, no. 6, pp. 849–859, November 1961.[5] C. H. Dagli and P. Poshyanon<strong>da</strong>, “New approaches to nestingrectangular patterns,” Journal of Intelligent Manufacturing,vol. 8, no. 3, pp. 177–190, May 1997.[6] B. S. Baker, E. G. Coffman_jr, and R. L. Rivest, “Orthogonalpackings in two dimensions,” SIAM Journal on Computing,vol. 9, no. 4, pp. 846–855, 1980.[7] Chazelle, “The bottomn-left bin-packing heuristic: An efficientimplementation,” IEEE Transactions on Computers,vol. C-32, no. 8, pp. 697–707, August 1983.[8] S. Jakobs, “On genetic algorithms for the packing of polygons,”European Journal of Operational Research, vol. 88,no. 1, pp. 165–181, January 1996.[9] E. K. Burke, G. Ken<strong>da</strong>ll, and G. Whitwell, “A new placementheuristic for the orthogonal stock-cutting problem,” OperationsResearch, vol. 52, pp. 655 – 671, 2004.[10] S. Imahori and M. Yagiura, “The best-fit heuristic for therectangular strip packing problem: An efficient implementationand the worst-case approximation ratio,” Computers &Operations Research, vol. 37, no. 2, pp. 325–333, February2010.[11] A. R. Babu and N. R. Babu, “Effective nesting of rectangularparts in multiple rectangular sheets using genetic andheuristic algorithms.” International Journal of ProductionResearch, vol. 37, no. 7, p. 1625, 1999.ALIO-EURO <strong>2011</strong> – 114


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>[12] E. Hopper and B. Turton, “An empirical investigation ofmeta-heuristic and heuristic algorithms for a 2d packingproblem,” European Journal of Operational Research, vol.128, no. 1, pp. 34–57, January 2001.[13] ——, “A genetic algorithm for a 2d industrial packing problem,”Comput. Ind. Eng., vol. 37, no. 1-2, pp. 375–378, 1999.[14] B. Kroger, “Guillotineable bin packing: A genetic approach,”European Journal of Operational Research, vol. 84,no. 3, pp. 645–661, August 1995.[15] E. K. Burke, G. Ken<strong>da</strong>ll, and G. Whitwell, “A simulated annealingenhancement of the best-fit heuristic for the orthogonalstock-cutting problem,” INFORMS Journal on Computing,vol. 21, no. 3, pp. 505–516, February 2009.ALIO-EURO <strong>2011</strong> – 115


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Bi-dimensional Bin-Packing Problem: A Multiobjective ApproachA. Fernán<strong>de</strong>z ∗ C. Gil ∗ R. Baños ∗ A. L. Márquez ∗ M.G. Montoya ∗M. Parra ∗∗ University of AlmeríaCarretera <strong>de</strong> Sacramento s/n, Caña<strong>da</strong> <strong>de</strong> San Urbano, 04120 Almería, Spain{af<strong>de</strong>zmolina, cgilm, rbanos, almarquez, dgil, mariaparra}@ual.esABSTRACTThe bin-packing problem (BPP) and its multi-dimensional variants,have a large number of practical applications, including productionplanning, project selection, multiprocessor scheduling, packingobjects in boxes, etc. The two-dimensional bin packing(2D-BPP) consists of packing a collection of objects (pieces) inthe minimum number of bins (containers). This paper works withan extending of the classical single-objective formulation to copewith other <strong>de</strong>signing objectives. It presents a new multi-objectivememetic algorithm that uses a population of individuals (agents)that are optimized using evolutionary operators (mutation and crossover)and a local-search optimizer specially <strong>de</strong>signed to solve theMO-2DBPP. The Pareto-optimization concept is used in the selectionprocess. Results obtained in several test problems show thegood performance of the memetic algorithm in comparison withother previously proposed approaches.Keywords: Two-dimensional bin packing problem, Memetic algorithm,Multi-objective optimization1. INTRODUCTIONThe bin-packing problem (BPP) and its multi-dimensional variants,have a large number of practical applications in industry (e.g.cutting stock), in computer systems (e.g. assignment of segmentsof track on disks), in machine scheduling (e.g. minimizing thenumber of machines necessary for completing all tasks by a given<strong>de</strong>adline), etc. [1]. The traditional two-dimensional BPP (2DBPP)[2] consists of packing a collection of objects, characterized byhaving different heights and widths, in the minimum number ofbins (containers). The family of bin packing problems is inclu<strong>de</strong>din the category of NP-hard problems [3], which implies that thereis no known method to obtain the optimal solution in a polynomialtime. Recently, some authors have proposed multi-objectiveformulations of the 2DBPP (MO-2DBPP) that consi<strong>de</strong>r other objectivesto minimize in addition to the number of bins. One ofthese multi-objective formulations with applications in containerloading, tractor trailer trucks, pallet loading, cargo airplanes, etc.consists of minimizing not only the number of bins used to storethe pieces, but also the imbalance of the objects according to thecentre of gravity of the bin. This paper presents a new multiobjective[4] memetic algorithm that uses a population of individuals(agents) that are optimized using evolutionary operators(mutation and crossover) and a local-search optimizer specially <strong>de</strong>signedto solve the MO-2DBPP. The Pareto-optimization concept[5] is used in the selection process.2. MULTI-OBJECTIVE TWO-DIMENSIONALBIN-PACKING PROBLEMMost papers <strong>de</strong>aling with the 2DBPP try to solve single-objectiveformulations, where the aim is to minimize the number of binsnee<strong>de</strong>d to pack all the objects. Recently, other authors have proposedsimultaneously optimizing other objectives. In particular,Liu et al. [6] applied particle swarm optimization to solve themulti-objective two-dimensional bin packing problem (MO-2D-BPP), by consi<strong>de</strong>ring minimizing, not only the number of bins,but also the imbalance of the bins according to a centre of gravity.This formulation is <strong>de</strong>scribed as follows: Given a set of n rectangularobjects where h i , w i , and γ i are the height, width and weightof object i, respectively (i=1,2,. . . ,n), and given an unlimited numberof bins, all of which have a height H, width W and centre ofgravity (λ x ,λ y ) the goal is to insert all the objects without overlapin the minimum number of bins (nBIN), with the centre of gravity(CG) of the bins as close as possible to the <strong>de</strong>sired CG. The <strong>de</strong>siredCG in this case is the bottom of the bin, and therefore, the aim isto minimize the average eucli<strong>de</strong>an distance d i between the CG ofthe objects stored in the bin with respect to the CG of the bin. The<strong>de</strong>finition of the centre of gravity is provi<strong>de</strong>d below:where:CG = 1 nBin√nBin∑ (λ x, j − λ d,x ) 2 + (λ y, j ) 2 (1)j=1λ x, j = ∑n i=1 X i jx i γ i∑ n i=1 γ iλ y, j = ∑n i=1 X i jy i γ i∑ n i=1 γ ih i , w i , and γ i : height, width and weight of item i;x i and y i : center of gravity of item i in positions x and y;X i j ∈ {0,1}, where i = {1,...,I}, j = {1,...J}. If item j is assignedto bin i, X i j = 1, otherwise X i j = 0;H and W: height and width of bins;(λ x, j ,λ y, j ): coordinates of the centre of gravity of bin j;λ d,x : <strong>de</strong>sired center of gravity of bin i in direction x.CG: balance of the bins according to a centre of gravity (objective2);In or<strong>de</strong>r to minimize the load balancing of an individual, the fitnessfunction used <strong>de</strong>termines the average balancing of each bin, takinginto account the sum of the Eucli<strong>de</strong>an distances from the centre ofeach object to the <strong>de</strong>sired CG of the bin, and taking into accounttheir weight. Figure 1 offers a graphical <strong>de</strong>scription of this secondobjective in a bin which contains a single object.(2)ALIO-EURO <strong>2011</strong> – 116


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Class 1 2 3 4 5 6h i , w i [0,100] [0,25] [0,50] [0,75] [25,75] [25,50]γ i[0,20] for instances 1-10 of each class;and [0,100] for instances 11-20Table 1: Test benchmarks generated for solving the MO-2DBPP.Figure 1: Graphical representation of load balancing.2.1. Description of the operators used in MA2dbppFour different mutation operators are used in or<strong>de</strong>r to insert objectsin the bins using the list of available rectangular spaces. Oneof these operators (mutation 4) takes some i<strong>de</strong>as of the strategyrecently proposed by Grunert <strong>da</strong> Fonseca and Fonseca [7] that isbased on performing a permutation between two objects of differentbins such that the variation is smaller than when a single objectis moved from one bin to another one.• Mutation1: an object is randomly taken from one bin and itis stored in another randomly chosen one only if the availablespace is large enough. If all the bins have been visite<strong>da</strong>nd the storage has not been possible, the object is not inserted.• Mutation2: an object is randomly chosen from the bin withmost available space, and it is stored in another randomlychosen bin only if there is free space. If all the bins havebeen visited and the storage has not been possible, the objectis not inserted.• Mutation3: an object is randomly chosen from the bin withmost available space, and it is stored in the empties remainingbin only if the available space is large enough. If all thebins have been tried, and the storage has not been possible,the object is inserted in a new bin in the lower left corner.• Mutation4: two objects are randomly taken from differentbins and are swaped only if there are free space in the bins.test instances with 500 pieces <strong>de</strong>scribed above. The memetic algorithmwas executed with a stop criterion of 1000 generations and apopulation size of 500 agents.To compare the different fronts, we use a coverage metric [9]. Thecoverage C(A,B) computes the relative number of points in set Bdominated by the points in set A.C(A,B) =|{b ∈ B | ∃a ∈ A : a ≺ b}||B|To show the good performance of the algorithm MOMA-2D-BPP,it was compared with a recent evolutionary multi-objective particleswarm optimization algorithm called MOEPSO [6]. Figure2 shows the Pareto fronts generated by these algorithms for a selectedset of instances. It can be observed that most of the solutionsof the non-dominated sets obtained by MOMA-2DBPP arebelow those obtained by MOEPSO, i.e. MOMA-2DBPP obtainsbetter approximations to the true (unknown) Pareto-optimal front,although MOEPSO obtains more extreme solutions in some testinstances.(3)The selection of agents is carried out by applying tournaments usingPareto-dominance relations [5]. The crossover operator worksby taking two random agents (A1, A2) as parents, and creating achild agent (CH) by consi<strong>de</strong>ring bins of both parents. In particular,CH takes the fullest bin of A1, plus the bins of A2, but discardingthe objects already taken from A1 in or<strong>de</strong>r not to duplicate objects.Finally, a new local optimizer is also consi<strong>de</strong>red with the aim ofreducing the number of bins. This task takes the most occupiedbin and tests each available space to <strong>de</strong>termine whether or not anobject from the remaining bins can fit.3. EXPERIMENTAL RESULTSA set of instances proposed by Berkey and Wang [8] have beenused to compare the algorithms. A total of six classes with 20 instanceseach are randomly generated to <strong>de</strong>termine the performanceof the multi-objectives memetic algorithms. The weight γ i of eachpiece randomly generated in different ranges, has been ad<strong>de</strong>d tothe benchmark set, as table 1 shows. For each instance, there are500 items to be packed.The performance of the multi-objective memetic algorithm (MO-MA-2DBPP) has been compared with other algorithms, using theFigure 2: Pareto front of MOMA-2DBPP and MOEPSO.Table 2 shows a comparison of both algorithms for previous instances.The coverage metric has been used to compare the Paretofronts generated by each algorithm. MOMA-2DBPP algorithmachieves better results than MOEPSO for the two instances, sincethe coverage metric of the memetic algorithm is higher thanMOEPSO in both instances which reinforces the previous conclusionsobtained from the graphics displayed above.ALIO-EURO <strong>2011</strong> – 117


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>class_3_9_19 class_3_10_20MOEPSO MOMA MOEPSO MOMAMOEPSO - 0.20 - 0.05MOMA 0.52 - 0.77 -Table 2: Comparison between MOEPSO and MOMA-2BPP interms of coverage metric.4. CONCLUSIONThis paper presents a memetic algorithm that aims to improve theperformance of other published algorithms when solving singleobjectiveand multi-objective formulations of the two-dimensionalbin-packing problem with rotations. The memetic algorithm hereimplemented uses several search operators specifically <strong>de</strong>signed tosolve this problem. The multi-objective implementation, MOMA-2DBPP is compared with a multi-objective particle swarm optimizationalgorithm, MOEPSO. Results obtained in the multiobjectiveformulation show the good behavior of MOMA-2DBPP,which obtains better results than MOEPSO in terms of coveragemetric. The results obtained by the memetic algorithm of this complexproblem reinforce the previous conclusions of other authorsabout the good performance of this meta-heuristic to solve NPhardoptimization problems. Future research should be focused onextending the memetic algorithm for the three-dimensional variantsof bin-packing [10], which also have many practical applicationsin real problems. Despite that, the load balancing in twodimensions can be applied to real world problems, where heightdoes not influence, for instance the storage of pallets.4.1. AcknowledgementsThis work has been financed by the Spanish Ministry of Innovationand Science (TIN2008-01117) and the Excellence Project of Junta<strong>de</strong> An<strong>da</strong>lucía (P07-TIC02988), in part financed by the EuropeanRegional Development Fund (ERDF).5. REFERENCES[1] H. L. Ong, M. J. Magazine, and T. S. Wee, “Probabilisticanalysis of bin packing heuristics,” OPERATIONSRESEARCH, vol. 32, no. 5, pp. 983–998, 1984. [Online].Available: http://or.journal.informs.org/cgi/content/abstract/32/5/983[2] E. Hopper and B. C. H. Turton, “An empiricalinvestigation of meta-heuristic and heuristic algorithmsfor a 2d packing problem,” European Journalof Operational Research, vol. 128, no. 1, pp.34 – 57, 2001. [Online]. Available: http://www.sciencedirect.com/science/article/B6VCT-41Y1XYH-3/2/73392e0f11c162878430f67e02d8349d[3] M. R. Garey and D. S. Johnson, Computers and Intractability:A Gui<strong>de</strong> to the Theory of NP-Completeness(Series of Books in the Mathematical Sciences), firstedition ed. W. H. Freeman & Co Ltd, January 1979.[Online]. Available: http://www.amazon.com/exec/obidos/redirect?tag=citeulike07-20&path=ASIN/0716710455[4] P. J. F. Carlos M. Fonseca, “Genetic algorithms for multiobjectiveoptimization: Formulation, discussion and generalization,”pp. 416–423, 1993.[5] D. E. Goldberg, “Genetic algorithms in search, optimizationand machine learning,” 1989.[6] D. Liu, K. Tan, C. Goh, and W. Ho, “On solving multiobjectivebin packing problems using particle swarm optimization,”in Evolutionary Computation, 2006. CEC 2006. IEEECongress on, 2006, pp. 2095 –2102.[7] C. F. V. Grunert <strong>da</strong> Fonseca, “The attainment-function approachto stochastic multiobjective optimizer assessment andcomparison,” in Experimental Methods for the Analysis ofOptimization Algorithms, T. Bartz-Beielstein, Ed. Springer,[2010 to apperar].[8] J. O. Berkey and P. Y. Wang, “Two-dimensional finite binpackingalgorithms,” The Journal of theOperational ResearchSociety, vol. 38, no. 5, pp. 423–429, May, 1987.[9] E. Zitzler, “Evolutionary Algorithms for MultiobjectiveOptimization: Methods and Applications,” Ph.D. dissertation,ETH Zurich, Switzerland, 1999. [Online]. Available:http://www.tik.ethz.ch/~sop/publications/[10] A. Lodi, S. Martello, and D. Vigo, “Heuristic algorithmsfor the three-dimensional bin packing problem,” EuropeanJournal of Operational Research, vol. 141, no. 2, pp.410–420, September 2002. [Online]. Available: http://i<strong>de</strong>as.repec.org/a/eee/ejores/v141y2002i2p410-420.htmlALIO-EURO <strong>2011</strong> – 118


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>A recursive partitioning approach for generating unconstrained two-dimensionalnon-guillotine cutting patternsErnesto G. Birgin ∗ Rafael D. Lobato ∗ Reinaldo Morabito †∗ Department of Computer Science, Institute of Mathematics and Statistics, University of São PauloRua do Matão 1010, Ci<strong>da</strong><strong>de</strong> Universitária, 05508-090 São Paulo, SP, Brazil{egbirgin,lobato}@ime.usp.br† Department of Production Engineering, Fe<strong>de</strong>ral University of São CarlosVia Washington Luiz km. 235, 13565-905, São Carlos, SP, Brazilmorabito@ufscar.brABSTRACTIn this study, a dynamic programming approach to <strong>de</strong>al with theunconstrained two-dimensional non-guillotine cutting problem ispresented. The method extends the recently introduced recursivepartitioning approach for the manufacturer’s pallet loading problem.The approach involves two phases and uses bounds based onunconstrained two-staged and non-staged guillotine cutting. Themethod is able to find the optimal cutting pattern of a large numberof problem instances of mo<strong>de</strong>rate sizes known in the literatureand a counterexample for which the approach fails to find knownoptimal solutions was not found. For the instances that the requiredcomputer runtime is excessive, the approach is combinedwith simple heuristics to reduce its running time. Detailed numericalexperiments show the reliability of the method.Keywords: Cutting and packing, Two-dimensional non-guillotinecutting pattern, Dynamic programming, Recursive approach, Distributor’spallet loading problem1. INTRODUCTIONIn the present paper, we study the generation of two-dimensionalnon-guillotine cutting (or packing) patterns, also referred by someauthors as two-dimensional knapsack problem or two-dimensionaldistributor’s pallet loading. This problem is classified as 2/B/O/according to Dyckhoff’s typology of cutting and packing problems[1], and as two-dimensional rectangular Single Large ObjectPacking Problem (SLOPP) based on Waescher et al.’s typology [2].Besi<strong>de</strong>s the inherent complexity of this problem (it is NP-hard [3]),we are also motivated by its practical relevance in different industrialand logistics settings, such as in the cutting of steel and glassstock plates into required sizes, the cutting of wood sheets and textilematerials to make or<strong>de</strong>red pieces, the loading of different itemson the pallet surface or the loading of different pallets on the truckor container floor, the cutting of cardboards into boxes, the placingof advertisements on the pages of newspapers and magazines,the positioning of components on chips when <strong>de</strong>signing integratedcircuit, among others.Given a large rectangle of length L and width W (i.e. a stockplate), and a set of rectangular pieces grouped into m differenttypes of length l i , width w i and value v i , i = 1,...,m (i.e. theor<strong>de</strong>red items), the problem is to find a cutting (packing) patternwhich maximizes the sum of the values of the pieces cut (packed).The cutting pattern is referred as two-dimensional since it involvestwo relevant dimensions, the lengths and widths of the plate andpieces. A feasible two-dimensional pattern for the problem is onein which the pieces placed into the plate do not overlap each other,they have to be entirely insi<strong>de</strong> the plate, and each piece must haveone edge parallel to one edge of the plate (i.e., an orthogonal pattern).In this paper we assume that there are no imposed lower orupper bounds on the number of times that each type of piece can becut from the plate; therefore, the two-dimensional pattern is calledunconstrained.Without loss of generality, we also assume that the cuts are infinitelythin (otherwise we consi<strong>de</strong>r that the saw thickness wasad<strong>de</strong>d to L, W, l i , w i ), the orientation of the pieces is fixed (i.e.,a piece of size (l i ,w i ) is different from a piece of size (w i ,l i ) ifl i ≠ w i ) and that L, W, l i , w i are positive integers. We note that ifthe 90 ◦ -rotation is allowed for cutting or packing the piece type i ofsize (l i ,w i ), this situation can be handled by simply consi<strong>de</strong>ring afictitious piece type m+i of size (w i ,l i ) in the list of or<strong>de</strong>red items,since the pattern is unconstrained. Depending on the values v i , thepattern is called unweighted, if v i = γl i w i for i = 1,...,m and γ > 0(i.e., proportional to the area of the piece), or weighted, otherwise.Moreover, we assume that the unconstrained two-dimensional cuttingpattern is non-guillotine as it is not limited by the guillotinetype cuts imposed by some cutting machines.In the present paper we extend a Recursive Partitioning Approachpresented in [4] for the manufacturer’s pallet loading to <strong>de</strong>al withthe unconstrained two-dimensional orthogonal non-guillotine cutting(unweighted and weighted, without and with piece rotation).This Recursive Partitioning Approach combines refined versionsof both the Recursive Five-block Heuristic presented in [5, 6] andthe L-approach for cutting rectangles from larger rectangles andL-shaped pieces presented in [7, 8]). This combined approach alsouses bounds based on unconstrained two-staged and non-stagedguillotine cutting patterns. The approach was able to find an optimalsolution of a large number of problem instances of mo<strong>de</strong>ratesizes known in the literature and we were unable to find an instancefor which the approach fails to find a known or proved optimal solution.For the instances that the required computer runtimes wereexcessive, we combined the approach with simple heuristics to reduceits running time.2. DESCRIPTION OF THE ALGORITHMThe Recursive Partitioning Algorithm presented here is an extensionof the algorithm <strong>de</strong>scribed in [4] for the manufacturer’s palletloading problem. It has basically two phases: in phase 1 it appliesa recursive five-block heuristic based on the procedure presentedin [5] and in phase 2 it uses an L-approach based on a dynamic programmingrecursive formula presented in [7, 8]. Firstly, phase 1 isexecuted and, if a certificate of optimality is not provi<strong>de</strong>d by theALIO-EURO <strong>2011</strong> – 119


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Recursive Five-block Heuristic, then phase 2 is executed. Additionally,information obtained in phase 1 is used in phase 2 in atleast two ways, according to [4]. If an optimal solution was alreadyfound for a subproblem in phase 1, it is not solved again inphase 2, improving the performance of phase 2. Moreover, havingthe information obtained in phase 1 at hand, phase 2 is often ableto obtain better lower bounds for its subproblems than the onesprovi<strong>de</strong>d by homogeneous cuttings, therefore improving the performanceof phase 2. These two phases are <strong>de</strong>tailed in the sequel.2.1. Phase 1In phase 1, the Recursive Five-block Heuristic divi<strong>de</strong>s a rectangleinto five (or less) smaller rectangles in a way that is called firstor<strong>de</strong>rnon-guillotine cut [9]. Figure 1 illustrates this kind of cutrepresented by a quadruple (x 1 ,x 2 ,y 1 ,y 2 ), such that 0 ≤ x 1 ≤ x 2 ≤L and 0 ≤ y 1 ≤ y 2 ≤ W. This cut <strong>de</strong>termines five subrectangles(L 1 ,W 1 ),...,(L 5 ,W 5 ) such that L 1 = x 1 , W 1 = W −y 1 , L 2 = L−x 1 ,W 2 = W − y 2 , L 3 = x 2 − x 1 , W 3 = y 2 − y 1 , L 4 = x 2 , W 4 = y 1 , L 5 =L − x 2 and W 5 = y 2 . Each rectangle is recursively cut unless the(sub)problem related to this rectangle has already been solved.2 W 2y 2W 1 13y 15 WW 454(0,0) x 1 x 2 (0,0) L 4 L 5(a)Figure 1: Representation of a first-or<strong>de</strong>r non-guillotine cut.2.2. Phase 2Phase 2 of the Recursive Partitioning Approach applies the L-approach [7, 8, 4] which is based on the computation of a dynamicprogramming recursive formula [7]. This procedure divi<strong>de</strong>sa rectangle or an L-shaped piece into two L-shaped pieces. An L-shaped piece is <strong>de</strong>termined by a quadruple (X,Y,x,y), with X ≥ xand Y ≥ y, and is <strong>de</strong>fined as the topological closure of the rectanglewhose diagonal goes from (0,0) to (X,Y ) minus the rectanglewhose diagonal goes from (x,y) to (X,Y ). Figure 2 <strong>de</strong>picts thenine possible divisions [4] of a rectangle or an L-shaped piece intotwo L-shaped pieces.2.3. Heuristics for large problemsThe generation of all patterns by the Recursive Partitioning Approachmay be prohibitive for large instances. Moreover, the amountof memory required by these algorithms may not be available. Forthis reason, we propose heuristics that reduce both the time andmemory requirements of the algorithms. These procedures, however,may lead to a loss of quality of the solution found. Since thetime and memory complexities of generating all possible cuttingshighly <strong>de</strong>pends on the sizes of the integer conic combinations andraster points sets, we can significantly reduce time and memoryrequirements in two ways: (i) by limiting the search <strong>de</strong>pth of therecursions; and (ii) by replacing the integer conic combinationsand raster points sets by smaller sets.L 1(b)L 2(0, 0)(0, 0)(X, Y )(X, Y )L 1 (x, y)(x ′ , y ′ )L 2 (x, y)L 1 (x, y)(x ′ , y ′ )(x ′ , y ′ )L 2L 2 L 1(0, 0)(0, 0)B 1 B 2 B 3(X, Y )(X, Y )(X, Y )(X, Y )(x, y)L 1 (x, y)(x ′ , y ′ )(x ′ , y ′ )(x ′ , y ′ ) L 2L 1 L L 2 L 1 2(x ′′ , y ′ )(0, 0)(0, 0)(0, 0)B 4 B 5 B 6(X, Y )(X, Y )(X, Y )(x ′ , y ′′ )(x ′ , y ′ )L 1 L2 L(x, y)1L 1 (x, y)(x ′ , y ′ )L 2 L (x ′ , y ′ 2 )(0, 0)(0, 0)B 7 B 8 B 9Figure 2: Subdivisions of an L-shaped piece into two L-shapedpieces.3. NUMERICAL EXPERIMENTSWe implemented the Recursive Partitioning Approach and its heuristiccounterpart for the unconstrained two-dimensional non-guillotinecutting problem. The algorithms were co<strong>de</strong>d in C/C++ language.The computer implementation of the algorithms as well asthe <strong>da</strong>ta sets used in our experiments and the solutions found arepublicly available for benchmarking purposes at [10]. In the numericalexperiments, we consi<strong>de</strong>red 95 problem instances foundin the literature. Extensive numerical experiments evaluating theproposed method can be found in [11], where the whole materialof the present exten<strong>de</strong>d abstract is present in <strong>de</strong>tail.4. CONCLUDING REMARKSWhile a large number of studies in the literature have consi<strong>de</strong>redstaged and non-staged two-dimensional guillotine cutting problems,much less studies have consi<strong>de</strong>red two-dimensional nonguillotinecutting problems (constrained and unconstrained), andonly a few of them have proposed exact methods to generate nonguillotinepatterns. Moreover, most of the approaches (exact andheuristic) for non-guillotine cutting (or packing) were <strong>de</strong>velopedfor the constrained problem, which can be more interesting forcertain practical applications with relatively low <strong>de</strong>mands of theor<strong>de</strong>red items. However, part of these methods may not performwell when solving the unconstrained problem. On the other hand,the unconstrained problem is particularly interesting for cuttingstock applications with large-scale production and weakly heterogeneousitems, in which the problem plays the role of a columngeneration procedure.This study presented a Recursive Partitioning Approach to generateunconstrained two-dimensional non-guillotine cutting (or packing)patterns. The approach was able to find the optimal solutionof a large number of mo<strong>de</strong>rate-sized instances known in the literatureand we were unable to find a counterexample for which theapproach fails to find a known optimal solution. To cope with largeinstances, we combined the approach with simple heuristics to reduceits computational efforts. For mo<strong>de</strong>rate-sized instances, boththe five-block and the L-Algorithm phases of the approach seemto be promising alternatives for obtaining reasonably good or opti-ALIO-EURO <strong>2011</strong> – 120


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>mal non-guillotine solutions un<strong>de</strong>r affor<strong>da</strong>ble computer runtimes,whereas for larger instances, the guillotine or the five-block phasemay be preferable, <strong>de</strong>pending on the <strong>de</strong>finition of an acceptabletime limit. An interesting perspective for future research is to extendthe Recursive Partitioning Approach to <strong>de</strong>al with constrainedtwo-dimensional non-guillotine cutting.5. REFERENCES[1] H. Dyckhoff, “A typology of cutting and packing problems,”European Journal of Operational Research, vol. 44, pp. 145–159, 1990.[2] G. Wäescher, H. Haußner, and H. Schumann, “An improvedtypology of cutting and packing problems,” European Journalof Operational Research, vol. 183, pp. 1109–1130, 2007.[3] J. E. Beasley, “A population heuristic for constrained twodimensionalnon-guillotine cutting,” European Journal ofOperational Research, vol. 156, pp. 601–627, 2004.[4] E. G. Birgin, R. D. Lobato, and R. Morabito, “An effectiverecursive partitioning approach for the packing of i<strong>de</strong>nticalrectangles in a rectangle,” Journal of the Operational ResearchSociety, vol. 61, pp. 306–320, 2010.[5] R. Morabito and S. Morales, “A simple and effective recursiveprocedure for the manufacturer’s pallet loading problem,”Journal of the Operational Research Society, vol. 49,pp. 819–828, 1998.[6] ——, “Erratum to ’A simple and effective recursive procedurefor the manufacturer’s pallet loading problem’,” Journalof the Operational Research Society, vol. 50, pp. 876–876,1999.[7] L. Lins, S. Lins, and R. Morabito, “An L-approach for packing(l,w)-rectangles into rectangular and L-shaped pieces,”Journal of the Operational Research Society, vol. 54, pp.777–789, 2003.[8] E. G. Birgin, R. Morabito, and F. H. Nishihara, “A note onan L-approach for solving the manufacturer’s pallet loadingproblem,” Journal of the Operational Research Society,vol. 56, pp. 1448–1451, 2005.[9] M. Arenales and R. Morabito, “An and/or-graph approach tothe solution of two-dimensional non-guillotine cutting problems,”European Journal of Operational Research, vol. 84,pp. 599–617, 1995.[10] “http://www.ime.usp.br/∼egbirgin/packing/.”[11] E. G. Birgin, R. D. Lobato, and R. Morabito, “Generating unconstrainedtwo-dimensional non-guillotine cutting patternsby a recursive partitioning algorithm,” Journal of the OperationalResearch Society, <strong>2011</strong>, to appear.ALIO-EURO <strong>2011</strong> – 121


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>A Complete Search Method For Relaxed Traveling Tournament ProblemFilipe Brandão ∗ João Pedro Pedroso ∗ †∗ <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> Ciências, Universi<strong>da</strong><strong>de</strong> do PortoRua do Campo Alegre, 4169-007 Porto, Portugalf<strong>da</strong>bran<strong>da</strong>o@dcc.fc.up.pt jpp@fc.up.pt† INESC PortoRua Dr. Roberto Frias 378, 4200-465 Porto, PortugalABSTRACTThe Traveling Tournament Problem (TTP) is a sports schedulingproblem that inclu<strong>de</strong>s two major issues in creating timetables:home/away pattern feasibility and travel distance. In this problemthe schedule must be compact: every team plays in every time slot.However, there are some sports leagues that have both home/awaypattern restrictions and distance limits, but do not require a compactschedule. In such schedules, one or more teams can have abye in any time slot. This leads us to a variant of the problem:the Relaxed Traveling Tournament Problem (RTTP). We presenta complete search method to solve this problem based on branchand-bound,metaheuristics and dynamic programming.Keywords: Complete search, Dynamic programming, Metaheuristics,Branch-and-bound1. INTRODUCTIONThe advances in mo<strong>de</strong>ling the combinatorial structure of sportsschedules and their solution, together with the increasing practicalrequirements for schedules by real sports leagues has increased theinterest in computational methods for creating them.The key issues for constructing a schedule are travel distance andhome/away pattern restrictions. While teams wish to reduce thetotal amount they travel, they are also concerned with more traditionalissues with respect with home and away patterns.The Traveling Tournament Problem (TTP) abstracts the key issuesin creating a schedule that combines home/away pattern constraintsand travel distance minimization. Either home/away patternconstraints and travel distance minimization are reasonablyeasy to solve, but the combination of them makes this problemvery difficult. This problem was proposed in [1].In TTP the schedule must be compact: every team plays in everytime slot; however, there are some sports leagues that have bothhome/away pattern restrictions and distance limits, but do not requirea compact schedule. This leads us to a new problem: theRelaxed Traveling Tournament Problem. This variant of the TTPwas proposed by Renjun Bao and Michael Trick [2]. As in thisvariant the schedule is not compact, teams have byes (i.e., slotswhere they do not play) in their schedule. The objective is to minimizethe travel distance, and the teams are allowed to have a fixednumber K of byes.2. THE TRAVELING TOURNAMENT PROBLEMIn the Traveling Tournament Problem, there is a even number n ofteams, each with a home venue. The teams wish to play a roundrobin tournament, whereby each team will play against every otherteam twice, once at each team’s home venue. This means that2(n − 1) slots, or time periods, are required to play a double roundrobin tournament. There are exactly 2(n − 1) time slots availableto play these games, so every team plays in every time slot. Associatedwith a TTP instance is a n by n distance matrix D, whereD i j is the distance between the venues of team i and team j.Each team begins at its home site and travels to play its gamesat the chosen venues. At the end of the schedule each team thenreturns (if necessary) to its home site.Consecutive games for a team constitute a road trip; consecutivehome games are a home stand. The length of a road trip or homestand is the number of opponents played (not the travel distance).The TTP is <strong>de</strong>fined as follows:Input: n, the number of teams; D, an n by n symmetrical distancematrix; l, u integer parameters.Output: A double round robin tournament on the n teams suchthat:• the length of every home stand and road trip is between land u inclusive;• games between the same opponents cannot happen in consecutivetime slots, which is called no repeater constraint;• the total distance traveled by the teams is minimized.The parameters l and u <strong>de</strong>fine the tra<strong>de</strong>-off between distance andpattern consi<strong>de</strong>rations. For l = 1 and u = n − 1, a team may take atrip equivalent to a traveling salesman tour. For small u, teamsmust return home often, so the distance traveled will increase.Usually l = 1 and u = 3, which means that each team cannot playmore than three consecutive home games or three consecutive roadgames.The solution of the TTP has proven to be a computational difficultchallenge. For many years, the six-team instance NL6, availablein [3], was the largest instance solved to a provable optimum. In2008, NL8 was solved; NL10 was solved in 2009. This leavestwelve teams as the next unsolved instance, which is a significantlysmall league size for such a simple problem <strong>de</strong>scription.ALIO-EURO <strong>2011</strong> – 122


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>3. THE RELAXED TRAVELING TOURNAMENTPROBLEMThe goal in the TTP is to find a compact schedule: the number oftime slots is equal to the number of games each team plays. Thisforces every team to play in every time slot. However, there aresome sports leagues that have both home/away pattern restrictionsand distance limits but do not require a compact schedule. In suchschedules, one or more teams can have a bye in any time slot. Thisleads us to the Relaxed Traveling Tournament Problem (RTTP).In this variant of the TTP, instead of fixing the schedule length tobe 2(n − 1), we let the schedule length be 2(n − 1) + K for someinteger K ≥ 0. For a given K, the problem is called K-RTTP. ForK = 0, the RTTP is just the TTP. For K > 0, each team has K slotsin which it does not play.Byes are ignored in <strong>de</strong>termining the length of a homestand or roadtrip,and in <strong>de</strong>termining whether a repeater has occured. This allowsthat TTP’s solutions are feasible for the K-RTTP for everyK ≥ 0 (in fact, K 1 -RTTP’s solutions are feasible for K 2 -RTTP ifK 1 ≤ K 2 ).4. SOLUTION METHODOLOGYFor solving the RTTP one has to <strong>de</strong>al both with feasibility concerns(the home and away pattern) and optimization concerns (the traveldistance); this combination makes this problem very difficult tosolve to a provable optimal.One of the most successful methods of solving the TTP is an algorithmwhich combines an iterative <strong>de</strong>epening algorithm [4] with<strong>de</strong>pth-first branch-and-bound [5]. Other approaches inclu<strong>de</strong> a simulate<strong>da</strong>nnealing metaheuristic [6], representing the problem withhard and soft constraints, and exploring both feasible and infeasibleschedules based on a large neighborhood.Our solution methodology for RTTP is a complete search-method,putting in place several tools: branch-and-bound (the main method),metaheuristics (for trying to improve bounds), and dynamic programming(to compute lower bounds quickly). The way we combinedthese tools is <strong>de</strong>scribed below in Algorithm 1.So far, the largest instance solved to a provable optimal was NL4;our method allowed us to solve NL6 very quickly and NL8. Forlarger instances, the method was unable to reach solutions betterthan the best known solutions for the TTP.Algorithm 1: Hybrid RTTP-Solver1: UB ← ∞2: S ←[empty schedule]3: while not empty(S) do4: u ← pop(S)5: if final(u) then6: v ← hill-climbing(u)7: if cost(v) < UB then8: UB ← cost(v)9: end if10: else if cost(u)+ILB(u) < UB then11: for all v ∈ branch(u) do12: push(S, v)13: end for14: end if15: end while4.1. Branch-and-boundIf solutions for the RTTP are generated team by team (i.e., fix allthe games of a team before moving to other team), it becomes verydifficult to check all the constraints of the problem. E.g., when wefix a game for a team, we are also fixing a game for another team(the first’s opponent) in the same round; however we can not apply,for example, the restriction of home/away pattern to the opponentteam, due to not having information about previous games.Therefore, solutions are generated round by round: all the gamesof one round are fixed before moving to the subsequent round.The advantage of this or<strong>de</strong>r is that we can verify restrictions earlier,avoiding the exploration of significant parts of the branch-andboundtree.To enumerate solutions we use the following method1. start at the first round;2. for each team, if a game is not scheduled yet, pick eachpossible opponent, and try to schedule a game;3. after trying all opponents, try to use a bye;4. when the schedule for the current round is complete, repeatthis process in the following round, until completing theschedule.For trimming off non-optimal candi<strong>da</strong>tes from the branch-andboundtree, we use the current cost plus the In<strong>de</strong>pen<strong>de</strong>nt LowerBound (ILB) for the remaining games of each team, as <strong>de</strong>scribedbelow.4.2. In<strong>de</strong>pen<strong>de</strong>nt Lower Bound and Dynamic ProgrammingIf we calculate the optimal schedule (that minimizes travel distance)for one team without taking into account the other teams’schedule, we have a lower bound to the distance traveled by thatteam. The sum over the n teams of the distances associated withtheir in<strong>de</strong>pen<strong>de</strong>nt optimal schedule provi<strong>de</strong>s a simple but stronglower bound. This is called In<strong>de</strong>pen<strong>de</strong>nt Lower Bound (ILB) aswas first proposed in [7].To calculate this lower bound, we need to know: the team, the currentlocation, the number of remaining home games, the list of remainingaway games, the current number of consecutive home/awaygames. This information can be used as the state in dynamic programming.Exploiting some symmetries, a small table suffices forholding this information; e.g., a 108MB table is enough for thetwelve teams problem NL12, and it can be computed very quickly.4.3. MetaheuristicsEverytime we find a new solution insi<strong>de</strong> the branch-and-boundtree, we apply a hill climbing metaheuristic to try to improve bounds.When a local optimum is reached, random perturbations are appliedto the solution; this perturbation and hill climbing process isrepeated a number of times (100, in our experiment).To generate the neighbours for the current solution, we use threefrom the five transformations proposed in [6]. These movementsare:• SwapHomes(T i ,T j ): Given two teams, their home/away rolesin the two scheduled games between them are swapped;• SwapRounds(r k ,r l ): This move swaps rounds r k and r l ;• SwapTeams(T i ,T j ): This move simply swaps the scheduleof teams T i and T j .ALIO-EURO <strong>2011</strong> – 123


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Whenever applying a move leads to an invalid solution, the scheduleis discar<strong>de</strong>d. These three moves are not sufficient for exploringthe entire search space and, as a consequence, they lead to suboptimalsolutions; however, they can lead to better solutions, therebyimproving the upper bound.The use of this metaheuristic to improve bounds is particularlyimportant in big instances, such as NL8, where it allows us toquickly find good solutions sooner, and thus pruning more effectivelythe branch-and-bound tree. Small instances, such as NL6,can be solved without this component, as in this case the searchtree (using only the ILB) is relatively small.6. CONCLUSIONSThe solution of Traveling Tournament Problem has proved to be acomputational difficult challenge. The combination of home/awaypattern constraints and travel distance minimization makes thisproblem very difficult. Its relaxed version (RTTP) seems to beeven har<strong>de</strong>r to solve to a provable optimum. To tackle this problem,we combined different methods: branch-and-bound, dynamicprogramming and metaheuristics. These were combined in a carefulcomputer implementation, allowing us to solve to optimalitysome of the previously open instances.5. COMPUTATIONAL RESULTSThe method proposed in this paper was tested on a subset of thebenchmark instances available in [3]. The results obtained are reportedin Table 1. The previous best known solutions are reportedin Table 2. For the NL8 with two byes, the solution for K = 1 wasused as initial upper bound (⋆); for NL8 with with three byes, theprevious (K = 2) solution provi<strong>de</strong>d the initial upper bound (⋆⋆).CPU times were obtained with a (sequential) implementation inthe C programming language, in a Quad-Core Intel Xeon at 2.66GHz, running Mac OS X 10.6.6.Name # teams K ILB Optimal Solution TimeNL4 4 1 8044 8160 0sNL4 4 2 8044 8160 0sNL4 4 3 8044 8044 0sNL6 6 1 22557 23124 10sNL6 6 2 22557 22557 1sNL8 8 1 38670 39128 44hNL8 8 2 38670 38761 208h(⋆)NL8 8 3 38670 38670 92h(⋆⋆)Table 1: Results for NL Instances. ILB is the in<strong>de</strong>pen<strong>de</strong>nt lowerbound at the root no<strong>de</strong>.Name # teams K Solution Optimal SolutionNL4 4 1 8160 8160NL4 4 2 8160 8160NL4 4 3 8044 8044NL6 6 1 23791 231247. REFERENCES[1] K. Easton, G. Nemhauser, and M. Trick, “The traveling tournamentproblem <strong>de</strong>scription and benchmarks,” 2001.[2] R. Bao, “Time relaxed round robin tournament and the NBAscheduling problem,” Master’s thesis, Cleveland State University,2006.[3] M. Trick, “Challenge traveling tournament instances,”<strong>2011</strong>, (accessed January 29, <strong>2011</strong>). [Online]. Available:http://mat.gsia.cmu.edu/TOURN[4] R. E. Korf, “Depth-first iterative-<strong>de</strong>epening : An optimaladmissible tree search,” Artificial Intelligence,vol. 27, no. 1, pp. 97 – 109, 1985. [Online].Available: http://www.sciencedirect.com/science/article/B6TYF-47X1JH4-G/2/656a3c8f0a14e8d6ca73a9a996faebfe[5] D. C. Uthus, P. J. Riddle, and H. W. Guesgen, “Dfs* and thetraveling tournament problem,” in CPAIOR, ser. Lecture Notesin Computer Science, W. J. van Hoeve and J. N. Hooker, Eds.,vol. 5547. Springer, 2009, pp. 279–293.[6] A. Anagnostopoulos, L. Michel, P. V. Hentenryck, andY. Vergados, “A simulated annealing approach to thetraveling tournament problem,” J. of Scheduling, vol. 9,pp. 177–193, April 2006. [Online]. Available: http://portal.acm.org/citation.cfm?id=1127684.1127697[7] K. Easton, G. L. Nemhauser, and M. A. Trick, “Solving thetravelling tournament problem: A combined integer programmingand constraint programming approach,” in PATAT, ser.Lecture Notes in Computer Science, E. K. Burke and P. D.Causmaecker, Eds., vol. 2740. Springer, 2002, pp. 100–112.Table 2: Previous results for NL Instances from Bao and Trick [2].ALIO-EURO <strong>2011</strong> – 124


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>A Hybrid Algorithm for Minimizing Earliness-tardiness Penalties in ParallelMachinesFulgencia Villa ∗ Ramon Alvarez-Val<strong>de</strong>s † Jose M. Tamarit †∗ Polytechnic University of ValenciaDept. Applied Statistics and Operations Research and Qualitymfuvilju@eio.upv.es† University of ValenciaDept. Statistics and Operations Research{ramon.alvarez, jose.tamarit}@uv.esABSTRACTWe consi<strong>de</strong>r the problem of scheduling a set of jobs on a set ofi<strong>de</strong>ntical parallel machines where the objective is to minimize thetotal weighted earliness and tardiness with respect to a commondue <strong>da</strong>te. We propose a hybrid heuristic algorithm, combining priorityrules for assigning jobs to machines, local search and PathRelinking, with exact procedures for solving the one-machine subproblems.These exact procedures have been <strong>de</strong>veloped by ourgroup in a previous study. The algorithm is compared with thebest reported results on the same instances in or<strong>de</strong>r to assess theefficiency of the proposed strategy.Keywords: Scheduling, Earliness-tardiness, Metaheuristics1. INTRODUCTIONIn Just-In-Time scheduling, not only tardiness but also earlinessare penalized. Tardy jobs, completed after their due <strong>da</strong>te, resultin customer discontent, contract penalties, loss of sales and lossof reputation, but early jobs also have non-<strong>de</strong>sirable effects suchas inventory carrying costs, the opportunity cost of the money investedin inventory, storage and insurance costs, and product <strong>de</strong>terioration.Therefore, criteria involving both earliness and tardinesscosts are receiving increased attention in machine scheduling research.In this paper we consi<strong>de</strong>r the problem of scheduling a setof jobs on a set of i<strong>de</strong>ntical parallel machines where the objectiveis to minimize the total weighted earliness and tardiness with respectto a common due <strong>da</strong>te. In practice, problems with a commondue <strong>da</strong>te appear when a set of components are produced to be assembledin a later phase or when a set of products have to be senttogether to a client.The problem can be <strong>de</strong>fined as follows. There are n jobs to beprocessed on a set of m i<strong>de</strong>ntical parallel machines, all of themwith the same due <strong>da</strong>te d. For each job i, the processing time p i ,the penalty per period of earliness α i , and the penalty per period oftardiness β i , are known. No preemption is allowed, all the jobs areavailable at time zero and the machine is continuously availablefor work. If we <strong>de</strong>note the completion time of job i by C i , theobjective ismin∑ n i α iE i + β i T i ,where E i = max{d −C i ,0} and T i = max{C i − d,0}.When <strong>de</strong>aling with this objective function, two cases can be distinguished.We consi<strong>de</strong>r a problem as non-restrictive if the optimalcost cannot <strong>de</strong>crease with extensions to the common due<strong>da</strong>te. In this case we say that the due <strong>da</strong>te is non-restrictive (d l ),that is, long enough to allow as many jobs as required to be processedin the interval (0,d). In the restrictive case the due <strong>da</strong>te,d r , affects the optimal schedule because not all the required jobsfit into the interval (0,d). According to the classification systemby Graham et al. [1], the problem can be <strong>de</strong>noted as P|d i =d r |∑ i (α i E i + β i T i ). The problem is strongly NP-hard because thebasic problem P||∑ i w i C i , which is already NP-hard, is a particularcase.The non-restrictive case has been studied by Hall [2] and Sun<strong>da</strong>raghavanand Ahmed [3]. Chen and Powell [4] proposed a columngeneration algorithm for P|d i = d l |∑ i ((α i E i + β i T i ), optimallysolving instances of up to 60 jobs. More recently, Rios-Solisy Sourd [5] have studied the restrictive case, <strong>de</strong>veloping heuristicsbased on the efficient exploration of an exponential-size neighborhood.An extensive computational study, using new and existinginstances, shows the good performance of the proposed procedures.Ke<strong>da</strong>d-Sidhoum et al. [6] have <strong>de</strong>veloped a lower boun<strong>da</strong>nd a local search heuristic for the case with distinct due dutes, buttheir procedures can obviously be appplied to the case of a commondue <strong>da</strong>te.2. SOLVING THE ONE-MACHINE PROBLEMThe one-machine problem has been extensively studied. From previousstudies we know that there is always an optimal solution satisfyingthree conditions:1 An optimal schedule does not contain any idle time betweenconsecutive jobs.2 The optimal schedule is V-shaped around the common due<strong>da</strong>te. Jobs completed before or on the due <strong>da</strong>te are scheduledin non-increasing or<strong>de</strong>r of p i /α i , and jobs starting onor after the due <strong>da</strong>te are scheduled in non-<strong>de</strong>creasing or<strong>de</strong>rof p i /β i .3 In the optimal schedule, either the first job starts at timezero or there is a job finishing on the due <strong>da</strong>te.According to property 3, we can classify the instances into twocategories: those for which the optimal solution has a job finishingon the due <strong>da</strong>te and those where the optimal solution starts at timezero. If both conditions hold for a given instance, it is classifiedinto the first category. We have <strong>de</strong>veloped a different quadraticmo<strong>de</strong>l for each class of problems [7].ALIO-EURO <strong>2011</strong> – 125


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>2.1. Mo<strong>de</strong>l 1: Problems in which a job ends on the due <strong>da</strong>temins.t.∑α i b ii∑j>i,inBb j p j +∑β i a ii∑ a j p j (1)j≤i,inAn∑ b i p i ≤ d (2)i=1a i + b i = 1 ∀i = 1,2,...,n (3)a i ,b i ∈ {0,1} ∀i = 1,2,...,n (4)computed from time 0; the contribution of the jobs in A (the secondterm in the expression (5)) is computed from the end of thesequence at time T = ∑ i p i , and the contribution of the straddlingjob appears in the third term.The computational results obtained with these two mo<strong>de</strong>ls on alarge set of test instances from the literature show that Mo<strong>de</strong>l 1is extremely fast, even for very large problems. On the contrary,Mo<strong>de</strong>l 2 is much slower and for instances with more than 20 jobsobtaining the optimal solution in a reasonable time cannot be guaranteed.{1, if i finishes on or before db i =0, otherwise ∀i = 1,2,..,n{1, if i begins on or after <strong>da</strong> i =0, otherwise ∀i = 1,2,..,nIn this mo<strong>de</strong>l, as there is always a job finishing on d, all jobs areclassified as jobs finishing on or before d (the jobs in set B), andjobs starting on or after d (the jobs in set A). Variables a i and b i<strong>de</strong>fine whether each job i belongs to A or B. Obviously, a i = 1−b i ,and constraints (3) are redun<strong>da</strong>nt. We only keep both for the clarityof the mo<strong>de</strong>l. Once the jobs are classified, their relative positionin A and B is <strong>de</strong>termined by property 2. Therefore, the or<strong>de</strong>r requiredin the objective function is known. We take advantage ofthis property by building two or<strong>de</strong>red lists: the B-or<strong>de</strong>r, by nonincreasingor<strong>de</strong>r of p i /α i , and the A-or<strong>de</strong>r, non-<strong>de</strong>creasing or<strong>de</strong>rof p i /β i . In expression (1), the notation " j > i, inB" makes referenceto the B-or<strong>de</strong>r and " j ≤ i, inA" makes reference to the A-or<strong>de</strong>r.The contribution to the objective function of the jobs in B and A isgiven by the first and second terms of expression (1). Constraint(2) ensures that all the jobs being processed before d fit into theinterval (0,d).2.2. Mo<strong>de</strong>l 2: Problems with a job starting at time zeromins.t.∑i∑α i b i (d −j≤i,inBb j p j ) +∑i+∑(1 − b i − a i )β i (T − d −∑ij∑β i a i (T − d − a j p j )j>i,inAa j p j ) (5)n∑ b i p i ≤ d (6)i=1n∑ a i p i ≤ T − d (7)i=1a i + b i ≤ 1 ∀i = 1,...,n (8)∑(a i + b i ) ≥ n − 1 (9)ia i ,b i ∈ {0,1} ∀i = 1,...,n (10)We use the same variables a i and b i from the previous mo<strong>de</strong>l, but inthis case a straddling job can appear, starting before d and finishingafter d. Therefore, we can have a i = b i = 0 for at most one job andconstraints (8) are no longer equalities as they were in Mo<strong>de</strong>l 1.Constraints (9) ensure that, apart from the possible straddling job,all the other jobs must belong to B or A. Constraint (6) guaranteesthat the processing time of jobs in B cannot exceed d. Similarly,constraint (7) ensures that jobs in A do not exceed T −d. As in thismo<strong>de</strong>l the sequence starts at time 0 and no idle time is allowed (byProperty 1), it ends at time T = ∑ i p i . Constraints (8) and (9) holdwith equality if there is no straddling job.The objective function is calculated in a different way. The contributionof the jobs in B (the first term in the expression (5)) is3. A HYBRID HEURISTIC ALGORITHMWe propose a 4-phase algorithmic scheme. In Phase 1, severalheuristic rules produce assignments of jobs to machines. In Phase2, the one-machine problems are solved by using Mo<strong>de</strong>ls 1 and 2.Phase 3 is a local search and Phase 4 is a Path Relinking procedure.• Phase 1: Assignment of jobs to machinesWe use two strategies:1. Strategy 1– Or<strong>de</strong>r the whole set of jobs according to a priorityrule: Non-increasing p j /β j ; p j β j /α j ; p j β j ;p j .– For the next job in the or<strong>de</strong>red list, choose themachine to which the job is assigned, accordingto a criterion: Next machine; Machine withthe lowest sum of processing times; Machinein which adding a job produces a minimum increasein cost.2. Strategy 2– Select a subset of early jobs (jobs we consi<strong>de</strong>rcandi<strong>da</strong>tes for set B on a machine). That canbe done in several ways: solving a one-machineproblem with all the jobs and a due <strong>da</strong>te equalto m ∗ d, or or<strong>de</strong>ring the sets by some criterionfavouring jobs which should be early (such asnon-increasing β j /α j or β j 2/α j) and selectingthe jobs in or<strong>de</strong>r until the sum of processingtimes exceeds m ∗ d. The remaining jobs areconsi<strong>de</strong>red tardy.– The list of early (tardy) jobs is or<strong>de</strong>red by non<strong>de</strong>creasingp j /α l (p j /β j ) and each job is assignedin or<strong>de</strong>r to the machine with the minimumtotal processing time of the jobs alreadyassigned.Many different assignment strategies can be <strong>de</strong>veloped bycombining the priority criteria listed above. We implemente<strong>da</strong>nd compared them in a preliminary computational studyover a reduced set of 288 instances. As expected, none ofthem always produced the best results and we <strong>de</strong>ci<strong>de</strong>d tokeep the 10 best rules, taking into account not only theirindividual performance but also their complementarity, thatis, their ability to produce good results for instances difficultto solve for other rules. Therefore, the result of Phase1 is a set of 10 assignments which are carried over to thesubsequent phases of the process.• Phase 2: Solving the one-machine subproblemsAccording to the computational experience with Mo<strong>de</strong>ls 1and 2, we use the following strategy:– For instances with up to 20 jobs per machine solvethe subproblem with both Mo<strong>de</strong>ls 1 and 2, and keepthe best solution obtained.ALIO-EURO <strong>2011</strong> – 126


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>– For instances with more than 20 jobs per machinesuse only Mo<strong>de</strong>l 1.Mo<strong>de</strong>ls 1 and 2 are solved using CPLEX 11.0. As the objectivefunction is non-convex, we could have previously use<strong>da</strong> convexification procedure. However, our results show thatthe internal convexification strategy of CPLEX is very efficientand therefore we use CPLEX directly.• Phase 3: Local SearchWe use two simple moves in or<strong>de</strong>r to improve the solutionsobtained in Phases 1 and 2. As the procedures of Phase2 produce the optimal (or near-optimal) sequence of thejobs assigned to each machine, these moves are <strong>de</strong>signedto change the assignment of jobs to machines.– Insertion of jobs: Extract a job from its assigned machineand assign it to the machine on which it producesthe minimum cost increase.– Interchange of sublists: We consi<strong>de</strong>r two sublists ofconsecutive tardy jobs on different machines. If thestarting time of the first sublist is earlier than the startingtime of the second sublist and the sum of its tardinesspenalties is also lower than the sum of the tardinesspenalties on the second sublist, exchanging sublistswill <strong>de</strong>crease the total cost of the solution.• Phase 4: Path Relinking– Elite Set: The 10 solutions obtained in Phase 3– Combination of solutionsWe take each pair of solutions of the Elite Set andconsi<strong>de</strong>r one of them in turn as the Initial Solutionand the other as the Guiding Solution.∗ Or<strong>de</strong>r the machines of the Initial Solution in sucha way that the first machine will be the machinewith more jobs in common with the first machineof the Guiding Solution and repeat the processfor the remaining machines.∗ Take the next machine k on the or<strong>de</strong>red list ofthe initial solution S i and compare it with machinek of the guiding solution S g . Let T ik be theset of jobs assigned to machine k in S i and letT gk be the set of jobs in machine k in S g . Buildthe sets J In = T gk ̸ T ik , J Out = T ik ̸ T gk∗ Take the jobs in J In to insert them into T ik andthe jobs in J Out to eliminate them from T ik andinsert them into the machine where they are inS g . For each insertion, consi<strong>de</strong>r the three possibilities:insert into B (early), into A (tardy), ormake it the straddling job, and choose the alternativeof minimum cost.4. COMPUTATIONAL RESULTSWe have used the test instances generated by Rios-Solis and Sourd[5], kindly provi<strong>de</strong>d by the authors, as well as the best known solutionsfo each instance, obtained by the heuristic proposed in [6].There are four sets of instances, differing in the way the processingtimes and the penalties have been generated. The number ofjobs varies between 10 and 200, the machines between 2 and 8,and three types of due <strong>da</strong>tes (more or less restrictive) are used.Each combination of these factors is replicated 10 times, producing3360 instances. In our study, we are currently using only oneinstance for each combination of factors, excluding those of 200jobs, and therefore we <strong>de</strong>al with a set of 288 instances which canbe seen as representative of the whole set.The overall average percentage <strong>de</strong>viation of the solutions obtainedin Phases 1 and 2 from the best known solution is 0.33 %, indicatingthat the constructive procedure which combines priorityassignment rules with the exact solution of subproblems producesgood quality results. However, if we look at the <strong>de</strong>tailed resultsby number of machines, we can see that as the number of machinesincreases, the solutions worsen. Therefore, the assignmentof jobs to machines has to be improved if better solutions are tobe obtained, which is the purpose of Phases 3 and 4. The average<strong>de</strong>viation of the solutions is now -0.063 %. Detailed results by thenumber of jobs and machines and by the strength of the due <strong>da</strong>teappear in Table 1.Jobs 10 20 50 100 125 150-0.14 -0.42 0.15 0.04 -0.01 -0.01Machines 2 4 6 8-0.24 -0.20 0.001 0.19Due <strong>da</strong>te 0.2 0.4 0.6tightness -0.05 -0.12 -0.01Table 1: Average percentage <strong>de</strong>viations from the best known solution5. CONCLUSIONS AND FUTURE WORKThe results obtained so far are encouraging. The combination ofthese four phases allows us to obtain improved solutions for quitea difficult problem. However, several questions still need to be addressed.First, the use of exact mo<strong>de</strong>ls for solving the one-machinesubproblems. These mo<strong>de</strong>ls are currently applied to the job assignmentsprovi<strong>de</strong>d by simple priority rules and would perhapsbe more usefully applied to improved job assignments obtainedby first applying a local search to the results of the priority rules.Second, more aggressive moves can be ad<strong>de</strong>d to the Local Searchin or<strong>de</strong>r to change the job assignments more substantially. Third,the current version of the Path Relinking is quite simple. Morecomplex procedures, such as Dynamic or Evolutive Path Relinkingcould be implemented.6. ACKNOWLEDGEMENTSWe would like to thank Yasmine Rios-Solis and Francis Sourd forproviding us with their instances and results.This study has been partially supported by the Spanish Ministry ofScience and Technology, DPI2008-02700, cofinanced by FEDERfunds.7. REFERENCES[1] Graham, R., E. Lawler, J.K. Lenstra and A.H.G. Rinnooy Kan.Optimization and approximation in <strong>de</strong>terministic sequencingand scheduling: a survey Annals of Discrete Mathematics,5:287-326, 1979.[2] Hall, N. Single and multi-processor for minimizing completiontime variance. Naval Research Logistics Quarterly 33:49-54, 1986.[3] Sun<strong>da</strong>raghavan, P., M. Ahmed. Minimizing the sum of absolutelateness in single machine and multimachine scheduling.Naval Research Logistics Quarterly 31:325-333, 1984.[4] Chen, Z., W. Powell. A column generation based <strong>de</strong>compositionalgorithm for a parallel machine just in time schedulingproblem. European Journal of Operational Research, 116:220-232, 1999.ALIO-EURO <strong>2011</strong> – 127


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>[5] Rios-Solis Y.A., F. Sourd. Exponential neigborhood search fora parallel machine scheduling problem, Computers and OperationsResearch, 35:1697-1712, 2008.[6] Ke<strong>da</strong>d-Sidhoum, S., Rios-Solis Y.A., F. Sourd. Lower boundsfor the earliness-tardiness problem on parallel machines withdistinct due <strong>da</strong>tes, European Journal of Operational Research,189:1305-1316, 2008.[7] Alvarez-Val<strong>de</strong>s R., J.M. Tamarit and F. Villa. Optimal and approximatesolutions for the problem of minimizing weighte<strong>de</strong>arliness-tardiness on a single machine with a common due<strong>da</strong>te. TOP, in press, DOI 10.1007/s11750-010-0163-7, 2010.ALIO-EURO <strong>2011</strong> – 128


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>A hybrid algorithm combining heuristics with Monte Carlo simulation to solvethe Stochastic Flow Shop ProblemEsteban Peruyero ∗ Angel A. Juan ∗ Daniel Riera ∗∗ Open University of CataloniaBarcelona, 08018, SPAINajuanp@gmail.comABSTRACTIn this paper a hybrid simulation-based algorithm is proposed forthe Stochastic Flow Shop Problem. The main i<strong>de</strong>a of the methodologyis to transform the stochastic problem into a <strong>de</strong>terministicproblem and then apply simulation. To achieve this goal we useMonte Carlo simulation and a modified version of the well-knownNEH heuristic. This approach aims to provi<strong>de</strong> flexibility and simplicitydue to the fact that it is not constrained by any previousassumption and relies in well-tested heuristics.Keywords: Scheduling, Monte-Carlo simulation, Heuristics, Randomize<strong>da</strong>lgorithm1. INTRODUCTIONThe Flow Shop Problem (FSP) is a well-known scheduling problemin which a set of in<strong>de</strong>pen<strong>de</strong>nt jobs have to be sequentiallyexecuted (processed) by a set of machines. In this scenario, theprocessing time of each job in each machine is a known constantvalue. The classical FSP goal is to <strong>de</strong>termine a sequence of jobsminimizing the total makespan, which is the time difference betweenthe start and finish of processing all the jobs in all the machines(Figure 1).Figure 1: A graphical representation of the FSPThe Stochastic Flow Shop Problem (SFSP) can be seen as a generalizationof the FSP. In this non-<strong>de</strong>terministic version of the FlowShop Problem, the processing time of each job in each machine isnot a constant value, but instead it is a random variable which followsa given probability distribution. Therefore, in this scenariothe goal uses to be minimizing the expected makespan, which isnot the same as the expected total processing time. The study ofthe SFSP is within the current popularity of introducing randomnessinto combinatorial optimization problems. It allows to <strong>de</strong>scribenew problems in more realistic scenarios where uncertaintyis present.It is important to remark the FSP as a relevant topic for currentresearch. As it happened with other combinatorial optimizationproblems, a large number of different approaches and methodologieshave been <strong>de</strong>veloped to <strong>de</strong>al with the FSP. These approachesrange from pure optimization methods (such as linear and integerprogramming), which allow to solve small-sized problems, to approximatemethods such as heuristics and metaheuristics, whichcan find near-optimal solutions for medium- and large-sized problems.Although the usual goal is to minimize the makespan, othergoals could also be consi<strong>de</strong>red, e.g. to minimize the total processingtime. Moreover, some of these methodologies are able to provi<strong>de</strong>a set of near-optimal solutions from which the <strong>de</strong>cision-makercan choose according to his/her specific utility function. The situationis quite different in the case of the SFSP: to the best of ourknowledge, there is a lack of efficient and flexible methodologiesable to provi<strong>de</strong> near-optimal solutions to the stochastic version ofthe FSP. Moreover, most of the existing approaches are quite theoreticaland make use of restrictive assumptions on the probabilitydistributions that mo<strong>de</strong>l job processing times.2. BASIC NOTATION AND ASSUMPTIONSThe Stochastic Flow Shop Problem (SFSP) is a scheduling problemthat can be formally <strong>de</strong>scribed as follows: a set J of n in<strong>de</strong>pen<strong>de</strong>ntjobs have to be processed by a set M of m in<strong>de</strong>pen<strong>de</strong>nt machines.Each job i ∈ J requires a stochastic processing time, p i j , inevery machine j ∈ M. This stochastic processing time is a randomvariable following a certain distribution, e.g. log-normal, exponential,weibull, etc. The goal is to find a sequence for processingthe jobs so that a given criterion is optimized. The most commonlyused criterion is the minimization of the expected completion timeor expected makespan, <strong>de</strong>noted by E [C max ]. In addition, it is alsoassumed that:• All jobs are processed by all machines in the same or<strong>de</strong>r.• There is unlimited storage between the machines, and nonpreemption.• Machines are always available for processing jobs, but eachmachine can process only one job at a time.• A job cannot be processed more than once for each machine.• Job processing times are in<strong>de</strong>pen<strong>de</strong>nt random variables.At this point, it is interesting to notice that our approach does notrequire to assume any particular distribution for the random variablesthat mo<strong>de</strong>l processing times. In a practical situation, thespecific distributions to be employed will have to be fitted fromhistorical <strong>da</strong>ta (observations) using a statistical software. In mostexisting approaches, however, it is frequently assumed that theseprocessing times will follow a normal or exponential distribution.This assumption is, in our opinion, quite unrealistic and restrictive.For instance, it is unlikely that positive processing times canbe conveniently mo<strong>de</strong>led throughout a normal distribution, sinceany normal distribution inclu<strong>de</strong>s negative values .ALIO-EURO <strong>2011</strong> – 129


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>3. STATE OF THE ART AND RELATED WORKThe FSP is a NP-complete problem [1]. Many heuristics and metaheuristicshave been proposed in or<strong>de</strong>r to solve the FSP due to theimpossibility of finding, in reasonable times, exact solutions formost medium- and large-sized instances. Some of the first publicationson FSP are those of Johnson [2] and Makino[3]. These authorspresented approaches for solving small problems, e.g. problemswith only two machines and two jobs. Campbell et al. [4]built a heuristic for the FSP with more than two machines. TheNEH algorithm is consi<strong>de</strong>red by most researchers as one of thebest performing heuristics for solving the FSP. It was introducedby Nawaz et al. [5]. Later, Tailard [6] reduced the NEH complexityby introducing a <strong>da</strong>ta structure to avoid the calculation ofthe makespan. Ruiz and Stützle [7] proposed the Iterated Greedy(IG) algorithm for the FSP built on a two-step methodology. Inour opinion, this is one of the best algorithms <strong>de</strong>veloped so farto solve the FSP, since it combines simplicity with an outstandingperformance.Many works have focused on the importance of consi<strong>de</strong>ring uncertaintyin real-world problems, particularly in those related toscheduling issues. Thus, Al-Fawzan[8]analyzes the Resource ConstrainedProject Scheduling Problem (RCPSP) by focusing onmakespan reduction and robustness. Jensen[9] also introduces theconcepts of neighborhood-based robustness and tardiness minimization.Ke [10] proposes a mathematical mo<strong>de</strong>l for achievinga formal specification of the Project Scheduling Problem. Allaoui[11] studied makespan minimization and robustness related to theSFSP, suggesting how to measure the robustness. Proactive andreactive scheduling are also characterized in his work. On the onehand, an example of reactive scheduling can be found on Honkompet al. [12], where performance is evaluated using several methodologies.On the other hand, robustness in proactive scheduling isanalyzed in Ghezail et al. [13], who propose a graphical representationof the solution in or<strong>de</strong>r to evaluate obtained schedules.As the concept of minimum makespan from FSP is not representativefor the stochastic problem, Dodin [14] proposes an optimalityin<strong>de</strong>x to study the efficiency of the SFSP solutions. The boun<strong>da</strong>riesof the expected makespan are also analyzed mathematically.A theoretical analysis of performance evaluation based on markovianmo<strong>de</strong>ls is performed in Gourgand et al. [15], where a methodto compute expected time for a sequence using performance evaluationis proposed. A study of the impact of introducing differenttypes of buffering among jobs is also provi<strong>de</strong>d in this work. On theother hand, Integer and linear programming have been employedtogether with probability distributions to represent the problem inJanak et al. [16].Simulation has been applied in Juan et al. [17] to solve the FSP.In this work, the NEH algorithm is randomized using a biasedprobability distribution. Thus, their approach is somewhat similarto a GRASP-like methodology. Simulation-based approachesfor the SFSP have mainly focused on performance evaluation, as inGougard et al. [18]. Similarly, Dodin [14] performs simulations asa way to vali<strong>da</strong>te his empirical analysis on the makespan boun<strong>da</strong>ries.Finally, Honkomp et al. [12] also make use of simulationtechniques in their approach for reactive scheduling.In a recent work, Juan et al. [19] <strong>de</strong>scribe the application of simulationtechniques to solve the Vehicle Routing Problem withStochastic Demands (VRPSD). The VRPSD is a variation of theclassical Vehicle Routing Problem where customer <strong>de</strong>mands arenot known in advance. These <strong>de</strong>mands are random variables followingsome probability distributions. The authors propose totransform the original stochastic problem into a set of related <strong>de</strong>terministicproblems, which are then solved using an efficient algorithmintroduced in a previous work [20]. As it will be discussed inmore <strong>de</strong>tail next, this paper proposes a similar approach for solvingthe SFSP.4. PROPOSED METHODOLOGYThe main i<strong>de</strong>a behind our simulation-based approach is to transformthe initial SFSP instance into a FSP instance and then to obtaina set of near-optimal solutions for the <strong>de</strong>terministic problemby using an efficient FSP algorithm. Notice that, by construction,these FSP solutions are also feasible solutions of the original SFSPinstance. Then, simulation is used to <strong>de</strong>termine which solution,among the best-found <strong>de</strong>terministic ones, shows a lower expectedmakespan when consi<strong>de</strong>ring stochastic times. This strategy assumesthat a strong correlationship exists between near-optimalsolutions for the FSP and near-optimal solutions for the SFSP. Putin other words, good solutions for the FSP are likely to representgood solutions for the SFSP. Notice, however, that not necessarilythe best-found FSP solution will become the best-found SFSP solution,since its resulting makespan might be quite sensitive to variationsin the processing times. The transformation step is achievedby simply consi<strong>de</strong>ring the expected value of each processing timeas a constant value. Since any FSP solution will be also a feasibleSFSP solution, it is possible to use Monte Carlo simulation toobtain estimates for the expected makespan. That is, we obtainthese estimates by iteratively reproducing the stochastic behaviourof the processing times in the sequence of jobs given by the FSPsolution. Of course, this simulation process will take as many iterationsas necessary to obtain accurate estimates. If necessary,variance reduction techniques could be employed in or<strong>de</strong>r to reducethe number of iterations to run. Figure 2 shows the flow chartdiagram of our approach, which is <strong>de</strong>scribed next in <strong>de</strong>tail:1. Consi<strong>de</strong>r a SFSP instance <strong>de</strong>fined by a set J of jobs and aset M of machines with random processing times, p i j , foreach job i ∈ J in each machine j ∈ M.2. For each random processing time p i j , consi<strong>de</strong>r its expectedor mean value p ∗ i j = E [ p i j].3. Let FSP* be the non-stochastic problem associated with theprocessing times p ∗ i j , ∀i ∈ J, j ∈ M.4. Using any efficient algorithm (e.g. [7, 17]), obtain a set Sof n near-optimal solutions for the FSP*.5. For each s k ∈ S, k = 1,2,...n, consi<strong>de</strong>r the sequence of jobsin s k and then start a Monte Carlo simulation in or<strong>de</strong>r to estimatethe expected makespan associated with this sequenceof jobs. Notice that for each s k , random observations fromeach p i j (i ∈ J, j ∈ M) are iteratively generated while maintainingthe sequence of jobs provi<strong>de</strong>d by s k .6. Return the sequence of jobs (solution) which provi<strong>de</strong>s thelowest expected makespan.5. CONTRIBUTION OF OUR APPROACHThe i<strong>de</strong>a of solving a stochastic combinatorial optimization problemthrough solving one related <strong>de</strong>terministic problem and thenapplying simulation is not new (see [19]). However, to the best ofour knowledge, this is the first time this approach has been used tosolve the SFSP. In fact, most of the SFSP research to <strong>da</strong>te has focusedon theoretical aspects of stochastic scheduling. By contrast,the proposed method provi<strong>de</strong>s a relatively simple and flexible approachto the SFSP, which in our opinion offers some valuablebenefits. In particular, our approach suggests a more practical perspectivewhich is able to <strong>de</strong>al with more realistic scenarios: byintegrating Monte Carlo simulation in our methodology, it is possibleto naturally consi<strong>de</strong>r any probabilistic distribution for mo<strong>de</strong>lingthe random job processing times.ALIO-EURO <strong>2011</strong> – 130


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>of these solutions offers the lowest expected makespan. This approachdoes not require any previous assumption and is valid forany probability distribution.7. ACKNOWLEDGEMENTSThis work has been partially supported by the Spanish Ministryof Science and Innovation (TRA2010-21644-C03). This work hasbeen <strong>de</strong>veloped in the context of the CYTED-IN3-HAROSA Network(http://dpcs.uoc.edu).8. REFERENCESFigure 2: Flow chart of the proposed algorithmThus, as far as we know, the presented methodology offers someunique advantages over other existing SFSP approaches. To bespecific: (a) the methodology is valid for any statistical distributionwith a known mean, both theoretical -e.g. Normal, Log-normal,Weibull, Gamma, etc.- or experimental; and (b) the methodologyreduces the complexity of solving the SFSP -where no efficientmethods are known yet- to solving the FSP, where mature and extensivelytested algorithm have been <strong>de</strong>veloped already. All in all,the credibility and utility of the provi<strong>de</strong>d solution is increased. Noticealso that, being based on simulation, the methodology canbe easily exten<strong>de</strong>d to consi<strong>de</strong>r a different distribution for eachjob-machine processing time, possible <strong>de</strong>pen<strong>de</strong>ncies among thesetimes, etc. Moreover, the methodology can be applied to SFSP instancesof any size as far as there exists efficient FSP metaheuristicsable to solve those instances. In summary, the benefits provi<strong>de</strong>dby our methodology can be summarized in two propierties:simplicity and flexibility.6. CONCLUSIONSIn this paper we have presented a hybrid approach for solvingthe Stochastic Flow Shop Problem. The methodology combinesMonte Carlo simulation with well tested algorithms for the FlowShop Problem. The basic i<strong>de</strong>a of our approach is to transform theinitial stochastic problem into a related <strong>de</strong>terministic problem, thenobtain a set of alternative solutions for this latter problem usingany efficient algorithm, and finally use simulation to verify which[1] A. H. R. Kan, Machine scheduling problems: Classification,complexity and computations. Nijhoff (The Hague), 1996.[2] S. M. Johnson, “Optimal two- and three-stage productionschedules with setup times inclu<strong>de</strong>d. naval research logistics,”Naval Research Logistics Quarterly, no. 1, pp. 61–68,1954.[3] T. Makino, “On a scheduling problem,” Operations ResearchSociety of Japan, vol. 8, pp. 32–44, 1965.[4] H. G. Campbell, R. A. Du<strong>de</strong>k, and M. L. Smith, “A heuristicalgorithm for the n job, m machine sequencing problem,”Management Science, vol. 23, no. 16, pp. B630–B637, 1973.[5] M. Nawaz, E. Enscore, and I. Ham, “A heuristic algorithmfor the m-machine, n-job flow-shop sequencing problem,”Omega, vol. 11, no. 1, pp. 91–95, 1983.[6] E. Taillard, “Some efficient heuristic methods for the flowshop sequencing problem,” European Journal of OperationalResearch, vol. 47, no. 1, pp. 65–74, 1990.[7] R. Ruiz and T. Stützle, “A simple and effective iteratedgreedy algorithm for the permutation flowshop schedulingproblem,” European Journal of Operational Research, vol.177, pp. 2033–2049, 2007.[8] M. A. Al-Fawzan and M. Haouari, “A bi-objective mo<strong>de</strong>lfor robust resource-constrained project scheduling,” InternationalJournal of Production Economics, vol. 96, no. 2, pp.175–187, 2005.[9] M. T. Jensen, “Improving robustness and flexibility of tardinessand total flow-time job shops using robustness measures,”Applied Soft Computing, vol. 1, no. 1, pp. 35–52,2001.[10] H. Ke and B. Liu, “Project scheduling problem with stochasticactivity duration times,” Applied Mathematics and Computation,vol. 168, no. 1, pp. 342–353, 2005.[11] H. Allaoui, S. Lamouri, and M. Lebbar, “A robustness frameworkfor a stochastic hybrid flow shop to minimize themakespan,” in International Conference on Service Systemsand Service Management, 2006, pp. 1097–1102.[12] S. Honkomp, L. Mockus, and G. Reklaitis, “Robust schedulingwith processing time uncertainty,” Computers & ChemicalEngineering, vol. 21, no. Supplement 1, pp. S1055–S1060, 1997.[13] F. Ghezail, H. Pierreval, and S. Hajri-Gabouj, “Analysis ofrobustness in proactive scheduling: A graphical approach,”Computers & Industrial Engineering, vol. 58, no. 2, pp. 193–198, 2010.[14] B. Dodin, “Determining the optimal sequences and the distributionalproperties of their completion times in stochasticflow shops,” Computers & Operations Research, vol. 23,no. 9, pp. 829–843, 1996.ALIO-EURO <strong>2011</strong> – 131


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>[15] M. Gourgand, N. Grangeon, and S. Norre, “Markovian analysisfor performance evaluation and scheduling in m machinestochastic flow-shop with buffers of any capacity,” EuropeanJournal of Operational Research, vol. 161, no. 1, pp. 126–147, 2005.[16] S. L. Janak, X. Lin, and C. A. Flou<strong>da</strong>s, “A new robust optimizationapproach for scheduling un<strong>de</strong>r uncertainty: Uncertaintywith known probability distribution,” Computers &Chemical Engineering, vol. 31, no. 3, pp. 171–195, 2007.[17] A. Juan, R. Ruiz, H. Lourenço, M. Mateo, and D. Ionescu,“A simulation-based approach for solving the flowshop problem,”in <strong>Proceedings</strong> of the 2010 Winter Simulation Conference.Baltimore, Maryland, USA., 2010, pp. 3384–3395.[18] M. Gourgand, N. Grangeon, and S. Norre, “A contributionto the stochastic flow shop scheduling problem,” EuropeanJournal of Operational Research, vol. 151, no. 2, pp. 415–433, 2003.[19] A. Juan, J. Faulin, S. Grasman, D. Riera, J. Marull,and C. Men<strong>de</strong>z, “Using safety stocks and simulationto solve the vehicle routing problem with stochastic<strong>de</strong>mands,” Transportation Research Part C, 2010,doi:10.1016/j.trc.2010.09.007.[20] A. Juan, J. Faulin, J. Jorba, D. Riera, D. Masip, and B. Barrios,“On the use of monte carlo simulation, cache and splittingtechniques to improve the clarke and wright savingsheuristics,” Journal of the Operational Research Society,2010, doi:10.1057/jors.2010.29.ALIO-EURO <strong>2011</strong> – 132


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>A Simulation-based algorithm for solving the Vehicle Routing Problem withStochastic DemandsAngel Juan ∗ Javier Faulin † Daniel Riera ∗ Jose Caceres ∗ Scott Grasman ‡∗ Open University of Catalonia - IN3Barcelona, Spain{ajuanp, drierat, jcaceresc}@uoc.edu† Public University of NavarrePamplona, Spainjavier.faulin@unavarra.es‡ Missouri University of Science & TechnologyRolla, MO, USAgrasmans@mst.eduABSTRACTThis paper proposes a flexible solution methodology for solvingthe Vehicle Routing Problem with Stochastic Demands (VRPSD).The logic behind this methodology is to transform the issue ofsolving a given VRPSD instance into an issue of solving a small setof Capacitated Vehicle Routing Problem (CVRP) instances. Thus,our approach takes advantage of the fact that extremely efficientmetaheuristics for the CVRP already exists. The CVRP instancesare obtained from the original VRPSD instance by assigning differentvalues to the level of safety stocks that routed vehicles mustemploy to <strong>de</strong>al with unexpected <strong>de</strong>mands. The methodology alsomakes use of Monte Carlo Simulation (MCS) to obtain estimatesof the expected costs associated with corrective routing actions (recourseactions) after a vehicle runs out of load before completingits route.Keywords: Metaheuristics, Routing, Scheduling1. INTRODUCTIONThe Vehicle Routing Problem with Stochastic Demands (VRPSD)is a well-known NP-hard problem in which a set of customers withrandom <strong>de</strong>mands must be served by a fleet of homogeneous vehicles<strong>de</strong>parting from a <strong>de</strong>pot, which initially holds all availableresources. There are some tangible costs associated with the distributionof these resources from the <strong>de</strong>pot to the customers. Inparticular, it is usual for the mo<strong>de</strong>l to explicitly consi<strong>de</strong>r costsdue to moving a vehicle from one no<strong>de</strong> -customer or <strong>de</strong>pot- toanother. These costs are often related to the total distance traveled,but they can also inclu<strong>de</strong> other factors such as number of vehiclesemployed, service times for each customer, etc. The classical goalhere consists of <strong>de</strong>termining the optimal solution (set of routes)that minimizes those tangible costs subject to the following constraints:(i) all routes begin and end at the <strong>de</strong>pot; (ii) each vehiclehas a maximum load capacity, which is consi<strong>de</strong>red to be the samefor all vehicles; (iii) all (stochastic) customer <strong>de</strong>mands must be satisfied;(iv) each customer is supplied by a single vehicle; and (v) avehicle cannot stop twice at the same customer without incurringin a penalty cost.Notice that the main difference between the Capacitated VehicleRouting Problem (CVRP) and the VRPSD is that in the formerall customer <strong>de</strong>mands are known in advance, while in the latterthe actual <strong>de</strong>mand of each customer has a stochastic nature, i.e.,its statistical distribution is known beforehand, but its exact valueis revealed only when the vehicle reaches the customer. For theCVRP, a large set of efficient optimization methods, heuristics andmetaheuristics have been already <strong>de</strong>veloped ([1]). However, thisis not yet the case for the VRPSD, which is a more complex problemdue to the uncertainty introduced by the random behavior ofcustomer <strong>de</strong>mands. Therefore, as suggested by Novoa and Storer[2], there is a real necessity for <strong>de</strong>veloping more efficient and flexibleapproaches for the VRPSD. On one hand, these approachesshould be efficient in the sense that they should provi<strong>de</strong> optimalor near-optimal solutions to small and medium VRPSD instancesin reasonable times. On the other hand, they should be flexible inthe sense that no further assumptions need to be ma<strong>de</strong> concerningthe random variables used to mo<strong>de</strong>l customer <strong>de</strong>mands, e.g., thesevariables should not be assumed to be discrete neither to followany particular distribution. To the best of our knowledge, most ofthe existing approaches to the VRPSD do not satisfy this flexibilityrequirement.The random behavior of customer <strong>de</strong>mands could cause an expectedfeasible solution to become infeasible if the final <strong>de</strong>mandof any route exceeds the actual vehicle capacity. This situationis referred to as “route failure”, and when it occurs, some correctiveactions must be introduced to obtain a new feasible solution.For example, after a route failure, the associated vehicle might beforced to return to the <strong>de</strong>pot in or<strong>de</strong>r to reload and resume the distributionat the last visited customer. Our methodology proposesthe construction of solutions with a low probability of sufferingroute failures. This is basically attained by constructing routes inwhich the associated expected <strong>de</strong>mand will be somewhat lowerthan the vehicle capacity. Particularly, the i<strong>de</strong>a is to keep a certainamount of surplus vehicle capacity (safety stock or buffer) while<strong>de</strong>signing the routes so that if the final routes’ <strong>de</strong>mands exceedtheir expected values up to a certain limit, they can be satisfiedwithout incurring a route failure.2. BASIC NOTATIONThe Stochastic Vehicle Routing Problem (SVRP) is a family ofwell-known vehicle routing problems characterized by the randomnessof at least one of their parameters or structural variables[3]. This uncertainty is usually mo<strong>de</strong>led by means of suitablerandom variables which, in most cases, are assumed to be in<strong>de</strong>pen<strong>de</strong>nt.The Vehicle Routing Problem with Stochastic DemandsALIO-EURO <strong>2011</strong> – 133


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>(VRPSD) is among the most popular routing problems within theSVRP family. There are two other classical problems belongingto that family: the Vehicle Routing Problem with Stochastic Customers(VRPSC) which was solved by Gendreau et al. [4] usingan a<strong>da</strong>pted Tabu Search, and the Vehicle Routing Problem withStochastic Times (VRPST), but their applications are rather limitedin comparison with the VRPSD, which is <strong>de</strong>scribed in <strong>de</strong>tailnext.Consi<strong>de</strong>r a complete network constituted by n + 1 no<strong>de</strong>s, V ={0,1,2,...,n}, where no<strong>de</strong> 0 symbolizes the central <strong>de</strong>pot andV ∗ = V \{0} is the set of no<strong>de</strong>s or vertices representing the n customers.The costs associated with traveling from no<strong>de</strong> i to no<strong>de</strong> jare <strong>de</strong>noted by c(i, j) ∀i, j ∈ V , where the following assumptionshold true: (i) c(i, j) = c( j,i) (i.e., costs are usually assumed tobe symmetric, although this assumption could be relaxed if necessary);(ii) c(i,i) = 0, and (iii) c(i, j) ≤ c(i,k) + c(k, j) ∀k ∈ V (i.e.,the triangle inequality is satisfied). These costs are usually expressedin terms of traveled distances, traveling plus service timeor a combination of both. Let the maximum capacity of each vehiclebe V MC >> max i∈V ∗{D i }, where D i ≥ 0 ∀i ∈ V ∗ are thein<strong>de</strong>pen<strong>de</strong>nt random variables that <strong>de</strong>scribe customer <strong>de</strong>mands -itis assumed that the <strong>de</strong>pot has zero <strong>de</strong>mand. This capacity constraintimplies that the <strong>de</strong>mand random value never will be greaterthan the V MC value, which allows us an a<strong>de</strong>quate performance ofour procedure. For each customer, the exact value of its <strong>de</strong>mandis not known beforehand but it is only revealed once the vehiclevisits. No further assumptions are ma<strong>de</strong> on these random variablesother than that they follow a well-known theoretical or empiricalprobability distribution -either discrete or continuous- with existingmean <strong>de</strong>noted by E[D i ]. In this context, the classical goal isto find a feasible solution (set of routes) that minimizes the expected<strong>de</strong>livery costs while satisfying all customer <strong>de</strong>mands andvehicle capacity constraints. Even when these are the most typicalrestrictions, other constraints and factors are sometimes consi<strong>de</strong>red,e.g., maximum number of vehicles, maximum allowablecosts for a route, costs associated with each <strong>de</strong>livery, time windowsfor visiting each customer, solution attractiveness or balance, environmentalcosts, and other externalities.3. OUR SIMULATION-BASED APPROACHOur approach is inspired by the following facts: (a) the VRPSDcan be seen as a generalization of the CVRP or, to be more specific,the CVRP is just a VRPSD with constant <strong>de</strong>mands –random<strong>de</strong>mands with zero variance–; and (b) while the VRPSD is yetan emerging research area, extremely efficient metaheuristics doalready exists for solving the CVRP. Thus, one key i<strong>de</strong>a behindour approach is to transform the issue of solving a given VRPSDinstance into a new issue which consists of solving several “conservative”CVRP instances, each characterized by a specific risk(probability) of suffering route failures. The term conservativerefers here to the fact that only a certain percentage of the vehicletotal capacity will be consi<strong>de</strong>red as available during the routing<strong>de</strong>sign phase. In other words, part of the total vehicle capacitywill be reserved for attending possible “emergencies” causedby un<strong>de</strong>r-estimated random <strong>de</strong>mands during the actual distribution(routing execution) phase. This part can be consi<strong>de</strong>red as a safetystock since it reflects the level of extra stock that is maintained tobuffer against possible route failures. Next, the specific steps ofour methodology are <strong>de</strong>scribed in <strong>de</strong>tail:1. Consi<strong>de</strong>r a VRPSD instance <strong>de</strong>fined by a set of customers withstochastic <strong>de</strong>mands, where each <strong>de</strong>mand is a random variable followinga given statistical distribution –either theoretical or empiricalas long as its mean exists.2. Set a value k for the percentage of the maximum vehicle capacitythat will be used as safety stock during the routing <strong>de</strong>signstage.3. Consi<strong>de</strong>r the CVRP(k) <strong>de</strong>fined by: (a) the reduced total vehiclecapacity, and (b) the <strong>de</strong>terministic <strong>de</strong>mands given by the expectedvalue of the real stochastic <strong>de</strong>mands.4. Solve the CVRP(k) by using any efficient CVRP methodology.Notice that the solution of this CVRP(k) is also an aprioristicsolution for the original VRPSD. Moreover, it will be a feasibleVRPSD solution as long as there will be no route failure, i.e., aslong as the extra <strong>de</strong>mand that might be originated during executiontime in each route does not exceed the vehicle reserve capacity orsafety stock. Notice also that the cost given by this solution can beconsi<strong>de</strong>red as a base or fixed cost of the VRPSD solution, i.e., thecost of the VRPSD in case that no route failures occur. Chancesare that some route failures occur during the execution phase. If so,corrective actions -such as returning to the <strong>de</strong>pot for a reload beforeresuming distribution- and their corresponding variable costswill need to be consi<strong>de</strong>red. Therefore, the total costs of the correspondingVRPSD solution will be the sum of the CVRP(k) fixedcosts and the variable costs due to the corrective actions.5. Using the solution obtained in the previous step, estimate the expected(average) costs due to possible failures in each route. Thiscan be done by using Monte Carlo simulation, i.e., random <strong>de</strong>mandsare generated and whenever a route failure occurs (or justbefore it happens), a corrective policy is applied and its associatedcosts are registered. In the experimental section of this paper, everytime a route fails we consi<strong>de</strong>r the costs of a round-trip fromthe current customer to the <strong>de</strong>pot; but, since we are using simulation,other alternative policies and costs could also be consi<strong>de</strong>redin a natural way. After iterating this process for some hundred/thousandtimes, a random sample of observations regardingthese variable costs are obtained and an estimate for its expectedvalue can be calculated.6. Depending on the total costs associated with the solutions alreadyobtained, repeat the process from Step 1 with a new value ofk -i.e., explore different scenarios to check how different levels ofsafety stock affect the expected total cost of the VRPSD solution.7. Finally, provi<strong>de</strong> a sorted list with the best VRPSD solutionsfound so far as well as their corresponding properties: fixed costs,expected variable costs, and expected total costs.4. EXPERIMENTAL RESULTS AND DISCUSSIONIn the CVRP literature, there exists a classical set of very wellknownbenchmarks commonly used to test their algorithm. However,as noticed by Bianchi et al. [5], there are no commonly usedbenchmarks in the VRPSD literature and, therefore, each paperpresents a different set of randomly generated benchmarks. Thus,we <strong>de</strong>ci<strong>de</strong>d to employ a natural generalization of several classicalCVRP instances by using stochastic <strong>de</strong>mands instead of constantones. So, for each instance, while we <strong>de</strong>ci<strong>de</strong>d to keep all no<strong>de</strong>coordinates and vehicle capacities, we changed d i , the <strong>de</strong>terministic<strong>de</strong>mands of client i (∀i ∈ {1,2,...,#no<strong>de</strong>s − 1}) to stochastic<strong>de</strong>mands D i following an exponential distribution with E[D i ] = d i .For each instance, a total of 16 scenarios were simultaneously executedusing a cluster of 16 personal computers IntelRCore TM 2Quad Q8200 at 2.33GHz and 2GB RAM. The 16 scenarios wereobtained by varying the available vehicle capacity (i.e., the complementaryof the safety-stocks level) from 100% to 85% duringthe routing-<strong>de</strong>sign stage. Table 1 shows the complete results obtainedfor all 55 classical instances we generalized and tested.The first column in Table 1 contains the name of each instance,which inclu<strong>de</strong>s the number of no<strong>de</strong>s and also the number of routesof the ‘stan<strong>da</strong>rd’ solution, e.g. B-n78-k10 is an instance of classB with 78 no<strong>de</strong>s and able to be solved with a 10-route solution.ALIO-EURO <strong>2011</strong> – 134


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Columns 2 to 4 are related to solutions obtained by our algorithmwhen a 100 % of the vehicle maximum capacity is consi<strong>de</strong>red duringthe <strong>de</strong>sign stage. Notice that this strategy always provi<strong>de</strong>spseudo-optimal solutions in terms of fixed costs (Column 2), sincethey can be directly compared with the CVRP best-known solution.However, since no safety stock is used, there is a chance thatthese solutions can suffer from route failures. In turn, route failuresmight imply high expected variable costs (estimated in Column3 by Monte Carlo simulation), thus increasing the total expectedcosts, which is estimated in Column 4. Here is where usingsafety stocks can be of value: by not necessarily using all vehiclemaximum capacity during the <strong>de</strong>sign stage, some route failurescan be avoi<strong>de</strong>d. Hopefully, this might lead to new solutions withslightly higher fixed costs but also with lower expected variablecosts. At the end, these alternative solutions might present lowertotal expected costs, which are the ones to be minimized. On theone hand, columns 5 to 9 show the results obtained with our algorithm.Notice that fixed costs in Column 7 are always higher orequal to those in Column 2. However, total expected costs in Column9 are always lower or equal to those in Column 4. Notice alsothat sometimes the best-found strategy (for this set of benchmarks)is to use a 100 % of the vehicle maximum capacity (i.e. no safetystocks at all) when <strong>de</strong>signing the routes (Column 5).5. CONCLUDING REMARKSWe have presented a hybrid approach to solving the Vehicle RoutingProblem with Stochastic Demands (VRPSD). The approachcombines Monte Carlo simulation with well-tested metaheuristicsfor the Capacitated Vehicle Routing Problem (CVRP). One of thebasic i<strong>de</strong>as of our methodology is to consi<strong>de</strong>r a vehicle capacitylower than the actual maximum vehicle capacity when <strong>de</strong>signingVRPSD solutions. This way, this capacity surplus or safety stockscan be used when necessary to cover route failures without havingto assume the usually high costs involved in vehicle restocktrips. Another important i<strong>de</strong>a is to transform the VRPSD instanceto a limited set of CVRP instances -each of them <strong>de</strong>fined by agiven safety-stocks level-, to which efficient solving methods canbe applied. Our approach provi<strong>de</strong>s the <strong>de</strong>cision-maker with a setof alternative solutions, each of them characterized by their totalestimated costs, leaving to him/her the responsibility of selectingthe specific solution to be implemented according to his/her utilityfunction. Although other previous works have proposed to benefitfrom the relationship between the VRPSD and the CVRP, theyusually require hard assumptions that are not always satisfied inrealistic scenarios. On the contrary, our approach relaxes most ofthese assumptions and, therefore, it allows for consi<strong>de</strong>ring morerealistic customer <strong>de</strong>mand scenarios. Thus, for example, our approachcan be used to solve CVRPSD instances with hundreds ofno<strong>de</strong>s in a reasonable time and, even more important, it is valid forvirtually any statistical distribution –the one that best fits historical<strong>da</strong>ta on customer <strong>de</strong>mands.6. ACKNOWLEDGEMENTSThis work has been partially supported by the Spanish Ministry ofScience and Innovation (TRA2010-21644-C03) and by the Navarreseand Catalan Governments (IIQ13172.RI1-CTP09-R2, 2009CTP 00007 and Jerónimo <strong>de</strong> Ayanz network). This work has been<strong>de</strong>veloped in the context of the CYTED-IN3-HAROSA Network(http://dpcs.uoc.edu).7. REFERENCES[1] G. Laporte, “What you should know about the vehicle routingproblem,” Naval Research Logistics, vol. 54, pp. 811–819,2007.[2] C. Novoa and R. Storer, “An approximate dynamic programmingapproach for the vehicle routing problem with stochastic<strong>de</strong>mands,” European Journal of Operational Research, no.196, pp. 509–515, 2009.[3] C. Bastian and A. R. Kan, “The stochastic vehicle routingproblem revisited,” European Journal of Operations Research,vol. 56, pp. 407–412, 2000.[4] M. Gendreau, G. Laporte, and R. SÈguin, “A tabu searchheuristic for the vehicle routing problem with stochastic <strong>de</strong>mands,”Operations Research, vol. 44(3), pp. 469–477, 1996.[5] L. Bianchi, M. Birattari, M. Chiarandini, M. Mastrolilli, L. Paquete,O. Rossi-Doria, and T. Schiavinotto, “Hybrid metaheuristicsfor the vehicle routing problem with stochastic <strong>de</strong>mands,”Journal of Mathematical Mo<strong>de</strong>lling and Algorithms,vol. 5, pp. 91–110, 2006.ALIO-EURO <strong>2011</strong> – 135


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Using 100% of the CapacityUsing a Percentage P of the CapacityInstance Fixed Variable Total (1) P Routes Fixed Variable Total (2) Time (s) Gap (1) - (2)A-n32-k5 787.08 179.49 966.57 100% 5 787.08 179.49 966.57 1 0.00%A-n33-k5 662.11 159.77 821.88 97% 5 676.10 135.80 811.90 1 1.21%A-n33-k6 742.69 162.45 905.14 100% 6 742.69 162.45 905.14 1 0.00%A-n37-k5 672.47 134.43 806.89 97% 5 692.53 109.47 802.00 1 0.61%A-n38-k5 733.95 157.48 891.43 93% 6 761.25 117.97 879.22 1 1.37%A-n39-k6 835.25 178.10 1,013.35 94% 6 842.92 150.35 993.27 1 1.98%A-n45-k6 944.88 254.68 1,199.55 94% 7 979.31 197.70 1,177.01 1 1.88%A-n45-k7 1,154.39 325.68 1,480.07 100% 7 1,154.39 325.68 1,480.07 2 0.00%A-n55-k9 1,074.96 304.33 1,379.28 100% 9 1,074.96 304.33 1,379.28 1 0.00%A-n60-k9 1,362.19 395.42 1,757.61 100% 9 1,362.19 395.42 1,757.61 2 0.00%A-n61-k9 1,040.31 288.01 1,328.32 95% 10 1,073.86 241.57 1,315.43 1 0.97%A-n63-k9 1,632.19 518.31 2,150.50 100% 9 1,632.19 518.31 2,150.50 4 0.00%A-n65-k9 1,184.95 341.43 1,526.37 99% 10 1,213.73 304.73 1,518.46 1 0.52%A-n80-k10 1,773.79 548.84 2,322.63 100% 10 1,773.79 548.84 2,322.63 7 0.00%B-n31-k5 676.09 169.46 845.54 95% 5 680.98 158.07 839.05 1 0.77%B-n35-k5 958.89 267.77 1,226.66 99% 5 978.51 239.61 1,218.12 3 0.70%B-n39-k5 553.20 142.48 695.68 100% 5 553.20 142.48 695.68 1 0.00%B-n41-k6 834.92 248.30 1,083.22 96% 7 856.76 224.13 1,080.89 1 0.22%B-n45-k5 754.23 146.48 900.71 100% 5 754.23 146.48 900.71 1 0.00%B-n50-k7 744.23 202.85 947.07 93% 7 754.26 186.11 940.37 1 0.71%B-n52-k7 754.38 204.83 959.21 92% 7 771.02 164.87 935.88 1 2.43%B-n56-k7 716.42 211.94 928.36 88% 8 757.68 140.32 898.00 1 3.27%B-n57-k9 1,602.28 559.89 2,162.17 96% 9 1,623.27 515.53 2,138.80 1 1.08%B-n64-k9 868.40 277.39 1,145.79 100% 9 868.40 277.39 1,145.79 10 0.00%B-n67-k10 1,039.46 316.59 1,356.05 100% 10 1,039.46 316.59 1,356.05 1 0.00%B-n68-k9 1,283.16 442.17 1,725.33 97% 9 1,303.09 388.54 1,691.63 8 1.95%B-n78-k10 1,245.82 367.24 1,613.06 98% 10 1,252.38 357.03 1,609.41 9 0.23%Table 1: Results for instances A and B using exponentially distributed <strong>de</strong>mands with E[D i ] = d iALIO-EURO <strong>2011</strong> – 136


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Vehicle routing for mixed solid waste collection - comparing alternativehierarchical formulationsTeresa Bianchi-Aguiar ∗ Maria Antónia Carravilla ∗ José F. Oliveira ∗∗ INESC–Porto, <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> <strong>Engenharia</strong>, Universi<strong>da</strong><strong>de</strong> do PortoRua Dr. Roberto Frias s/n, 4200-465 Porto, Portugal{mtbaguiar, mac, jfo}@fe.up.ptABSTRACTThe aim of this paper is to present and compare alternative hierarchicalformulations for the periodic vehicle routing problem forsolid waste collection. The solution of this problem is a one–weekplan of <strong>da</strong>ily routes for the transportation of mixed solid wastefrom containers to disposal facilities, taking into consi<strong>de</strong>ration thefrequency of collection of each container within the planning horizon,the road network and the resources available. The objective isto minimize operation costs.The real-world case that supported this study was the collection ofmixed solid waste in Ponte <strong>de</strong> Lima, a municipality in the northof Portugal, and the problem was mo<strong>de</strong>lled as a Periodic VehicleRouting Problem (PVRP) with the additional constraint that routesmust pass through one of the alternative disposal facilities beforereturning to the <strong>de</strong>pot.Based on this real case scenario, we propose a framework of MIPmo<strong>de</strong>ls with three hierarchical approaches besi<strong>de</strong>s the monolithicmo<strong>de</strong>l. The hierarchical approaches are i<strong>de</strong>ntified by the aggregationof the <strong>de</strong>cisions in each level: (1) assign and route together;(2) assign <strong>da</strong>ys first - assign vehicles and route second; (3) assignfirst - route second and (4) assign <strong>da</strong>ys first - assign vehicles second- route third. Some new estimates for downstream constraintswere <strong>de</strong>veloped and integrated in upstream levels in or<strong>de</strong>r to guaranteefeasibility.Keywords: Waste collection, Hierarchical formulations, Periodicvehicle routing1. INTRODUCTIONThe costs of the collection of solid waste range between 40 and60% of a community’s solid waste management system expenditures[1]. An efficient management of the solid waste collectioncan therefore generate significant savings while ensuring hygienepatterns and satisfaction of the inhabitants, besi<strong>de</strong>s all the otheradvantages common to the efficient management of transportationsystems.This work is based on a real case concerning Ponte <strong>de</strong> Lima, amunicipality in the north of Portugal. The municipality managesthe collection of the mixed waste generated in Ponte <strong>de</strong> Lima andguarantees its transport to disposal facilities. The main objectiveof the work done with the municipality was the reduction of thecollection costs, that are highly <strong>de</strong>pen<strong>de</strong>nt of the distance traveledby the vehicles. The resources such as the number and locationof the <strong>de</strong>pots and containers, the number of vehicles and staff, aswell as the collection frequency of the containers in each parishwere already fixed.The output of the study should therefore be the visiting calen<strong>da</strong>rof each container within the weekly planning horizon, consi<strong>de</strong>ringthe constrains of the collection frequency, and the plan of theroutes for each vehicle and <strong>da</strong>y, with the additional constraint thatthe routes must go through a disposal facility to unload the wastebefore returning to the <strong>de</strong>pot. Problems with these characteristicsare mo<strong>de</strong>led in the literature as Periodic Vehicle Routing Problems(PVRP), a variant of the Vehicle Routing Problem (VRP).The PVRP is known to be an NP-hard problem and the additionalconstraints that had to be inclu<strong>de</strong>d to a<strong>da</strong>pt the mo<strong>de</strong>l to the real situationof Ponte <strong>de</strong> Lima ma<strong>de</strong> the resolution even more challenging.In or<strong>de</strong>r to be able to solve the real problem we built a frameworkwith three hierarchical approaches, which we have teste<strong>da</strong>long with the monolithic mo<strong>de</strong>l. The hierarchical approaches arei<strong>de</strong>ntified by the aggregation of the <strong>de</strong>cisions in each level: (1)assign and route together; (2) assign <strong>da</strong>ys first - assign vehiclesand route second; (3) assign first - route second and (4) assign<strong>da</strong>ys first - assign vehicles second - route third. Some estimatesof downstream constraints were <strong>de</strong>veloped and ad<strong>de</strong>d in upstreamlevels in or<strong>de</strong>r to guarantee feasibility. We compared the resultsobtained with the MIP formulations <strong>de</strong>veloped for the approachesand with the current practice of the municipality.The remain<strong>de</strong>r of this paper is organized as follows: in section 2, abrief review of the relevant literature is presented. The problem is<strong>de</strong>scribed in section 3 and in section 4 the hierarchical frameworkas well as the <strong>de</strong>veloped formulations are presented. In section 5the results obtained are <strong>de</strong>scribed and the approaches compared.Conclusions are drawn in section 6.2. LITERATURE REVIEWRouting problems have been wi<strong>de</strong>ly treated in the literature becauseof their high complexity and practical relevance. The TravelingSalesman Problem (TSP) is the most discussed routing problemand consists in <strong>de</strong>termining a minimum distance route thatbegins in a given location, passes through all the other locations(customers) and returns to the initial location [2]. In the VehicleRouting Problem (VRP), a fleet of vehicles with known capacityis available to visit customers which have a known <strong>de</strong>mand. Theobjective is to <strong>de</strong>sign routes for the vehicles at minimal total cost,guaranteeing that all the customers are served and that the capacityof the vehicles is not excee<strong>de</strong>d [3]. This problem adds to the TSPthe <strong>de</strong>cision of which customers assign to which vehicles.The Periodic Vehicle Routing Problem (PVRP) is an extension ofthe VRP where customers must be visited with pre-<strong>de</strong>fined frequenciesover an exten<strong>de</strong>d period. The additional component ofthe problem consists in the assignment of one visiting calen<strong>da</strong>rfrom a given set to each customer. The overall objective is to assignroutes to the vehicles for each <strong>da</strong>y of the planning horizonthat minimize the total travel cost. The visiting calen<strong>da</strong>r of eachclient must be met and routes are subject to vehicle capacity androute duration constraints. This problem was formally introducedin 1974 by Beltrami and Bodin as a generalization of the VRP,precisely in an application of municipal waste collection [4].ALIO-EURO <strong>2011</strong> – 137


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Russel and Igo called the PVRP an “Assignment Routing Problem”and mentioned the difficulties of choosing a calen<strong>da</strong>r for eachcustomer together with solving the routing problem [4]. To <strong>de</strong>alwith the complexity and large scale nature of the problem, severalauthors consi<strong>de</strong>r the PVRP as a multilevel problem:1. In the first level, a calen<strong>da</strong>r is selected for each customer. Inthis way, it is <strong>de</strong>ci<strong>de</strong>d which customers are visited on each<strong>da</strong>y of the planning horizon;2. In the second level, and for each <strong>da</strong>y of the planning horizon,customers are assigned to the vehicles available in that<strong>da</strong>y;3. Finally, in the third level, a route is <strong>de</strong>signed for each combinationof <strong>da</strong>y and vehicle.Note that in the VRP, only the last two <strong>de</strong>cisions need to be ma<strong>de</strong>and over a single <strong>da</strong>y only. Being the VRP an NP-hard problem,the PVRP is therefore at least as difficult [5].A significant body of work has been evolving, with multiple variants,formulations and solution methods applied to the PVRP. Threeimportant variants of the PVRP are mostly addressed in the literature:the PVRP with time window constraints – PVRPTW [6], withservice choice – PVRP-SC [7], with multiple <strong>de</strong>pots – MDPVRP[8] and with intermediate facilities – PVRP-IF [9]. In this last variant,capacity replenishment is possible at different points along theroute. As far as formulations are concerned, the most used one isthe 4-in<strong>de</strong>x formulation from Christofi<strong>de</strong>s and Beasley, based onthe VRP 3-in<strong>de</strong>x formulation from Gol<strong>de</strong>n et al [4]. Other formulationshave been emerging, consi<strong>de</strong>ring only the assignment problems[10, 11, 12]. More recently, alternative mo<strong>de</strong>ling approacheshave been emerging, such as the Set Partitioning (SP) [13]. For instancesof realistic size, the problem has been solved mostly withheuristics and metaheuristics and in sequential phases. Two-phasesolution methods are more commonly found (a survey on solutionmethods can be found in [4]).In [14], Ball states that solving an hierarchical problem is morethan solving a set of distinct problems. It is necessary to guaranteefeasibility in the downstream levels by including approximatemeasurements of lower level constraints in upstream levels. In thePVRP, this means that in the assignment problems it is necessaryto guarantee that the number of customers assigned to a vehicle ina <strong>da</strong>y neither exceeds its capacity nor leads to subproblems whereit is not possible to create any route without exceeding its maximumduration. Whereas vehicle capacity constraints have alreadyappeared in assignment problems, approximate measurements ofroute duration have not been covered so far.To conclu<strong>de</strong>, and concerning waste collection, this practical applicationhas already been studied in the literature, not only concerningmixed but also separate waste [15, 16, 5, 17, 18].3. PROBLEM DEFINITIONThe municipality of Ponte <strong>de</strong> Lima owns and operates a fleet of5 vehicles with different capacities for the mixed-waste collection.These vehicles are parked in a garage in a central parish– Arca. The 994 mixed-waste containers are non-uniformly distributedover Ponte <strong>de</strong> Lima and the waste is periodically collecte<strong>da</strong>nd transported to disposal facilities, where afterwards itis whether dumped in a controlled environment or transformed.The filling rates of the containers are highly <strong>de</strong>pen<strong>de</strong>nt on the <strong>de</strong>nsityof both the containers and the inhabitants of the region. Theyalso <strong>de</strong>pend on the collection frequency imposed. The collectionis performed 6 <strong>da</strong>ys a week. Figure 1 shows the location of the twoexisting disposal facilities and the <strong>de</strong>pot as well as the collectionfrequency of the containers within each parish.Currently the plans are monthly hand-ma<strong>de</strong>, without assuring thatthe collection frequency matches the frequencies <strong>de</strong>fined for eachparish.3.1. ObjectiveDifferent filling rates led the municipality to establish different frequenciesof collection for the containers. Therefore, for a givenplanning horizon, a set of routes is required for each vehicle aswell as a visiting schedule for each container. Each route shouldconsist of an or<strong>de</strong>red list of visiting sites that ends on a disposalfacility to <strong>de</strong>posit the waste after collection. The lowest frequencyfor a container is one visit in a week, which suggests a collectionplan of one week.The objective is to minimize collection costs, which are essentially<strong>de</strong>pen<strong>de</strong>nt on the distance traveled by the vehicles. Routes are constrainedby vehicle capacity and work shift duration. Each containershould be visited as many times per week as its frequencyand the visiting <strong>da</strong>ys should be distributed as uniformly as possiblethrough the period.4. A FRAMEWORK OF ALTERNATIVEHIERARCHICAL FORMULATIONSThe problem <strong>de</strong>scribed in section 3 can be formulated as a PeriodicVehicle Routing Problem. An additional constraint is observedthough: routes must pass through a disposal facility to unload thewaste before returning to the <strong>de</strong>pot.The <strong>de</strong>composition of highly complex optimization problems intosubproblems, hierarchically solved, is a well-known strategy in theliterature (e.g. [11, 14]). Not only the problem becomes moreefficiently solvable, but it is also taken into account that, in thecontext of real-world applications, these complex problems ariseun<strong>de</strong>r broa<strong>de</strong>r <strong>de</strong>cision making contexts, with <strong>de</strong>cisions ma<strong>de</strong> bydifferent actors and with different time horizon scopes. Therefore,it does make sense to break down the problem into subproblems,not loosing sight from the hierarchical relationships among them.On the other hand there is the well-known fact that solving untiloptimality a sequence of subproblems does not guarantee optimalityfor the overall problem resolution. However, given the size ofreal-world applications, the global optimum would be out of reach.An additional advantage of hierarchical approaches is the possibilityof consi<strong>de</strong>ring different optimization criteria at each level [11].Bearing this in mind, in figure 2 we propose a framework of <strong>de</strong>compositionprocesses for the PVRP, based on different aggregationsof the three <strong>de</strong>cisions involved in the problem and i<strong>de</strong>ntifiedin section 2. In fact, the PVRP is too difficult to be solved directlyby exact methods when consi<strong>de</strong>ring instances of realistic size. Allthe subproblems i<strong>de</strong>ntified are smaller and more amenable to rapidsolutions.The approaches are:1. Deciding at the same time which customers will be servedin each <strong>da</strong>y of the week, by which vehicle, and in whichsequence (assign and route together);2. Deciding first which customers will be served in each <strong>da</strong>yof the week, and afterwards by which vehicle and in whichsequence (assign <strong>da</strong>ys first - assign vehicles and route second);3. Deciding at the same time which customers will be servedin each <strong>da</strong>y of the week and by which vehicle, and afterwardsin which sequence (assign first - route second);4. Deciding first which customers will be served in each <strong>da</strong>yof the week, then by which vehicle, and finally in whichALIO-EURO <strong>2011</strong> – 138


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Figure 1: Ponte <strong>de</strong> Lima Collection System: (i) Disposal Facilities, (ii) Depot, (iii) Collection frequency in each parishsequence (assign <strong>da</strong>ys first - assign vehicles second - routethird).of the upper levels. Finally, the experience with the case study instanceallowed some adjustments in the parameters of the mo<strong>de</strong>ls.5. COMPUTATIONAL RESULTSFigure 2: Alternative Decomposition Approaches to the PVRPThe first levels correspond to assignment problems whereas thelast level of each approach corresponds to a routing problem. Thecomplexity of the routing problems <strong>de</strong>crease from the first to thelast approach but the number of times that a routing problem issolved increases. For instance, to solve the problem of the casestudy, in the approach 2 the VRP is solved 6 times, whereas inapproaches 3 and 4 the TSP is solved a maximum of 30 times.Some authors proposed approaches complementary to cluster first- route second, namely route first - cluster second. However, asstated in [14], these approaches do not perform as well from acomputational perspective.To build the framework, different formulations from the literaturewere put together, and divi<strong>de</strong>d by type of approach. All theproblems i<strong>de</strong>ntified in the framework were formulated taking intoconsi<strong>de</strong>ration the practical application features and the formulationsscattered before. As far as routing is concerned, the traditionaltwo (TSP) and three (VRP) in<strong>de</strong>x formulations were consi<strong>de</strong>redbecause of their greater flexibility in incorporating additionalfeatures [3]. To eliminate subtours, a transit load constraint wasused instead of the traditional Dantzig-Fulkerson-Johnson subtourelimination constraint [2, 3, 19]. This constraint is a 4–in<strong>de</strong>x versionof the generalized Miller-Tucker-Zemlin subtour eliminationconstraints. Concerning the assignment problems, our formulationsinclu<strong>de</strong> some new <strong>de</strong>velopments to prevent infeasibility in thedownstream levels. An estimation of route duration is proposed inor<strong>de</strong>r to prevent that the routes exceed maximum duration. To thebest of our knowledge, this is the first time that this constraint isaddressed in upper levels. In what concerns vehicle capacity, wehave introduced a slack parameter in the corresponding constraintThe alternative approaches, and corresponding MIP formulations,were evaluated with the case study instance, whose characteristicswere <strong>de</strong>scribed in section 3. The results were compared in termsof objective function value, total execution time and average gapbetween the integer solution and the lower bound found by CPLEXin each sub-problem (Gap). Additionally, the number of routes andthe duration of the longest route were recor<strong>de</strong>d. The total numberof variables and constraints of the mo<strong>de</strong>ls generated to solve eachlevel were also analyzed.All hierarchical approaches presented a reduction of more than70% on both the number of variables and on the number of constraints,when compared with the monolithic mo<strong>de</strong>l. It is importantto bear in mind that these numbers <strong>de</strong>pend not only on the instancebut also on the running conditions because the number of variablesand constraints of the lower levels are influenced by the results(concrete <strong>de</strong>cision variable values) of upper levels’ problems.When tested with the case study instance, the monolithic mo<strong>de</strong>lof approach 1 did not achieve any solution within the time limit.This confirms, also for this case study, the difficulty of the problemwhich was precisely the reason that has led several authors toconsi<strong>de</strong>r the PVRP as a multilevel problem and the motivation forthis work.The best results were obtained with approach 2 (assign <strong>da</strong>ys first- assign vehicles and route second), not only concerning total distancebut also the number of routes. Interestingly, this was theapproach with higher gaps in its two levels. In fact, the overallsolution quality is mostly influenced by routing <strong>de</strong>cisions as these<strong>de</strong>cisions directly influence total distance and the duration of theroutes. By assigning vehicles together with the routing activity weare giving the mo<strong>de</strong>l freedom to explore a wi<strong>de</strong>r solution spacebased on correct estimates of distances and times.In spite of achieving optimal solutions on the routing problems andhaving the lowest gap in the first level, approach 3 (assign <strong>da</strong>ys andvehicles first - route second) had the worst global performance. Infact, the problem of assigning <strong>da</strong>ys and vehicles still has a consi<strong>de</strong>rabledimension, with three times more constraints than the othertwo equivalent hierarchical approaches.At last, approach 4 (assign <strong>da</strong>ys first - assign vehicles second -ALIO-EURO <strong>2011</strong> – 139


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>route third) performed in second. This is the only approach withthree levels and it was the one generating the smallest number ofvariables and constraints, which suggests that its problems are simplerand more efficiently solvable.The fun<strong>da</strong>mental reason for <strong>de</strong>composing a problem is that theoverall problem is too difficult to be solved monolithically. Thus,it is essential that individual problems are efficiently solvable. Onthe other hand, when increasing the number of levels we are restrictingmore and more the solution space. These facts, supportedby the results obtained, raise once more the question of the tra<strong>de</strong>–off between the number of <strong>de</strong>compositions and the difficulty ofthe resulting problems. Also important is the ability to estimateaccurately the distance measure in the upper levels. In fact, thismeasure evaluates the solutions and should remain as close as possibleto the original objective function.Improved route plans were obtained, not only concerning totaldistance run by the vehicles but also the number of routes. Besi<strong>de</strong>sthe reduction in operational costs, an improved service levelis expected, since the frequency of collection is guaranteed andthe space between consecutive visits to each container is balanced.Moreover, the work shift duration is not excee<strong>de</strong>d. These wereproblems faced by the municipality with its current plans.6. CONCLUSIONSIn this paper, motivated by a real case scenario of a waste collectionproblem, we proposed a framework of MIP mo<strong>de</strong>ls with amonolithic mo<strong>de</strong>l and three hierarchical approaches to the PeriodicVehicle Routing Problem. The hierarchical approaches were i<strong>de</strong>ntifiedby the aggregation of the <strong>de</strong>cision variables in each level:(1) assign and route together; (2) assign <strong>da</strong>ys first - assign vehiclesand route second; (3) assign first - route second and (4) assign<strong>da</strong>ys first - assign vehicles second - route third. Estimates of downstreamconstraints were also <strong>de</strong>veloped and ad<strong>de</strong>d at the upper levelsin or<strong>de</strong>r to guarantee feasibility at the lower levels: maximumduration of routes and maximum load capacity of vehicles.The hierarchical approach (2), assign <strong>da</strong>ys first - assign vehiclesand route second, led to better results consi<strong>de</strong>ring either the totaldistance traveled or the total number of routes. The hierarchicalresolution raised two important points: the tra<strong>de</strong>–off between thenumber of <strong>de</strong>compositions and the difficulty of the resulting subproblemand the importance of an accurate estimation of the distanceof the routes in the upper levels.In what concerns our case study, our mo<strong>de</strong>ls were able to obtainbetter results when compared to the current practice in the municipality.An improved service level is also expected, since thefrequency of collection is guaranteed and the space between consecutivevisits to each container is balanced, moreover, the workshift duration is not excee<strong>de</strong>d. These were problems faced by themunicipality with its current plans.As future work, the framework can be exten<strong>de</strong>d to take into accountmultiple <strong>de</strong>pots (MDPVRP). For urban areas with minordistances between collection points, the possibility of returning tocollection activity after disposal can also be incorporated (PVRP-IF). Other MIP formulations might be <strong>de</strong>veloped for the subproblems,with alternative distance estimates or consi<strong>de</strong>ring differentapproaches to the subproblems. Another area of future research isthe incorporation of other optimization criteria.7. REFERENCES[1] J. Pichtel, Waste management practices: municipal, hazardous,and industrial. Taylor & Francis, 2005.[2] T. Oncan, I. K. Altinel, and G. Laporte, “A comparative analysisof several asymmetric traveling salesman problem formulations,”Computers and Operations Research, vol. 36,no. 3, pp. 637–654, 2009.[3] P. Toth and D. Vigo, The vehicle routing problem, ser. SIAMmonographs on discrete mathematics and applications. Societyfor Industrial and Applied Mathematics, 2002.[4] P. M. Francis, K. R. Smilowitz, and M. Tzur, The periodvehicle routing problem and its extensions, ser. OperationsResearch/Computer Science Interfaces Series, B. Gol<strong>de</strong>n,S. Raghavan, and E. Wasil, Eds. Springer US, 2008, vol. 43.[5] J. Teixeira, A. Antunes, and J. Desousa, “Recyclable wastecollection planning–a case study,” European Journal of OperationalResearch, vol. 158, no. 3, pp. 543–554, Nov. 2004.[6] J. François Cor<strong>de</strong>au, G. Laporte, and A. Mercier, “A unifiedtabu search heuristic for vehicle routing problems with timewindows,” The Journal of the Operational Research Society,vol. 52, no. 8, pp. 928–936, 2001.[7] P. Francis, K. Smilowitz, and M. Tzur, “The period vehiclerouting problem with service choice,” Transportation Science,vol. 40, no. 4, pp. 439–454, 2006.[8] E. Hadjiconstantinou and R. Bal<strong>da</strong>cci, “A multi-<strong>de</strong>pot periodvehicle routing problem arising in the utilities sector,” TheJournal of the Operational Research Society, vol. 49, no. 12,pp. 1239–1248, 1998.[9] E. Angelelli and M. G. Speranza, “The periodic vehicle routingproblem with intermediate facilities,” European Journalof Operational Research, vol. 137, no. 2, pp. 233–247, 2002.[10] C. C. R. Tan and J. E. Beasley, “A heuristic algorithm for theperiod vehicle routing problem,” Omega, vol. 12, no. 5, pp.497–504, 1984.[11] M. Mourgaya and F. Van<strong>de</strong>rbeck, “Column generation basedheuristic for tactical planning in multi-period vehicle routing,”European Journal of Operational Research, vol. 183,no. 3, pp. 1028–1041, 2007.[12] B. M. Baker and J. Sheasby, “Extensions to the generalise<strong>da</strong>ssignment heuristic for vehicle routing,” European Journalof Operational Research, vol. 119, no. 1, pp. 147–157, 1999.[13] R. Bal<strong>da</strong>cci, E. Bartolini, A. Mingozzi, and R. Roberti, “Anexact solution framework for a broad class of vehicle routingproblems,” Computational Management Science, vol. 7, pp.229–268, 2010.[14] M. O. Ball, “Heuristics based on mathematical programming,”Surveys in Operations Research and ManagementScience, vol. 16, no. 1, pp. 21–38, <strong>2011</strong>.[15] D. Tung, “Vehicle routing-scheduling for waste collectionin Hanoi,” European Journal of Operational Research, vol.125, no. 3, pp. 449–468, Sep. 2000.[16] E. Angelelli and M. G. Speranza, “The application of a vehiclerouting mo<strong>de</strong>l to a waste-collection problem: two casestudies,” The Journal of the Operational Research Society,vol. 53, no. 9, pp. 944–952, 2002.[17] A. C. Matos and R. C. Oliveira, “An experimental study ofthe ant colony system for the period vehicle routing problem,”Ant Colony, Optimization and Swarm Intelligence, vol.3172, pp. 1–29, 2004.[18] T. R. P. Ramos and R. C. Oliveira, “Delimitation of serviceareas in reverse logistics networks with multiple <strong>de</strong>pots,”Journal of the Operational Research Society, pp. 1–13, Jun.2010.[19] I. Kara, G. Laporte, and T. Bektas, “A note on the liftedMiller-Tucker-Zemlin subtour elimination constraints for thecapacitated vehicle routing problem,” European Journal ofOperational Research, vol. 158, no. 3, pp. 793–795, Nov.2004.ALIO-EURO <strong>2011</strong> – 140


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Branch and Cut and Price for the Time Depen<strong>de</strong>nt Vehicle Routing Problem withTime WindowsSaid Dabia ∗ Stefan Røpke † Tom Van Woensel ∗ Ton De Kok ∗∗ Eindhoven University of Technology, School of Industrial EngineeringEindhoven, The Netherlands{s.<strong>da</strong>bia, t.v.woensel, a.g.d.kok}@tue.nl† Denmark University of Technology, Department of TransportCopenhagen, Denmarksr@transport.dtu.dkABSTRACTIn this paper, we consi<strong>de</strong>r the Time-Depen<strong>de</strong>nt Vehicle RoutingProblem with Time Windows (TDVRPTW). Travel times are time<strong>de</strong>pen<strong>de</strong>nt(e.g. due to road congestion), meaning that <strong>de</strong>pendingon the <strong>de</strong>parture time from a customer a different travel time isincurred. Because of time-<strong>de</strong>pen<strong>de</strong>ncy, vehicles’ dispatch timesfrom the <strong>de</strong>pot are crucial as road congestion might be avoi<strong>de</strong>d.Due to its complexity, all existing solutions to the TDVRPTW arebased on (meta-) heuristics and no exact methods are known forthis problem. In this paper, we propose the first exact method tosolve the TDVRPTW. The MIP formulation is <strong>de</strong>composed into amaster problem that is solved by means of column generation, an<strong>da</strong> pricing problem. To insure integrality, the resulting algorithm isembed<strong>de</strong>d in a Branch and Cut framework. We aim to <strong>de</strong>terminethe set of routes with the least total travel time. Furthermore, foreach vehicle, the best dispatch time from the <strong>de</strong>pot is calculated.Keywords: Vehicle routing problem, Column generation, Time<strong>de</strong>pen<strong>de</strong>nttravel times, Branch and cut1. INTRODUCTIONThe vehicle routing problem with time windows (VRPTW) concernsthe <strong>de</strong>termination of a set of routes starting and ending at a<strong>de</strong>pot, in which the <strong>de</strong>mand of a set of geographically scatteredcustomers is fulfilled. Each route is traversed by a vehicle with afixed and finite capacity, and each customer must be visited exactlyonce. The total <strong>de</strong>mand <strong>de</strong>livered in each route should not exceedthe vehicle’s capacity. At customers time windows are imposed,meaning that service at a customer is only allowed to start withinits time window. The solution to the VRPTW consists of the set ofroutes with the least traveled distance.Due to its practical relevance, the VRPTW has been extensivelystudied in the literature. Consequently, many (meta-) heuristicsand exact methods have been successfully <strong>de</strong>veloped to solve it.However, most of the existing mo<strong>de</strong>ls are time-in<strong>de</strong>pen<strong>de</strong>nt, meaningthat a vehicle is assumed to travel with constant speed throughoutits operating period. Because of road congestion, vehicleshardly travel with constant speed. Obviously, solutions <strong>de</strong>rivedfrom time-in<strong>de</strong>pen<strong>de</strong>nt mo<strong>de</strong>ls to the VRPTW could be infeasiblewhen implemented in real-life. In fact, in real-life road congestionresults in tremendous <strong>de</strong>lays. Consequently, it is unlikely that avehicle respects customers’ time windows. Therefore, it is highlyimportant to consi<strong>de</strong>r time-<strong>de</strong>pen<strong>de</strong>nt travel times when <strong>de</strong>alingwith the VRPTW.In this paper, we consi<strong>de</strong>r the time-<strong>de</strong>pen<strong>de</strong>nt vehicle routing problemwith time windows (TDVRPTW). We take road congestioninto account by assuming time-<strong>de</strong>pen<strong>de</strong>nt travel times: <strong>de</strong>pendingon the <strong>de</strong>parture time at a customer a different travel time is incurred.We divi<strong>de</strong> the planning horizon into time zones (e.g. morning,afternoon, etc.) where a different speed is associated with eachof these zones. The resulting stepwise speed function is translatedinto travel time functions that satisfy the First-In First-Out (FIFO)principle. Because of the time-<strong>de</strong>pen<strong>de</strong>ncy, the vehicles’ dispatchtimes from the <strong>de</strong>pot are crucial. In fact, a later dispatch time fromthe <strong>de</strong>pot might result in a reduced travel time as congestion mightbe avoi<strong>de</strong>d. In this paper, we aim to <strong>de</strong>termine the set of routeswith the least total travel time. Furthermore, for each vehicle, thebest dispatch time from the <strong>de</strong>pot is calculated.Despite numerous publications <strong>de</strong>aling with the vehicle routingproblem, very few addressed the inherent time-<strong>de</strong>pen<strong>de</strong>nt natureof this problem. Additionally, to our knowledge, all existing algorithmsare based on (meta-) heuristics, and no exact approach hasbeen provi<strong>de</strong>d for the TDVRPTW. In this paper, we solve the TD-VRPTW exactly. We use the flow arc formulation of the VRPTWwhich is <strong>de</strong>composed into a master problem (set partitioning problem)and a pricing problem. While the master problem remainsunchanged, compared to that of the VRPTW (as time-<strong>de</strong>pen<strong>de</strong>ncyis implicitly inclu<strong>de</strong>d in the set of feasible solutions) the pricingproblem is translated into a time-<strong>de</strong>pen<strong>de</strong>nt elementary shortestpath problem with resource constraints (TDESPPRC), where timewindows and capacity are the constrained resources. The relaxationof the master problem is solved by means of column generation.To guarantee integrality, the resulting column generation algorithmis embed<strong>de</strong>d in a branch-and-bound framework. Furthermore,in each no<strong>de</strong>, we use cutting planes in the pricing problem toobtain better lower bounds and hence reduce the size of branchingtrees. This results in a branch-and-cut-and-price (BCP) algorithm.Time-<strong>de</strong>pen<strong>de</strong>ncy in travel times increases the complexity of thepricing problem. In fact, the set of feasible solutions increases asthe cost of a generated column (i.e. route) does not <strong>de</strong>pend only onthe visited customers, but also on the vehicles’ dispatch time fromthe <strong>de</strong>pot. The pricing problem in case of the VRPTW is usuallysolved by means of a labeling algorithm. However, the labelingalgorithm <strong>de</strong>signed for the VRPTW is incapable to <strong>de</strong>al with time<strong>de</strong>pen<strong>de</strong>ncyin travel times and needs to be a<strong>da</strong>pted. In this paper,we <strong>de</strong>velop a time-<strong>de</strong>pen<strong>de</strong>nt labeling (TDL) algorithm such thatin each label the arrival time function (i.e. function of the <strong>de</strong>parturetime from the <strong>de</strong>pot) of the corresponding partial path is stored. theTDL generates columns that have negative reduced cost togetherwith their best dispatch time from the <strong>de</strong>pot. To accelerate the BCPalgorithm, two heuristics based on the TDL algorithm are <strong>de</strong>signedto quickly find columns with negative reduced cost. Moreover,new dominance criteria are introduced to discard labels that do notALIO-EURO <strong>2011</strong> – 141


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>lead to routes in the final optimal solution. Furthermore, we relaxthe pricing problem by allowing non-elementary paths. The resultingpricing problem is a time-<strong>de</strong>pen<strong>de</strong>nt shortest path problemwith resource constraints (TDSPPRC). Although the TDSPPRCresults in worse lower bounds, it is easier to solve and integralityis still guaranteed by branch-and-bound. Moreover, TDSPPRCshould work well for instances with tight time windows. Over thelast <strong>de</strong>ca<strong>de</strong>s, BCP proved to be the most successful exact methodfor the VRPTW. Hence, our choice for a BCP framework to solvethe TDVRPTW is well motivated.The main contributions of this paper are summarized as follows.First, we present an exact method for the TDVRPTW. We proposea branch-and-cut-and price algorithm to <strong>de</strong>termine the set of routeswith the least total travel time. Contrary to the VRPTW, the pricingproblem is translated into a TDESPPRC and solved by a time<strong>de</strong>pen<strong>de</strong>ntlabeling algorithm. Second, we capture road congestionby incorporating time-<strong>de</strong>pen<strong>de</strong>nt travel times. Because of time <strong>de</strong>pen<strong>de</strong>ncy,vehicles’ dispatch times from the <strong>de</strong>pot are crucial. Inthis paper, dispatch times from the <strong>de</strong>pot are also optimized. Inthe literature as well as in practice, dispatch time optimization isapproached as a post-processing step, i.e. given the routes, the optimaldispatch times are <strong>de</strong>termined. In this paper, the scheduling(dispatch time optimization) and routing are simultaneously performed.2. LITERATURE REVIEWAn abun<strong>da</strong>nt number of publications is <strong>de</strong>voted to the vehicle routingproblem (see [1], [2], and [3] for good reviews). Specifically,the VRPTW has been extensively studied. For good reviews onthe VRPTW, the rea<strong>de</strong>r is referred to [4], and [5]. The majority ofthese publications assume a time-in<strong>de</strong>pen<strong>de</strong>nt environment wherevehicles travel with a constant speed throughout their operatingperiod. Perceiving that vehicles operate in a stochastic and dynamicenvironment, more researchers moved their effort towardsthe optimization of the time-<strong>de</strong>pen<strong>de</strong>nt vehicle routing problems.Nevertheless, literature on this subject remains scarce.In the context of dynamic vehicle routing, we mention the work of[6], [7] and [8] where a probabilistic analysis of the vehicle routingproblem with stochastic <strong>de</strong>mand and service time is provi<strong>de</strong>d.[9], [10] and [11] tackle the vehicle routing problem where vehicles’travel time <strong>de</strong>pends on the time of the <strong>da</strong>y, and [12] consi<strong>de</strong>rsa time-<strong>de</strong>pen<strong>de</strong>nt traveling salesman problem. Time-<strong>de</strong>pen<strong>de</strong>nttravel times has been mo<strong>de</strong>led by dividing the planning horizoninto a number of zones, where a different speed is associated witheach of these time zones (see [11] and [13]). In [14], traffic congestionis captured using a queuing approach. [9] and [12] mo<strong>de</strong>lstravel time using stepwise function, such that different time zonesare assigned different travel times. [15] emphasized that mo<strong>de</strong>lingtravel times as such leads to the un<strong>de</strong>sired effect of passing.That is, a later start time might lead to an earlier arrival time. Asin [11], we consi<strong>de</strong>r travel time functions that adhere to the FIFOprinciple. Such travel time functions does not allow passing.While several successful (meta-) heuristics and exact algorithmshave been <strong>de</strong>veloped to solve the VRPTW, algorithms <strong>de</strong>signed to<strong>de</strong>al with the TDVRPTW are somewhat limited to (meta-) heuristics.In fact, most of the existing algorithms are based on tabusearch ([11], [14], [13] and [16]). In [9] mixed integer linearformulations the time-<strong>de</strong>pen<strong>de</strong>nt vehicle routing problem are presente<strong>da</strong>nd several heuristics based on nearest neighbor and cuttingplanes are provi<strong>de</strong>d. [17] proposes an algorithm based on a multiant colony system and [18] presents a genetic algorithm. In [19] alocal search algorithm for the TDVRPTW is <strong>de</strong>veloped and a dynamicprogramming is embed<strong>de</strong>d in the local search to <strong>de</strong>terminethe optimal starting for each route. [20] consi<strong>de</strong>rs a multi-criteriarouting problem, they propose an approach based on the <strong>de</strong>compositionof the problem into a sequence of elementary itinerary subproblemsthat are solved by means of dynamic programming. [12]presents a restricted dynamic programming for the time-<strong>de</strong>pen<strong>de</strong>nttraveling salesman problem. In each iteration of the dynamic programming,only a subset with a pre<strong>de</strong>fined size and consisting ofthe best solutions is kept and used to compute solutions in the nextiteration. [21] emphasizes the difficulty of implementing route improvementprocedures in case of time-<strong>de</strong>pen<strong>de</strong>nt travel times andproposes efficient ways to <strong>de</strong>al with that issue. In this paper, weattempt to solve the TDVRPTW to optimality using column generation.To the best of our knowledge, this is the first time an exactmethod for the TDVRPTW is presented.Column generation has been successfully implemented for theVRPTW. For an overview of column generation algorithms, therea<strong>de</strong>r is referred to [22]. in the context of the VRPTW, [23] <strong>de</strong>signe<strong>da</strong>n efficient column generation algorithm where they appliedsubtour elimination constraints and 2-path cuts. This hasbeen improved by [24] by applying k-path cuts. [25] proposes acolumn generation algorithm by applying subset-row inequalitiesto the master problem (set partitioning). Although, adding subsetrowinequalities to the master problem increases the complexity ofthe pricing problem, [25] shows that better lower bounds can beobtained from the linear relaxation of the master problem. To acceleratethe pricing problem solution, [26] proposes a tabu searchheuristic for the ESPPRC. Furthermore, elmentarity is relaxed fora subset of no<strong>de</strong>s and generalized k-inequalities are introduced.Recently, [27] introduce a new route relaxation, called ng-route,used to solve the pricing problem. Their framework proves to bevery effective in solving difficult instances of the VRPTW withwi<strong>de</strong> time windows. [15] argued that existing algorithms for theVRPTW fail to solve the TDVRPTW. One major drawback of theexisting algorithms is the incapability to <strong>de</strong>al with the dynamic natureof travel times. Therefore, existing algorithms for the VRPTWcan not be applied to the TDVRPTW without a radical modificationof their structure. In this paper, a branch-and-cut-and-priceframework is modified such that time-<strong>de</strong>pen<strong>de</strong>nt travel times canbe incorporated.3. PROBLEM DESCRIPTIONWe consi<strong>de</strong>r a graph G(V,A) on which the problem is <strong>de</strong>fined. V ={0,1,...,n,n+1} is the set of all no<strong>de</strong>s such that V c = V /{0,n+1}represents the set of customers that need to be served. Moreover,0 is the start <strong>de</strong>port and n + 1 is the end <strong>de</strong>pot. A = {(i, j) : i ≠j and i, j ∈ V } is the set of all arcs between the no<strong>de</strong>s. Let Kbe the set of homogeneous vehicles such that each vehicle has afinite capacity Q and q i <strong>de</strong>mand of customer i ∈ V c . We assumeq 0 = q n+1 = 0 and |K| is unboun<strong>de</strong>d. Let a i and b i be respectivelythe opening and closing time of no<strong>de</strong>’s i time window. At no<strong>de</strong> i,a service time s i is nee<strong>de</strong>d. We <strong>de</strong>note t i <strong>de</strong>parture time from no<strong>de</strong>i ∈ V and τ i j (t i ) travel time from no<strong>de</strong> i to no<strong>de</strong> j which <strong>de</strong>pendon the <strong>de</strong>parture time at no<strong>de</strong> i.3.1. Travel Time and Arrival Time FunctionsWe divi<strong>de</strong> the planning horizon into time zones where a differentspeed is associated with each of these zones. The resultingstepwise speed function is translated into travel time functions thatsatisfy the First-In First-Out (FIFO) principle. Usually traffic networkshave a morning and an afternoon congestion period. Therefore,we consi<strong>de</strong>r speed profiles that have two periods with relativelylow speeds. In the rest of the planning horizon, speeds arerelatively high. This complies with <strong>da</strong>ta collected for a Belgianhighway ([28]). Given a partial path P i starting at the <strong>de</strong>pot 0 an<strong>de</strong>nding at some no<strong>de</strong> i, the arrival time at i <strong>de</strong>pends on the dispatchALIO-EURO <strong>2011</strong> – 142


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>time t 0 at the <strong>de</strong>pot. Due to the FIFO property of the travel timefunctions, a later dispatch at the <strong>de</strong>pot will result in a later arrivalat no<strong>de</strong> i. Therefore, if route P i is unfeasible for some dispatchtime t 0 at the <strong>de</strong>pot (i.e. time windows are violated), P i will beunfeasible for any dispatch time at the <strong>de</strong>pot that is later than t 0 .Moreover, If we <strong>de</strong>fine δ i (t 0 ) as the arrival time function at no<strong>de</strong> igiven a dispatch time t 0 at the <strong>de</strong>pot, δ i (t 0 ) will be non-<strong>de</strong>creasingin t 0 . We call the parent no<strong>de</strong> j of no<strong>de</strong> i, the no<strong>de</strong> that is visiteddirectly before no<strong>de</strong> i on route P i . δ j (t 0 ) is the arrival time at jgiven a dispatch time t 0 at the <strong>de</strong>pot, and τ ji (δ j (t 0 )) is the incurredtravel time from j to i. Consequently, for every i ∈ V , δ i (t 0 ) isrecursively calculated as follows:δ 0 (t 0 ) = t 0 and δ i (t 0 ) = δ j (t 0 ) + τ ji (δ j (t 0 )) (1)4. COLUMN GENERATIONTo <strong>de</strong>rive the set partitioning formulation for the TDVRPTW, we<strong>de</strong>fine Ω as the set of feasible paths. A feasible path is <strong>de</strong>fined bythe sequence of customers visited along it and the dispatch time atthe <strong>de</strong>pot. To each path p ∈ Ω, we associate the cost c p which issimply its duration. Hence:c p = e p − s p (2)Where e p and s p are respectively the end time and the start time ofpath p. Furthermore, if y p is a binary variable that takes the value 1if and only if the path p is inclu<strong>de</strong>d in the solution, the TDVRPTWis formulated as the following set partitioning problem:minz M = ∑ c p y p (3)p∈Ωsubject to:∑ a ip y p = 1 ∀i ∈ V (4)p∈Ωy p ∈ {0,1} ∀p ∈ Ω.(5)The objective function (3) minimize the duration of the chosenroutes. Constraint (4) guarantees that each no<strong>de</strong> is visited onlyonce. Solving the LP-relaxation, resulting from relaxing the integralityconstraints of the variables y p , of the master problem provi<strong>de</strong>sa lower bound on its optimal value. The set of feasible pathsΩ is usually very large making it hard to solve the LP-relaxationof the master problem. Therefore, we use column generation. Incolumn generation, a restricted master problem is solved by consi<strong>de</strong>ringonly a subset Ω ′ ⊆ Ω of feasible paths. Additional pathswith negative reduced cost are generated after solving a pricingsubproblem. The pricing problem for the TDVRPTW is (the in<strong>de</strong>xk is dropped):minz P = ∑ τ i j (ω i )x i j (6)(i, j)∈AFurthermore, τ i j (ω i ) = τ i j (ω i ) − π i is the arc reduced cost, whereπ i is the dual variable associated with servicing no<strong>de</strong> i. In themaster problem, π i results from the constraint corresponding tono<strong>de</strong> i in the set of constraints (4). The objective function of thepricing problem can be expressed as:or in the variables x i j as:z P = e p − s p − ∑i∈V ca ip π i (7)( )z P = e p − s p − ∑ π i ∑ x i ji∈V c j∈γ + (i)(8)4.1. The Pricing ProblemSolving the pricing problem involves finding columns (i.e. routes)with negative reduced cost that improve the objective function ofmaster problem. In case of the TDVRPTW, this corresponds tosolving the TDESPPRC and finding paths with negative cost. TheTDESPPRC is a generalization of the ESPPRC in which costs aretime-<strong>de</strong>pen<strong>de</strong>nt. In this paper, we solve the pricing problem bymeans of a time-<strong>de</strong>pen<strong>de</strong>nt labeling (TDL) algorithm which is amodification of the labeling algorithm applied to the ESPPRC. Tospeed up the TDL algorithm , a bi-directional search is performedin which labels are exten<strong>de</strong>d both forward from the <strong>de</strong>pot (i.e. no<strong>de</strong>0) to its successors, and backward from the <strong>de</strong>pot (i.e. no<strong>de</strong> n+1) toits pre<strong>de</strong>cessors. While forward labels are exten<strong>de</strong>d to some fixedtime t m (e.g. the middle of the planning horizon) but not further,backward labels are exten<strong>de</strong>d to t m but are allowed to cross t m . Forwar<strong>da</strong>nd backward labels are finally merged to construct completetours. The running time of a labeling algorithm <strong>de</strong>pends on thelength of partial paths associated with its labels. A bi-directionalsearch avoids generating long paths and therefore limits runningtimes.5. COMPUTATIONAL RESULTSThe open source framework COIN is used to solve the linear programmingrelaxation of the master problem. For our numericalstudy, we use the well known Solomon’s <strong>da</strong>ta sets ([29]) that followa naming convention of DT m.n. D is the geographic distributionof the customers which can be R (Random), C (Clustered)or RC (Randomly Clustered). T is the instance type which can beeither 1 (instances with tight time windows) or 2 (instances withwi<strong>de</strong> time windows). m <strong>de</strong>notes the number of the instance andn the number of customers that need to be served. Road congestionis taken into account by assuming that vehicles travel throughthe network using different speed profiles. We consi<strong>de</strong>r speed profileswith two congested periods. Speeds in the rest of the planninghorizon (i.e. the <strong>de</strong>pot’s time window) are relatively high. We consi<strong>de</strong>rspeed profiles that comply with <strong>da</strong>ta from real life. Furthermore,we assume three types of links: fast, normal and slow. Slowlinks might represent links within the city center, fast links mightrepresent highways and normal links might represent the transitionfrom highways to city centers. Moreover, without loss of generality,we assume that breakpoints are the same for all speed profilesas congestion tends to happen around the same time regardless ofthe link’s type (e.g. rush hours).The choice for a link type is donerandomly and remains the same for all instances. Our BCP frameworkis able to solve 75% of the instances with 25 customers, 50%of the instances with 50 customers, and 20% of the instances with100 customers.6. REFERENCES[1] G. Laporte, “The vehicle routing problem: an overview ofexact and approximate algorithms,” European Journal of OperationalResearch, vol. 59, no. 3, pp. 345–358, 1992.[2] P. Toth and D. Vigo, The vehicle Routing Problem. SIAMMonographs on Discrete Mathematics and Applications.SIAM, Phila<strong>de</strong>lphia, 2002, vol. 9.[3] G. Laporte, “What you should know about the vehicle routingproblem,” Naval Research Logistics, vol. 54, pp. 811–819, 2007.[4] O. Bräysy and M. Gendreau, “Vehicle routing problem withtime windows, part i: Route construction and local searchalgorithms,” Transportation Science, vol. 39, no. 1, pp. 104–118, 2005.ALIO-EURO <strong>2011</strong> – 143


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>[5] ——, “Vehicle routing problem with time windows, part ii:Metaheuristics,” Transportation Science, vol. 39, no. 1, pp.119–139, 2005.[6] D. J. Bertsimas and D. Simchi-Levi, “A new generation ofvehicle routing research: robust algorithms, addressing uncertainty,”Operations Research, vol. 44, no. 2, pp. 286–304,1996.[7] D. J. Bertsimas and G. V. Ryzin, “A stochastic and dynamicvehicle routing problem in the euclidian plane,” OperationsResearch, vol. 39, pp. 601–615, 1991.[8] ——, “Stochastic and dynamic vehicle routing problems inthe eucli<strong>de</strong>an plane with multiple capcitated vehicles,” OperationsResearch, vol. 41, pp. 60–76, 1993a.[9] C. Malandraki and R. B. Dial, “A restricted dynamic programmingheuristic algorithm for the time <strong>de</strong>pen<strong>de</strong>nt travelingsalesman problem,” European Journal of OperationalResearch, vol. 90, pp. 45–55, 1996.[10] A. Hill and W. Benton, “Mo<strong>de</strong>ling intra-city time-<strong>de</strong>pen<strong>de</strong>nttravel speeds for vehicle scheduling problems,” EuropeanJournal of Operational Research, vol. 43, no. 4, pp. 343–351, 1992.[11] S. Ichoua, M. Gendreau, and J. Y. Potvin, “Vehicle dispatchingwith time-<strong>de</strong>pen<strong>de</strong>nt travel times,” European Journal ofOperational Research, vol. 144, no. 2, pp. 379–396, 2003.[12] C. Malandraki and M. Daskin, “Time <strong>de</strong>pen<strong>de</strong>nt vehicle routingproblems: formulations, properties and heuristic algorithms,”Transportation Science, vol. 26, no. 3, pp. 185–200,1992.[13] O. Jabali, T. van Woensel, A. <strong>de</strong> Kok, C. Lecluyse, andH. Permans, “Time-<strong>de</strong>pen<strong>de</strong>nt vehicle routing subject to time<strong>de</strong>lay perturbations,” IIE Transaction, vol. 41, pp. 1049–1066, 2009.[14] T. Van Woensel, L. Kerbache, H. Peremans, and N. Van<strong>da</strong>ele,“Vehicle routing with dynamic travel times: a queueingapproach,” European Journal of Operational Research,vol. 186, no. 3, pp. 990–1007, 2008.[15] B. Fleischmann, M. Gietz, and S. Gnutzmann, “Timevaryingtravel times in vehicle routing,” Transportation Science,vol. 38, no. 2, pp. 160–173, 2004.[16] W. Ma<strong>de</strong>n, R. Eglese, and D. Black, “Vehicle routing andscheduling with time-varying <strong>da</strong>ta: A case study,” Journal ofthe Operational Research Society, vol. 61, no. 61, pp. 515–522, 2010.[17] A. F. Donati, R. Montemanni, N. casagran<strong>de</strong>, A. E. Rizzoli,and L. M. Gambar<strong>de</strong>lla, “Time <strong>de</strong>pen<strong>de</strong>nt vehicle routingproblem with a multi colony system,” Eurorpean Journal ofOperational Research, vol. 185, pp. 1174–1191, 2008.[18] A. Haghani and S. Jung, “A dynamic vehicle routing problemwith time-<strong>de</strong>pen<strong>de</strong>nt travel times,” Computers and OperationsResearch, vol. 32, pp. 2959–2986, 2005.[19] H. Hashimoto, M. Yagiura, and T. Ibaraki, “An iterated localsearch algorithm for the time-<strong>de</strong>pen<strong>de</strong>nt vehicle routingproblem with time windows,” Discrete Optimization, vol. 5,pp. 434–456, 2008.[20] K. N. Androutsopoulos and K. G. Zografos, “Solving themulti-criteria time-<strong>de</strong>pen<strong>de</strong>nt routing and scheduling in amultimo<strong>da</strong>l fixed scheduled network,” European Journal ofOperational Research, vol. 192, pp. 18–28, 2009.[21] H. Tang, “Efficcient implementation of improvement proceduresfor vehicle routing with time-<strong>de</strong>pen<strong>de</strong>nt travel times,”Transportation Research Record, pp. 66–75, 2008.[22] M. E. Lübbecke and J. Desrosiers, “Selected topics in columngeneration,” Operations Research, vol. 53, no. 6, pp.1007–1023, 2005.[23] N. Kohl, J. Desrosiers, O. B. G. Madsen, M. M. Solomon,and F. Soumis, “2-path cuts for the vehicle routing problemwith time windows,” Transportation Science, vol. 33, no. 1,pp. 101–116, 1999.[24] W. Cook and J. L. Rich, “A parallel cutting plane algorithmfor the vehicle routing problem with time windows,” TechnicalReport TR99-04, Computational and Applied Mathematics,Rice University, Housten, USA, 1999.[25] M. Jespen, B. Petersen, S. Spoorendonk, and D. Pisinger,“Subset-row inequalities applied to the vehicle-routing problemwith time windows,” Operations Research, vol. 56, no. 2,pp. 497–511, 2008.[26] G. Desaulniers, F. Lessard, and A. Hadjar, “Tabu search, partialelementarity, and generalized k-path inequalities for thevehicle routing problem with time windows,” TransportationScience, vol. 42, no. 3, pp. 387–404, 2008.[27] R. Bal<strong>da</strong>cci, A. Mingozzi, and R. Roberti, “New route relaxationand pricing strategies for the vehicle routing problem,”Working paper, the university of Bologna, 2010.[28] T. Van Woensel and N. Van<strong>da</strong>ele, “Empirical vali<strong>da</strong>tion ofa queueing approach to uninterrupted traffic flows,” 4OR, AQuarterly Journal of Operations Research, vol. 4, no. 1, pp.59–72, 2006.[29] M. M. Solomon, “Algorithms for the vehicle routing andscheduling problems with time window constraints,” OperationsResearch, vol. 35, no. 2, pp. 254–265, 1987.ALIO-EURO <strong>2011</strong> – 144


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>An algorithm based on Iterated Local Search and Set Partitioning for the VehicleRouting Problem with Time WindowsS. Ribas ∗ A. Subramanian ∗ I. M. Coelho ∗ L. S. Ochi ∗ M. J. F. Souza †∗ Universi<strong>da</strong><strong>de</strong> Fe<strong>de</strong>ral FluminenseRua Passo <strong>da</strong> Pátria, 156 - Bloco E - Niterói, Brazil{sribas, anand, imcoelho, satoru}@ic.uff.br† Universi<strong>da</strong><strong>de</strong> Fe<strong>de</strong>ral <strong>de</strong> Ouro PretoCampus Universitário, Morro do Cruzeiro, Ouro Preto, Brazilmarcone@iceb.ufop.brABSTRACTThe Vehicle Routing Problem with Time Windows is a well knownoptimization problem and it has received a lot of attention in operationalresearch literature. This work proposes a hybrid algorithmthat combines the Iterated Local Search metaheuristic, the VariableNeighborhood Descent method and an exact Set Partitioningmo<strong>de</strong>l for solving it. The computational results <strong>de</strong>monstrate thatthe proposed hybrid approach is quite competitive, since out ofthe 56 test problems consi<strong>de</strong>red, the algorithm improved the bestknown solution in 12 cases and equaled the result of another 27.Keywords: Vehicle Routing Problem with Time Windows, HybridAlgorithm, Iterated Local Search, Set Partitioning1. INTRODUCTIONThe Vehicle Routing Problem with Time Windows (VRPTW) is awell known optimization problem and it has received a lot of attentionin operational research literature. In this problem, a fleet ofvehicles must leave the <strong>de</strong>pot, serve customer <strong>de</strong>mands, and returnto the <strong>de</strong>pot, at minimum cost, without violating the capacity of thevehicles as well as the time window specified by each customer.There are two main reasons (operational and theoretical) for investingin research to <strong>de</strong>velop new algorithms for the efficient resolutionof this problem. From the practical/operational point ofview, the costs related to transporting people or merchandise aregenerally high, with a ten<strong>de</strong>ncy to increase, motivated by the actualexpansion of commerce of all types [1]. Researchers calculate that10% to 15% of the final cost of the merchandise commercializedin the world is due to its transport [2]. From the theoretical aspect,since the VRP and most of its variants, including the VRPTW, areNP-hard problems [3], the efficient resolution of these problemsrepresents a challenge for researchers, who, in general, opt forheuristic approaches. The size of this challenge is <strong>de</strong>monstratedby the great number of articles <strong>de</strong>aling with this type of problem.The VRPTW has been <strong>de</strong>alt with various objectives and, in thepresent work, the aim is to minimize the total traveling distancewhich is one of the most commonly found in literature.Given the complexity of the problem, its resolution using pureexact methods is often an extremely arduous task due the largeamount of computational time required. This fact has motivatedthe <strong>de</strong>velopment of new heuristic algorithms for solving VRPTW.It is noteworthy to mention that such algorithms aims at findingnear-optimal solutions using less computational effort.The algorithm proposed in this article for solving VRPTW combinesthe concepts of Iterated Local Search metaheuristic, the VariableNeighborhood Descent method and an exact Set Partitioningmo<strong>de</strong>l, which periodically <strong>de</strong>termines the best combination ofroutes generated during the execution of the algorithm.2. PROPOSED METHODOLOGYThis section explains the proposed hybrid algorithm. Section 2.1presents the <strong>da</strong>ta structure used to represent a VRPTW solution,while Section 2.2 <strong>de</strong>scribes the penalty-based function that evaluatesa solution for the problem. Next, Section 2.3 <strong>de</strong>monstratesthe procedure used to construct the initial solution; and Section2.4 <strong>de</strong>scribes the used neighborhood structures. Finally, Section2.5 presents the proposed algorithm.2.1. Solution representationA route r is <strong>de</strong>fined by a sequence of integer numbers that correspondsto the i<strong>de</strong>ntifiers of the customers in r. A solution s containsa set of routes.2.2. Evaluation functionA solution s is evaluated by the function f , given by the equation(1), which must be minimized:f (s) = ∑r∈sg(r) = ∑r∈s(c(r) + w l .l(r) + w e .e(r)) (1)where: g is a function that evaluates routes; c(r) is the cost of theroute r; l(r) corresponds to the lateness time for r; e(r) is the loa<strong>de</strong>xcess in the route r; w l and w e are penalties per unit of <strong>de</strong>lay an<strong>de</strong>xcess load, respectively. They were empirically set in w l = 200and w e = 300.Notice that when s is feasible, the value given by f will only correspondto the travel cost, since in this case: l(r) = e(r) = 0, ∀r ∈ s.2.3. Constructive procedureTo obtain an initial solution for the VRPTW, a cheapest insertionmethod, called CI-POP(), that explores the Proximate OptimalityPrinciple [4] was <strong>de</strong>veloped. According to this principle, inan optimal sequence of choices, each sub-sequence should also beoptimal. It is worth mentioning that although this principle <strong>de</strong>alsALIO-EURO <strong>2011</strong> – 145


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>with optimal cases, in the <strong>de</strong>veloped algorithm there is no guaranteethat the optimal solution will be obtained, or even parts of theoptimal solution. Thus, this principle is only employed to generatebetter initial solutions.Let |K| be the maximum number of available vehicles. Initially,the constructive algorithm creates |K| empty routes and a list ofcandi<strong>da</strong>tes to be inserted in the set of routes. The i<strong>de</strong>a of the procedureis to iteratively insert each candi<strong>da</strong>te customer in the bestlocation. A local search is periodically performed in the partialsolution. More specifically, the parameters of the method werecalibrated in such a way that five local searches occur during theconstruction; for example, if there is a total of 100 customers, thelocal search is performed for every twenty customers ad<strong>de</strong>d to thepartial solution. In this case, the local search is performed usingthe RVND (see Section 2.5.2). The procedure terminates when allcustomers have been ad<strong>de</strong>d.2.4. Neighborhood structuresIn or<strong>de</strong>r to explore the solution space, 10 neighborhood structuresare used, where six of these modify two routes at each movementperformed (inter-route), while the other four modify only a singleroute (intra-route). The inter-route neighborhood structuresare generated by the following movements: Shift(1,0), Shift(2,0),Shift(3,0), Swap(1,1), Swap(2,1) and Swap(2,2). A movement ofthe neighborhood structure Shift(k,0) involves transferring k adjacentcustomers from route r 1 to another route r 2 ; and a movementof the type, Swap(k,l), interchanges k adjacent customers fromroute r 1 to l other adjacent customers from another route r 2 .As for those neighborhood structures that only modify one routeat a time, the following movements are used: Exchange, Shift’(1),Shift’(2) and Shift’(3). The Exchange neighborhood involves thepermutation between two customers of the same route and it canbe seen as an intra-route version of the Swap(1,1) neighborhood.The other three neighborhoods can be consi<strong>de</strong>red as intra-routeversions of the Shift(1,0), Shift(2,0) e Shift(3,0) neighborhoods,respectively.2.5. Proposed algorithmThe proposed algorithm, called Intensified Iterated Local Search(IILS-SP), involves the construction of an initial solution accordingto the procedure presented in Section 2.3, followed by a localsearch that combines a<strong>da</strong>pted versions of the Iterated Local Search(ILS) and Variable Neighborhood Descent (VND) methods withan exact approach based on the mathematical formulation of theSet Partitioning (SP). The pseudo-co<strong>de</strong> of IILS-SP is presented inAlgorithm 1. Let s 0 be an initial solution; s ∗ the best solutionobtained during the procedure execution; s ′ a perturbed solution;and, s ′′ a local optimal solution obtained by the application of theRVND to the perturbed solution.The following sections <strong>de</strong>tail each part of this algorithm.2.5.1. Intensified Iterated Local SearchIntensified Iterated Local Search is an extension of the IteratedLocal Search – ILS [5] metaheuristic. ILS explores the solutionspace by applying perturbations to the current local optimal solution.This metaheuristic starts with the initial solution s 0 and appliesa local search to it, obtaining s ∗ . Next, the method iterativelyperforms the following steps: (i) perturbs the current best solutions ∗ ; (ii) obtains a solution s ′ ; and (iii) performs a local search in s ′ ,obtaining a local optimal s ′′ . If s ′′ is better than the current bestsolution s ∗ , then the method transforms s ′′ into the new currentAlgorithm 1: IILS-SP()1 s 0 ← CI-POP()2 s ∗ ← RVND(s 0 )3 repeat4 s ′ ← Perturbation(s ∗ , history)5 s ′′ ← RVND(s ′ )6 if AppropriatedMoment(history) then7 s ′′ ← Intensification (s ′′ )8 end910 s ∗ ← AcceptanceCriterion(s ′′ , s ∗ , history)11 until stopping criterion not met12 return s ∗solution. Otherwise, the method performs another iteration. Thisprocedure is repeated until the stopping criterion is met.It is important to emphasize that ILS’s success strongly <strong>de</strong>pendson the perturbations performed. This way, the perturbation appliedto a given solution should be proportioned in such a way thatthe resulting modification is sufficient to escape from local optimaand to explore different regions of the search space, but keepingsome characteristics of the current best solution, in or<strong>de</strong>r to avoi<strong>da</strong> complete random restart in next iterations.In this work, a perturbation (line 4 of Algorithm 1) consists ofapplying p + 2 movements randomly chosen in the neighborhoodShift, presented in Section 2.4, where p ∈ {0,1,2,...} representsthe perturbation level. This way, the greater this value, the greaterthe number of modifications performed in the solution. Herein,ILSmax iterations without improvement are applied in the sameperturbation level. When this value is achieved, the perturbationlevel is increased.In this case, the local search of the IILS (lines 2 and 5 of Algorithm1) is performed using the Variable Neighborhood Descentwith random neighborhood or<strong>de</strong>ring, <strong>de</strong>noted by RVND and <strong>de</strong>scribedin Section 2.5.2.Finally, the proposed algorithm contains an intensification module(line 7 of Algorithm 1). This module is activated at appropriatemoments of the search and invokes a mathematical programmingprocedure, based on Set Partitioning, to find the optimal setof routes among those generated during the search. More specifically,the partitioning mo<strong>de</strong>l is applied to the set formed by all theroutes belonging to the solutions generated after the local searchphase of the IILS algorithm. That is, for each IILS iteration, theroutes of the solution s ′′ (line 5 of Algorithm 1) are ad<strong>de</strong>d to theset to be partitioned. This is done in such a way that there are norepeated routes in the set, which has an unlimited size.A <strong>de</strong>scription of this module is given in Section 2.5.3.2.5.2. Variable Neighborhood Descent with random neighborhoodor<strong>de</strong>ringThe procedure Variable Neighborhood Descent (VND) [6] involvesan exhaustive exploration of the solution space by means of systematicexchanges of the neighborhood structures. During the localsearch, only the solution that is better than the current bestsolution is accepted. When a better solution is found, the methodrestarts the search, beginning with the first neighborhood structure.The method VND is based on three principles: (i) a local optimumfor a given neighborhood structure does not necessarily correspondto a local optimum of another neighborhood structure; (ii)a global optimum corresponds to a local optimum for all neighborhoodstructures; and (iii) for many problems, the local optimum ofALIO-EURO <strong>2011</strong> – 146


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>a given neighborhood structure is close to the local optima of otherneighborhood structures.The latter principle, of empirical nature, indicates that a local optimumfrequently gives some type of information about the globaloptimal. This is the case in which local and global optimum sharea lot of variables with the same value.The classical version of VND searches local optimal solutions followinga fixed or<strong>de</strong>r of neighborhood structures. This strategy iswi<strong>de</strong>ly applied and the results in literature confirm its efficiency.However, for the results presented in this work, a random or<strong>de</strong>rwas used to explore the neighborhoods. This strategy is adoptedwith success in [7]. Here, this strategy is so-called RVND.2.5.3. Set partitioning mo<strong>de</strong>lThe intensification phase of the proposed algorithm involves theexact resolution of a Set Partitioning Problem (SPP). Let R bethe subset of routes generated by the IILS-algorithm and let y j ,∀ j ∈ R, be the binary variables that indicate if the route j ∈ R ispart of the solution (y j = 1); or not (y j = 0). Each route j ∈ Rhas an associated cost g j . The parameter m i j is equal to 1 if thecustomer i ∈ N is atten<strong>de</strong>d by the route j ∈ R; and 0, otherwise.The mathametical formulation is as follows.Minimize ∑ g j y j (2)j∈R∑ m i j y j = 1,∀i ∈ N (3)j∈R∑ y j ≤ |K| (4)j∈Ry j ∈ {0,1},∀ j ∈ R (5)The objective of this formulation is to find a set of routes that attendthe constraints of the problem with a minimum cost (2). Constraints(3) guarantee that each customer is visited by exactly oneroute. Constraints (4) ensure that a solution contains up to |K|routes. Constraints (5) <strong>de</strong>fine the domain of the variables.In this work, the SPP mo<strong>de</strong>l was implemented using ILOG APIConcert for C++ and solved by the CPLEX optimizer, version 12.3. COMPUTATIONAL RESULTSThe proposed algorithm (IILS-SP) was <strong>de</strong>veloped in C++ programminglanguage and tested in a computer with an Intel QuadCore 2.4 GHz microprocessor with 8 GB RAM and operationalsystem Ubuntu Linux 9.10 Kernel 2.6.31.IILS-SP was applied to solve the set of instances proposed bySolomon [8], which is well known in the literature.For each of the 56 instances, five runs were performed using a 10-minute processing time limit for each run as stopping criterion 1 .The algorithm was empirically calibrated and the parameters werefixed as follows: (i) in the construction of an initial solution, ascustomers are being inserted, five local searches were performe<strong>da</strong>s <strong>de</strong>scribed in Section 2.3; (ii) the number of no-improvement iterationsat a given level of perturbation of IILS was fixed as 20;(iii) the procedure is iteratively performed according to the Multi-Start [9] method, where at each iteration, an initial solution isconstructed by a non-<strong>de</strong>terministic method <strong>de</strong>scribed in the Section2.3 and a local search is performed by IILS-SP; and (iv) the1 The computational results of this research are availableat http://www.<strong>de</strong>com.ufop.br/sabir/shared/<strong>2011</strong>alio-vrptw-results.zipTable 1: Comparisons between different works that optimize thetotal distance traveledClass Work ∗ This workRT95 CA99 SC00 AL07 OV08C1 NV 10.00 10.00 10.00 10.00 10.00 10.00TD 828.38 828.38 828.38 828.38 828.38 828.38C2 NV 3.00 3.00 3.00 3.00 3.00 3.00TD 589.86 596.63 589.86 589.86 589.86 589.86R1 NV 12.16 12.42 12.08 13.25 13.33 13.17TD 1208.50 1233.34 1211.53 1183.38 1186.94 1181.03R2 NV 2.91 3.09 2.82 5.55 5.36 5.36TD 961.71 990.99 949.27 899.90 878.79 883.10RC1 NV 11.87 12.00 11.88 12.88 13.25 12.75TD 1377.39 1403.74 1361.76 1341.67 1362.44 1338.54RC2 NV 3.37 3.38 3.38 6.50 6.13 6.13TD 1119.59 1220.99 1097.63 1015.90 1004.59 1009.17All CNV 414 420 412 489 488 482CTD 57231 58927 56830 55134 55021 54842∗ RT95 [10], CA99 [11], SC00 [12], AL07 [1] and OV08 [13]maximum processing time for each execution of the mathematicalsolver in the intensification phase was limited to 5 seconds.In summary, the best solutions found during the executions by theIILS-SP were: 100% (9/9) tied values for C1; 100% (8/8) tiedvalues for C2; 33.3% (4/12) improved and 41.6% (5/12) tied valuesfor R1; 27.3% (3/11) improved and 9.1% (1/11) tied values for R2;37.5% (3/8) improved and 37.5% (3/8) tied values RC1; and 25%(2/8) improved and 12.5% (1/8) tied values for RC2. Overall, thevalues improved in 21.4% (12/56) of the cases, the values tied in48.2% (27/56) and the values <strong>de</strong>creased in 30.4% (17/56).The algorithm proved to be robust, since it presented relativelysmall gaps. In 80.4% (45/56) of the analyzed instances, gap wasless that 1.0%. When this value was improved, the gap was alwayssmaller than 4.16% (as in the R208). These results show that thealgorithm produces final solutions with quite little variability interms of solution quality. In addition, in some cases (R110, R202 eRC105) the proposed algorithm produced better results in averagethan those found in literature.Table 1 presents the results of different researches that had as a primaryobjective the minimization of the total distance traveled. Thecolumns represent the algorithm whereas the lines show the averagenumber of vehicles and the total distance traveled of the bestsolutions obtained for each class. For each algorithm, the averageresults with respect to Solomon’s benchmarks are reported withrespect to number of vehicles (“NV”) and total distance (“TD”).CNV and CTD indicate, respectively, the cumulative number ofvehicles and cumulative total distance over all the 56 instances.When observing the results of each group separately, the conclusionis that the algorithm values tied with those of the best resultsfound in literature in the cluster groups of C1 and C2, and outperformedthem in the groups of R1 and RC1. In the R2 andRC2 groups, although the results were close, they were not ableto improve the values of the other groups. Therefore, when consi<strong>de</strong>ringthe overall scenario, IILS-SP outperformed all the othersalgorithms in terms of solution quality.To verify the influence of the intensification phase of IILS-SP overits version without this strategy, named ILS, computational experimentswere carried out according to Aiex et al. [14]. In eachexperiment, we measure the CPU time to find or improve the targetvalue. For each instance/target pair, the n running times aresorted in increasing or<strong>de</strong>r. We associate with the i-th sorted runningtime t(i) a probability p(i) = (i − 1/2)/n, and plot the pointsz(i) = [t(i), p(i)], for i = 1,...,n. Figure 1 illustrates this cumulativeprobability distribution plot for IILS-SP and ILS algorithms,using the R208 instance and having as target a value 5% far fromthe best known value.ALIO-EURO <strong>2011</strong> – 147


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Figure 1: Cumulative probability distributionThis Figure clearly shows that IILS-SP is able to find a cost functionvalue at least as good as the given target value faster than theILS algorithm.4. CONCLUSIONSThis paper presents a hybrid algorithm for the Vehicle RoutingProblem with Time Windows. The proposed algorithm, so-calledIILS-SP, combines the Iterated Local Search metaheuristic, theVariable Neighborhood Descent method and an exact Set Partitioningmo<strong>de</strong>l that, periodically, performs the best combination ofthe routes generated along the algorithm. Hence, the IILS-SP combinesthe flexibility of heuristic methods and the power of mathematicalprogramming.The IILS-SP was tested in 56 well-known VRPTW instances andthe results were compared with the best solutions found in literature.The computational results show that the proposed hybri<strong>da</strong>pproach is quite competitive, since out of the 56 test problemsconsi<strong>de</strong>red, the algorithm improved the best known solution in 12cases and equaled the result of another 27.5. ACKNOWLEDGEMENTSThe authors acknowledge CAPES, CNPq and FAPEMIG for supportingthe <strong>de</strong>velopment of this research.6. REFERENCES[1] G. B. Alvarenga, G. R. Mateus, and G. <strong>de</strong> Tomi, “A geneticand set partitioning two-phase approach for the vehicle routingproblem with time windows,” Computers and OperationsResearch, vol. 34, pp. 1561–1584, 2007.[2] G. F. King and C. F. Mast, “Excess travel: causes, extent andconsequences,” Transportation Research Record, no. 1111,pp. 126–134, 1997.[3] J. K. Lenstra and A. H. G. R. Kan, “Complexity of vehiclerouting and scheduling problems,” Networks, vol. 11, no. 2,pp. 221–227, 2006.[4] M. G. C. Resen<strong>de</strong> and C. C. Ribeiro, “Grasp,” in SearchMethodologies, 2nd ed., E. K. Burke and G. Ken<strong>da</strong>ll, Eds.Springer (to appear), 2010, available at: http://www.ic.uff.br/$\sim$celso/artigos/grasp.pdf.[5] H. R. Lourenco, O. C. Martin, and T. Stutzle, “Iterated localsearch,” in Handbook of Metaheuristics, F. Glover andG. Kochenberger, Eds. Boston: Kluwer Aca<strong>de</strong>mic Publishers,2003, ch. 11.[6] N. Mla<strong>de</strong>novic and P. Hansen, “A variable neighborhoodsearch,” Computers and Operations Research, vol. 24, pp.1097–1100, 1997.[7] A. Subramanian, L. Drummond, C. Bentes, L. Ochi, andR. Farias, “A parallel heuristic for the vehicle routing problemwith simultaneous pickup and <strong>de</strong>livery,” Computers andOperations Research, vol. 37, pp. 1899–1911, 2010.[8] M. M. Solomon, “Algorithms for the vehicle routing andscheduling problem with time window contraints,” OperationalResearch, vol. 35, pp. 254–265, 1987.[9] R. Martí, “Multi-start methods,” in Handbook of Metaheuristics,F. Glover and G. Kochenberger, Eds. Boston: KluwerAca<strong>de</strong>mic Publishers, 2003, ch. 12.[10] Y. Rochat and E. Taillard, “Probabilistic diversification andintensification in local search for vehicle routing,” Journal ofHeuristics, vol. 1, pp. 147–167, 1995.[11] Y. Caseau and F. Laburthe, “Heuristics for large constrainedvehicle routing problems,” Journal of Heuristics, vol. 5, pp.281–303, 1999.[12] G. Schrimpf, J. Schnei<strong>de</strong>r, H. Stamm-Wilbrandt, andG. Dueck, “Record breaking optimization results usingthe ruin and recreate principle,” Journal of ComputationalPhysics, vol. 159, pp. 139–171, 2000.[13] H. <strong>de</strong> OLIVEIRA and G. Vasconcelos, “A hybrid searchmethod for the vehicle routing problem with time windows,”Annals of Operations Research, 2008. [Online]. Available:http://dx.doi.org/10.1007/s10479-008-0487-y[14] R. M. Aiex, M. G. C. Resen<strong>de</strong>, and C. C. Ribeiro, “Probabilitydistribution of solution time in grasp: An experimentalinvestigation,” Journal of Heuristics, vol. 8, pp. 200–2, 2000.ALIO-EURO <strong>2011</strong> – 148


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>A medium term short sea fuel oil distribution problemAgostinho Agra ∗ Marielle Christiansen † Alexandrino Delgado ‡∗ Department of Mathematics and CIDMAUniversity of Aveiroaagra@ua.pt† Department of Industrial Economics and Technology ManagementNorwegian University of Science and Technologymarielle.christiansen@iot.ntnu.no‡ Department of MathematicsUniversity of Cape Ver<strong>de</strong>alexandrino.<strong>de</strong>lgado@unicv.edu.cvABSTRACTWe consi<strong>de</strong>r a real inventory routing problem occurring in thearchipelago of Cape Ver<strong>de</strong>, where an oil company is responsiblefor the inventory management of multiple fuel oil products and forthe routing of ships between the islands. Inventory managementconsi<strong>de</strong>rations are taken into account at the <strong>de</strong>mand si<strong>de</strong>, but notat the supply si<strong>de</strong>. Demands are assumed to be constant over thetime horizon of several months. The objective of the company is toestablish a medium term plan that satisfies <strong>de</strong>mand requirementsand minimizes the transportation costs. We present a formulationfor the problem based on the one given by Christiansen (1999).Since this formulation provi<strong>de</strong>s large integrality gaps we discussdifferent exten<strong>de</strong>d formulations and compare them for a time horizonof fifteen <strong>da</strong>ys. In or<strong>de</strong>r to obtain feasible solutions for timehorizons of several months, we construct a rolling horizon heuristicthat uses the exten<strong>de</strong>d formulation that provi<strong>de</strong>d best computationalresults.Keywords: Maritime transportation, Inventory, Routing, Exten<strong>de</strong>dFormulations1. INTRODUCTIONWe present a real maritime inventory routing problem occurring inthe archipelago of Cape Ver<strong>de</strong>. An oil company is responsible forthe inventory management of different oil products in several tankslocated in the main islands. The inventory management must becoordinated with the routing of ships between the islands in or<strong>de</strong>rto prevent shortfalls. We consi<strong>de</strong>r a time horizon of six months.This problem can be classified within maritime transportation as ashort-sea medium-term inventory-routing problem.Maritime transportation has received a clear increased interest inthe last <strong>de</strong>ca<strong>de</strong>. Christiansen et al. [1] present a review on maritimetransportation, and Christiansen and Fagerholt [2] is <strong>de</strong>votedto maritime inventory routing problems. Combined routing andinventory management within maritime transportation have beenpresent in the literature the last <strong>de</strong>ca<strong>de</strong> only. See [3, 4, 5, 6, 7, 8,9, 10, 11, 12, 13, 14, 15, 16]. Most of these maritime inventoryrouting articles are based on real problems.In Cape Ver<strong>de</strong>, fuel oil products are imported and <strong>de</strong>livered to specificislands and stored in large supply storage tanks. From theseislands, fuel oil products are distributed among all the inhabitedislands using a small heterogeneous fleet of ships. These productsare stored in consumption storage tanks. Some ports have bothsupply tanks for some products and consumption tanks of otherproducts. Not all the islands consume all the products. In themedium term planning consi<strong>de</strong>red here capacities in supply tanksare ignored. However, for the consumption ports the capacity ofthe tanks for a particular product is usually less than the total <strong>de</strong>mandover the planning horizon for that product making the inventorymanagement an important issue.Unlike the short term planning case, in the medium term planningthe <strong>da</strong>ta is typically forecasted. Hence, safety stocks must beconsi<strong>de</strong>red. We assume the <strong>de</strong>mands are constant over the timehorizon. Several important issues taken into account in a shortterm plan are relaxed or incorporated indirectly in the <strong>da</strong>ta. For instance,berth capacities and operating time windows at ports, thatare essential in the short term plan, are ignored here. To transportthe fuel oil products between the islands, the planners control asmall, heterogeneous fleet consisting of two ships. Each ship has aspecified load capacity, fixed speed and cost structure. The travelingtimes are an estimation based on the practical experience andinclu<strong>de</strong> the travel time, set up times for the coupling and <strong>de</strong>couplingof pipes, operation times and an additional time to accountfor <strong>de</strong>lays and waiting times.The cargo hold of each ship is separated into several cargo tanks.The products cannot be mixed and cleaning operations are requiredwhen changing from dirty oil product to clean oil products. Againthis issue is more relevant on a short term plan then in a mediumterm where the quantities transported and the traveling times areestimations. Hence we ignore this issue.Given the initial stock levels at the <strong>de</strong>mand tanks, the initial position(which can be a point at sea) and the quantities onboar<strong>de</strong>ach ship, the inter island distribution plan consists of <strong>de</strong>signingroutes and schedules for the fleet of ships including <strong>de</strong>terminingthe (un)loading quantity of each product at each port. This planmust satisfy the safety stocks of each product at each island, thecapacities of the ships and tanks. The transportation cost of thedistribution plan is to be minimized.Following Christiansen [4], we present an initial arc-load flow formulationfor the problem. Since this formulation leads to largeintegrality gaps we discuss how to obtain tighter formulations. Usingthe (exten<strong>de</strong>d) formulation that provi<strong>de</strong>d better computationalresults to solve to optimality instances up to fifteen <strong>da</strong>ys we constructa rolling horizon heuristic that allows us to obtain feasibleplans for horizons of several months.ALIO-EURO <strong>2011</strong> – 149


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>2. FORMULATIONFollowing [4] we present an Arc-Load Flow (ALF) formulation.We divi<strong>de</strong> the formulations in the following parts: routing constraints,loading and discharging constraints, time constraints andinventory constraints. Finally the objective function is presented.Routing constraints:Let V <strong>de</strong>note the set of ships. Each ship v ∈ V must <strong>de</strong>part from itsinitial port position, that can be a point at sea, in the beginning ofthe planning horizon. The set of ports is <strong>de</strong>noted by N. Each possibleport arrival is <strong>de</strong>noted by the pair (i,m) representing the m thvisit to port i. Direct ship movements (arcs) from port arrival (i,m)to port arrival ( j,n) are represented by (i,m, j,n). We <strong>de</strong>fine S A asthe set of possible port arrivals (i,m), S Av as the set of possible portarrivals ma<strong>de</strong> by ship v, and set S Xv as the set of all possible movements(i,m, j,n) of ship v. We <strong>de</strong>note by µ ithe minimum numberof visits to port i.For the routing we <strong>de</strong>fine the following binary variables: arc flowvariables x im jnv that are equal to 1 if ship v sails from port arrival(i,m) directly to port arrival ( j,n), and 0 otherwise; variables xo imvthat indicate whether ship v sails directly from its initial positionto port arrival (i,m) or not; w imv is 1 if ship v visits port i at arrivalnumber (i,m) and 0 otherwise; z imv is equal to 1 if ship v endsits route at port arrival (i,m) and 0 otherwise, and y im indicateswhether a ship is visiting port arrival (i,m) or not.The routing constraints are as follows:∑( j,n)∈S Avxo jnv = 1, v ∈ V, (1)∑w imv − x jnimv − xo imv = 0,v ∈ V,(i,m) ∈ S Av , (2)( j,n)∈S Av∑w imv − x im jnv − z imv = 0, v ∈ V,(i,m) ∈ S Av , (3)( j,n)∈S Av∑ w imv = y im , (i,m) ∈ S A , (4)v∈Vy i(m−1) − y im ≥ 0, (i,m) ∈ S A : m > 1, (5)y iµi = 1, i ∈ N. (6)Equations (1) ensure that each ship <strong>de</strong>parts from its initial port positionto some port arrival. Equations (2) and (3) are the flow conservationconstraints, ensuring that a ship arriving at a port alsoleaves that port by either visiting another port or ending its route.Constraints (4) ensure that each port arrival (i,m) is visited at mostonce. Constraints (5) state that if port i receives the m th visit, thenit also must receive the m − 1 th visit. I Equations (6) guaranteethe minimum number of visits at each port. These constraints arenot necessary but computational experience showed that these constraintscan be very important when good bounds are given for theminimum number of visits to each port.Loading and discharging:Let K represent the set of all products. Not all ports consume allproducts. Parameters Jik assume value 1 if port i is a supplier ofproduct k;-1 if port i is a consumer of product k, and 0 if i is neithera consumer nor a supplier of product k. The quantity of product kon board ship v at the beginning of the planning horizon is givenby Q k v. C v is the total storage capacity of ship v. The minimum andthe maximum discharge quantity of product k are given by Q k im andQ k im respectively.In or<strong>de</strong>r to mo<strong>de</strong>l the loading and unloading constraints we <strong>de</strong>finethe following binary variables: o k imv is equal to 1 if product kis loa<strong>de</strong>d onto or unloa<strong>de</strong>d from ship v at port arrival (i,m), and0 otherwise; and the following continuous variables: q k imv is theamount of product k (un)loa<strong>de</strong>d at port arrival (i,m), limv k is theamount of product k onboard ship v when leaving from port arrival(i,m).The loading and unloading constraints are given by:x im jnv [l k imv + Jk j qk jnv − lk jnv ] = 0,v ∈ V,(i,m, j,n) ∈ S Xv,k ∈ K,xo i1v [Q k v + Ji k qk i1v − lk i1v ] = 0, v ∈ V,(i,1) ∈ S Av,k ∈ K, (8)∑limv vw imv , v ∈ V,(i,m) ∈ S Av , (9)kQ k im ok imv ≤ qk imv ≤ Qk imo k imv , v ∈ V,(i,m) ∈ S Av,∀k ∈ K : Ji k = −1,(10)∑o imv ≥ w imv, v ∈ V,(i,m) ∈ S Av , (11)ko k imv ≤ w imv, v ∈ V,(i,m) ∈ S Av ,k ∈ K, (12)∑o k imv ≤ x jnimv , v ∈ V,(i,m) ∈ S Av ,k ∈ K : Ji k = −1,( j,n)∈S Wv(7)(13)Equations (7) ensure that if ship v sails from port arrival (i,m) toport arrival ( j,n), then there must be satisfied the equilibrium ofthe quantity of product k on board each ship. These constraints canbe linearized as follows:l k imv + Jk j qk jnv − lk jnv +C vx im jnv ≤ C v , v ∈ V,(i,m, j,n) ∈ S Xv ,k ∈ K,(14)l k imv + Jk j qk jnv − lk jnv −C vx im jnv ≥ −C v ,v ∈ V,(i,m, j,n) ∈ S Xv ,k ∈ K.(15)Constraints (8) are similar to (7), and ensure the equilibrium of theload on board the ship for the first visit. These constraints can bereplaced by the following linear constraints:Q k v + J k i qk i1v − lk i1v +C vxo i1v ≤ C v , v ∈ V,(i,1) ∈ S Av ,k ∈ K,(16)Q k v + J k i qk i1v − lk i1v −C vxo i1v ≥ −C v ,v ∈ V,(i,1) ∈ S Av ,k ∈ K.(17)The ship capacity constraints are given by (9). To prevent un<strong>de</strong>sirablesituations such as a ship visiting a port to discharge a verysmall quantity, constraints (10) impose a lower and upper limits onthe unload quantities. Constraints (11) ensure that if ship v visitsport arrival (i,m), then at least one product must be (un)loa<strong>de</strong>d.Constraints (12) ensure that if ship v (un)loads one product at visit(i,m), then w imv must be one. Constraints (13) relate the variableso k imv to x jnimv.Time constraints:Since the <strong>de</strong>mand is assumed to be constant during the planninghorizon we consi<strong>de</strong>r a continuous time mo<strong>de</strong>l. In or<strong>de</strong>r to keeptrack of the inventory level it is necessary to <strong>de</strong>termine the startand the end times at each port arrival. We <strong>de</strong>fine the followingparameters: T Qik is the time required to load/unload one unit ofproduct k at port i; T i jv is the traveling time between port i and jby ship v; Tiv O indicates the traveling time required by ship v to sailfrom its initial port position to port i; TiB is the minimum intervalbetween the <strong>de</strong>parture of one ship and the next arrival at port i. Tis a large constant.ALIO-EURO <strong>2011</strong> – 150


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Given the start and end time variables, t im and t Eim at port arrival(i,m), the time constraints can be written as:t Eim = t im +∑vT Qik qk imv , (i,m) ∈ S A, (18)t im −t Ei(m−1) + T Bi y i(m+1) ≥ T Bi ,(i,m) ∈ S A : m > 1, (19)t Eim + T i jv −t jn ≤ T (1 − x im jnv ), v ∈ V,(i,m, j,n) ∈ S Xv , (20)∑ Tiv O i1v ≤ t i1 , i ∈ N. (21)v∈VConstraints (18) <strong>de</strong>fine the end time of service of arrival (i,m).Constraints (19) impose a minimum interval between two consecutivevisits at port i. Constraints (20) relate the end time of portvisit (i,m) to the start time of port visit ( j,n) when ship sails directlyfrom port (i,m) to ( j,n). Constraints (21) ensure that starttime at port arrival (i,1) occurs after a ship sails from its initialport position to port arrival (i,1).Inventory constraints:Inventory constraints are consi<strong>de</strong>red for each unloading port i (Ji k =−1). The <strong>de</strong>mand rate R k i of product k at port i, as well as the minimumS k i and maximum S k i stock levels of each products k at theconsumption ports are given. The parameter µ i <strong>de</strong>notes the maximumnumber of visits at port i.We <strong>de</strong>fine the continuous variables s k im and sk Eim indicating thestock level at the start and end of port visit (i,m), respectively.The inventory constraints are as follow:s k im + ∑vq k imv + Rk i t Eim − R k i t im − s k Eim = 0,(i,m) ∈ S A,k ∈ K,s k Eim + Rk i t i(m+1) − R k i t Eim − s k i(m+1) = 0, (i,m) ∈ S A,k ∈ KS k i ≤ sk im ,sk Eim ≤ Sk i ,(22)(23)∀(i,m) ∈ S A ,k ∈ K(24)S k i ≤ sk Eiµ i+ R k i (T −t Eiµ i) ≤ S k i , ∀i ∈ N,k ∈ K, (25)Equations (22) calculate the stock level of each product when theservice ends at port arrival (i,m). Similarly, equations (23) relatethe stock level at the start of port arrival (i,m + 1) to the stocklevel at the end of port visit (i,m). The upper and lower bound onthe stock levels are ensured by constraints (24). Constraints (25)ensure that the stock level at the end of the planning horizon iswithin the stock limits.Objective function:The objective is to minimize the total routing cost:∑∑C i jv x im jnv (26)v∈V (i,m, j,n)∈S Xv3. EXTENDED FORMULATIONThe arc-load flow mo<strong>de</strong>l provi<strong>de</strong>s, in general, large integralitygaps. In or<strong>de</strong>r to improve the original formulation, we proposeand test, for periods up to 15 <strong>da</strong>ys, different reformulations. Nextwe introduce the formulation that provi<strong>de</strong>d best computational results.productDefine fim kjnv as the amount of product k that ship v transports fromport arrival (i,m) to port arrival ( j,n), and f o k i1v as the amount ofk that ship v transports from its initial port position to portarrival (i,1).Using these additional variables we can provi<strong>de</strong> the following ArcFlow (AF) formulation:min (26) subject to (1)-(6), (10)-(25), and∑∑f o k j1v + fim kjnv + Jk j qk jnv = f jnimv k ,(i,m)∈S Av (i,m)∈S Avv ∈ V,( j,n) ∈ S Av ,∀k ∈ K (27)∑ fim kjnv ≤ C vx im jnv , v ∈ V,(i,m, j,n) ∈ S Xv (28)k∈KConstraints (27) ensure the equilibrium on the quantity on boardthe ship, and constraints (28) impose un upper bound of the capacityof ship v.It can be shown that the AF formulation is stronger than the ALFformulation. For the computational experiments we consi<strong>de</strong>red 11instances based on real <strong>da</strong>ta and used a computer with processorIntel Core 2 Duo, CPU 2.2GHz, with 4GB of RAM using the optimizationsoftware Xpress Optimizer Version 21.01.00 with XpressMosel Version 3.2.0. For a short time horizon of 15 <strong>da</strong>ys, therunning times were, in average, lower when the ALF formulationwas used. Of course, other exten<strong>de</strong>d formulations, as multicommoditytype formulations that are not presented here, provi<strong>de</strong> bestlower bounds but higher average running times. The followingtable compares the average integrality gaps and average runningtime obtained with each formulation. Both formulations have beentightened with valid inequalities imposing a minimum number ofoperations to each port.Formulation Gap(%) Time (seconds)ALF 31.8 162AF 28 1294. ROLLING HORIZON HEURISTICConsi<strong>de</strong>ring a planning horizon of several months, the tested instancesbecome too large to be solved to optimality by a commercialsoftware. To provi<strong>de</strong> feasible solutions we <strong>de</strong>velop a rollinghorizon heuristic. The main i<strong>de</strong>a of the rolling horizon heuristicsis to split the planning horizon into smaller sub-horizons, and thenrepeatedly solve limited and tractable mixed integer problem forthe shorter sub-horizons.Rolling horizon heuristics have been used intensively in the past,see for instance the related works [17, 18, 19, 20].In each iteration k (excepted the first one), the sub-horizon consi<strong>de</strong>redis divi<strong>de</strong>d into three parts: (i) a frozen part where binaryvariables are fixed; (ii) a central part (CP k ) where the nature of thevariables is kept, and (iii) a forecasting period (FP k ) where binaryvariables are relaxed. We assume that the central and forecastingperiods have equal length. The central period in iteration k becomesthe frozen period in iteration k + 1, and the forecasting periodfrom iteration k becomes part of the central period in iterationk + 1, see Figure 1. The process is repeated until the whole planninghorizon in covered. In each iteration the limited mixed integerproblem is solved using the AF formulation. When moving fromiteration k to iteration k + 1 we (a) fix the value of the binary variables,(b) up<strong>da</strong>te the initial stock level of each product at each port,(c) calculate the quantity of each product onboard each ship, and(d) up<strong>da</strong>te, for each ship, the initial position and the travel timeand cost from that position to every port. Computational resultsare reported.ALIO-EURO <strong>2011</strong> – 151


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>IterationkIterationk+1✛✛frozen periodfreezing strategy✲✛central period<strong>de</strong>tailed planningCP k✲✛✲✛forecasting periodsimplification strategyFP k✲CP k+1 FP k+1✲✛Figure 1: The rolling horizon heuristic5. REFERENCES[1] M. Christiansen, K. Fagerholt, B. Nygreen, and D. Ronen,“Maritime transportation. In: C. Barnhart, G. Laporte(Eds.),” Handbook in Operations Research and ManagementScience, vol. 14, no. 96, pp. 189–284, 2007.[2] M. Christiansen and K. Fagerholt, “Maritime inventory routingproblem. In: C. Flou<strong>da</strong>s, P. Par<strong>da</strong>los,(Eds.),” Encyclopediaof optimization, second edition. Springer, pp. 1947–1955, 2009.[3] F. Al-Khayyal and S. J. Hwang, “Inventory constrained maritimerouting and scheduling for multi-commodity liquidbulk, part I: Applications and mo<strong>de</strong>l,” European Journal ofOperational Research, no. 176, pp. 106–130, 2007.[4] M. Christiansen, “Decomposition of a combined inventoryand time constrained ship routing problem,” TransportationScience, vol. 33, no. 14, pp. 3–6, February 1999.[5] M. Christiansen and K. Fagerholt, “Robust ship schedulingwith multiple time windows,” Naval Research Logistics,vol. 49, no. 15, pp. 611–625, February 2002.[6] M. Christiansen and B. Nygreen, “A method for solving shiprouting problems with inventory constraints,” Annals of OperationsResearch, no. 22, pp. 357–378, 1998a.[7] J. Desrosiers, Y. Dumas, M. Solomon, and F. Soumis, “Timeconstrained routing and scheduling. In: M. O. Ball, T. L.Magnanti, C. L. M., Nemhauser, G. L. (Eds.) ,” Handbooksin Operations Research and Management Science 8, NetworkRouting. North-Holland, Amster<strong>da</strong>m, pp. 35–139, 1995.[8] K. Fagerholt, “Ship scheduling with soft time windows - anoptimization based approach,” European Journal of OperationalResearch, no. 131, pp. 559–571, 2001.✲t✲[9] T. Flatberg, H. Haavardtun, O. Kloster, and A. Løkketangen,“Combining exact and heuristic methods for solving a vesselrouting problem with inventory constraints and time windows,”Ricerca Operativa, vol. 29, no. 91, pp. 55–68, 2000.[10] R. Giesen, J. Muñoz, M. Silva, and M. Leva, “Método <strong>de</strong>solucíon al problema <strong>de</strong> ruteo e inventarios <strong>de</strong> múltiplos produtospara una frota heterogénea <strong>de</strong> naves,” Ingeniería <strong>de</strong>Transporte, vol. 13, no. 1, pp. 31–40, 2007.[11] R. Grønhaug, M. Christiansen, M. Desaulniers, andG. Desrosiers, “A branch-and-price method for a liquefiednatural gas inventory routing problem,” Transportation Science,vol. 44, no. 3, pp. 400–415, 2010.[12] S. J. Hwang, “Inventory constrained maritime routing andscheduling for multi-commodity liquid bulk,” Ph.D. dissertation,Georgia Institute of Technology, Atlanta, 2005.[13] D. Ronen, “Marine inventory routing: Shipments planning.”Journal of the Operational Research Society, vol. 53, no. 1,pp. 108–114, 2002.[14] H. Sherali, S. Al-Yakoob, and M. Hassan, “Fleet managementmo<strong>de</strong>ls and algorithms for an oil-tanker routing andscheduling problem,” IIE Transactions, vol. 31, no. 5, pp.395–406, 1999.[15] M. Stålhane, J. Rakke, H. Moe, R. An<strong>de</strong>rsson, M. Christiansen,and K. Fagerholt, “A construction and improvementheuristic for a liquefied natural gas inventory routing problem,”Submitted to Journal, 2010.[16] N. Siswanto, D. Essam, and R. Sarker, “Solving the shipinventory routing and scheduling problem with un<strong>de</strong>dicatedcompartments,” Computers and Industrial Engineering,DOI: 10.1016/j.cie.2010.06.011, 2010.[17] C. Mercé and G. Fontain, “Mip - based heuristics for capacitatedlotsizing problems,” International Journal of ProductionsEconomics, no. 85, pp. 97–111, 2003.[18] D. Bredström and M. Rönnqvist, “Supply chain optimizationin pulp distribution using a rolling horizon solution approach,”NHH Dept. of Finance and Management ScienceDiscussion Paper, no. 17, December 2006.[19] J. Rakke, M. Stålhane, C. Moe, M. Christiansen, H. An<strong>de</strong>rsson,K. Fagerholt, and I. Norstad, “A rolling horizon heuristicfor creating a liquefied natural gas annual <strong>de</strong>livery program,”Transportation Research Part C: Emerging Technologies,doi:10.1016/j.trc.2010.09.006,2010.[20] M. Savelsbergh and Jin-Hwa Song, “Inventory routing withcontinuous moves,” Computers and Operations Research,vol. 34, no. 6, pp. 1744 – 1763, 2007.ALIO-EURO <strong>2011</strong> – 152


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Nash Equilibria in Electricity MarketsMargari<strong>da</strong> Carvalho ∗ João P. Pedroso ∗ João Saraiva †∗ INESC Porto and <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> Ciências, Universi<strong>da</strong><strong>de</strong> do PortoRua do Campo Alegre, 4169-007 Porto, Portugalmmsc@inescporto.pt jpp@fc.up.pt† INESC Porto and <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> <strong>Engenharia</strong>, Universi<strong>da</strong><strong>de</strong> do PortoRua Dr. Roberto Frias, 4200 - 465 Porto, Portugaljsaraiva@fe.up.ptABSTRACTNash equilibria are solutions for many problems arising in Economics.In a restructured electricity sector, the pool market can beseen as a game where some players, the producers, submit theirproposals. The profits of each producer <strong>de</strong>pend on the proposalsof the others. So, in this context, the strategies reached by the producersin a Nash equilibria are the best solutions for them. Here,we present our work in the <strong>de</strong>velopment of techniques that can beused for <strong>de</strong>termining Nash equilibria for this game.Keywords: Nash Equilibria, Energy Sector, Adjustmet Process,Electricity Markets1. INTRODUCTIONAt the end of the 19 th century, electricity started to be generated,transported and distributed through low power networks with smallgeographic size. The companies were vertically integrated andthere was not any competition in this sector. This kind of organizationin the electricity market implies that: the consumers couldnot choose an electricity company to be supplied, the prices were<strong>de</strong>fined in an administrative and sometimes unclear way, planningactivities were ma<strong>de</strong> with less complexity than to<strong>da</strong>y (also becausethe economic environment was less volatile). Therefore, before theoil crises (1973), the electricity companies easily ma<strong>de</strong> forecasts,because the risk or uncertainty were not prior concerns. This situationchanged in the beginning of the 70’s: high inflation and interestrates ma<strong>de</strong> the economic environment more volatile. Addingto this fact, the evolution of technology forced the <strong>de</strong>regulation ofthe electric sector and its vertical unbundling. Thus, new companieswere established and market mechanisms were implementedin or<strong>de</strong>r to generate competition conditions (see [1]).Nowa<strong>da</strong>ys, in this restructured sector, many electricity markets arebased on a pool-based auction for the purchase and sale of power.In this work, we apply game theory to this problem, in particularthe notion of Nash equilibria. We look at this pool-based auctionas a game, in which producers are the players that have to choosea strategy (or proposal) to submit. Here, their goal is to submit onethat maximizes the profit. A Nash equilibrium is a set of strategiesfor each player, where nobody wants to change unilaterallyhis behaviour(see [2]). So, in the electricity pool market, we areinterested in finding such equilibrium strategies for the producers,since it is the best answer that we can give to this non-cooperativegame.There is some literature related to this subject. For example, [3]and [4] study strategic bidding in electricity pool markets, withelastic and inelastic <strong>de</strong>mand, respectively. We assumed almostinelastic <strong>de</strong>mand because in the current markets this is the mostrealistic assumption: the consumers will pay (almost) anything tomeet the <strong>de</strong>mand.The authors of [5] consi<strong>de</strong>red the case of constant, stochastic <strong>de</strong>mand.They used the methodology of [4] to eliminate the bilinearterms of the generation companies’ profit maximization problem,using a piecewise linear function and binary variables. They alsocontributed with a procedure that has the aim of finding all Nashequilibria in pure strategies. There, the proposals’ prices and quantitiestake discrete values, unlike in our work, which focuses onthe use of methods to compute Nash equilibria in games with finitestrategies. However, as reported by the authors of [6], thediscretization of the space of strategies can artificially eliminatesome true Nash equilibria and add some equilibria that do not existin the original game. In [7] it is proposed a fast computation ofNash equilibria in pure strategies by observing their properties; inthat work, discretization is not required.To approach this problem, we present an adjustment process thatcould be seen as a learning process by companies generating electricity.When this process converges we find a Nash equilibrium inpure strategies.This exten<strong>de</strong>d abstract is organized as follows: Section 2 presentsthe electricity market mo<strong>de</strong>l, Section 3 clarifies the concept ofNash equilibrium, explains the <strong>de</strong>veloped approach to achieve themand presents an example, Section 4 presents our future work in thisproblem.2. ELECTRICITY MARKET MODELIn the pool market, consumers and producers submit their proposalsto buy and sell electricity. Here we assume very inelastic <strong>de</strong>mand,that is characterized by the real constants m < 0 and b, insuch a way that the <strong>de</strong>mand is represented by almost a vertical segmenty = mx + b. The generation companies simultaneously submittheir proposals, that corresponds to pairs of quantity (MWh)and price ($/MWh). Let n be the number of the selling proposals.For each hour of the <strong>da</strong>y we have a matrix M that containsall the information about the proposals of the producers and thegeneration costs. This matrix has the following form, for each rowj ∈ {1,2,...,n}:M j = [ j s j E j λ j c j]where the proposals are in<strong>de</strong>xed by j = 1,2,...,n, s j is the producerof proposal j, E j is the proposal quantity in MWh, λ j is theprice in $/MWh and c j is the associated marginal cost.Then the market operator, an in<strong>de</strong>pen<strong>de</strong>nt agent, carries out an economicdispatch, ED, once the price and quantity bids of the producerswere submitted. He wants to find which proposals shouldALIO-EURO <strong>2011</strong> – 153


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>be dispatched so that the <strong>de</strong>mand is satisfied and the market clearingprice P d is minimized. The market operator organizes the proposalsby ascending or<strong>de</strong>r of the prices λ j and aggregates them,forming the supply curve. Thus the intersection of this curve withthe <strong>de</strong>mand segment gives the market clearing price P d and quantityQ d , and the proposals that are dispatched, as shown in Figure1. Therefore, the revenue for each producer i is given by:( )Π i = ∑ Pd − c j g jj∈{ED:s j =i}where g j is the energy produced by i = s j in the ED. This profit<strong>de</strong>eply <strong>de</strong>pends on the strategies of the other players, which makesthe problem complex.3. NASH EQUILIBRIA COMPUTATIONGame theory provi<strong>de</strong>s important tools in economics. The conceptof Nash equilibrium of a game plays a relevant role in this context.Basically, it is a probability distribution over the set of strategiesof each player, such that nobody wants to change unilaterally thisbehaviour. If some player would change his strategy with respectto a Nash equilibrium, his profit would not increase (see [2]).In our case, the strategies of each player are the proposal prices, so,in a Nash equilibrium, we have the probability of choosing λ j overthe set [0,b], where b is the maximum price at which the consumersbuy electricity (see section 2 where the <strong>de</strong>mand is <strong>de</strong>fined). ANash equilibrium in which each player plays with probability onea certain strategy is called a equilibrium in pure strategies.The method that we use in this abstract only provi<strong>de</strong>s pure Nashequilibria, but we are currently working towards finding mixedNash equilibria.3.1. Adjustment ProcessIn current electricity markets, the producers have to communicatethe market operator their proposals for each hour of the following<strong>da</strong>y. We admit that each producer predicts exactly the <strong>de</strong>mand foreach hour and knows the technology of his competitors, so that heknows the marginal costs of the others. Our goal is to find the beststrategy for each company.We will apply an adjustment process to find out the Nash equilibriaof this non-cooperative game. An adjustment process is an iterativeprocess in which each player adjusts his strategy according tothe past iterations. This is a learning process. It is easy to fin<strong>de</strong>xamples in which this method diverges or has chaotic behaviour,so this process does not always work. However, if a solution isfound, then it is a Nash equilibrium.In this context, we started only with the prices λ j as <strong>de</strong>cision variables,but it follows immediately how to a<strong>da</strong>pt the process in or<strong>de</strong>rto have both prices and quantities as <strong>de</strong>cision variables.We have used two adjustment processes: the ones <strong>de</strong>scribed in [8]and in [9]. The first one only uses the <strong>da</strong>ta of last iteration to adjustthe strategy, while the second uses an estimation based on the allpast iterations. After some experiences, we noted that the firstmethod converges faster than the second (in this case) and also itis simpler to <strong>de</strong>scribe. Hence, we focus in that one in this work.In [10] a very similar process is presented, but there the <strong>de</strong>cisionvariables are the quantities.Our method can be <strong>de</strong>scribed with the following pseudo co<strong>de</strong>:1: initialise with an information matrix M, ε > 0, and <strong>de</strong>mandparameters m and b;2: let S i be the set of proposals of the producer i and k the numberof producers;3: repeat4: X ← fourth column of M;5: for i = 1 to k do( )6: λ Si ← argmax λ j , j∈S i∑ s j ∈ED∩S i Pd − c j g j ;7: up<strong>da</strong>te the fourth column of M with λ j for j ∈ S i ;8: end for9: Y ← fourth column of M;10: ∆ ← ||Y − X||;11: until ∆ < εIn short, in each step every producer finds the strategy that maximizeshis profit assuming that the other players are going to followthe strategy of the previous iteration. The process stops when twoiterations are sufficiently close to each other (that means that thecurrent matrix M is a Nash equilibrium, because nobody ma<strong>de</strong> asignificant change in his behaviour. In fact, when ∆ = 0, M isexactly a Nash Equilibrium). It is important to notice that themaximization process, in step six, needs a method able to tacklenon-smooth functions, as the profit of the companies is a functionwith discontinuities.The most important step in our adjustment process is the maximizationof the producers’s profits. To solve this problem, wehave used the MATLAB implementation of a global optimizationmethod <strong>de</strong>veloped by Ismael Vaz and Luís Vicente, see [11]. Inthis method we only need to evaluate the objective function valuesresulting from pattern search and particle swarm, so this is exactlywhat we need in our adjustment process.3.2. Case StudyIn this example, we consi<strong>de</strong>r one period of the pool Market, withfive producers, each with one generation unit. They want to knowthe prices to attribute to their proposals so as to maximize theirprofit. We assume that this is a competitive market, so there is notcooperation between the companies.The information matrix M is:⎡⎤j s j E j λ j c j1 Producer A 100 0.50 0.402 Producer B 150 0.30 0.30M =⎢ 3 Producer C 200 0.80 0.55 ⎥⎣4 Producer D 180 0.55 0.50⎦5 Producer E 250 0.85 0.60and the <strong>de</strong>mand is mo<strong>de</strong>led byy = −72000 x + 7 4 .This initial situation is represented in Figure 1, with P d = 0.50$/MWh, Q d = 342.8571MWh and accepted proposals:⎡⎢M ED = ⎣j s j g j λ j2 Producer B 150.0000 0.301 Producer A 100.0000 0.504 Producer D 92.8571 0.55In our simulation of the pool market, if we have two or moreproposals with the same price we accept them in a proportionalway. For example, the market clearing price is 30 $/MWh and weneed 200MWh to be allocated; we have three proposals with price30$/Mwh and quantities 300MWh, 60MWh and 240MWh; thenwe accept 1 3 = 200300+60+240 of the quantity of each proposal.Applying our algorithm to this situation, with ε = 10 −9 and a max-⎤⎥⎦(1)ALIO-EURO <strong>2011</strong> – 154


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Figure 1: Pool Market.Figure 3: Evolution of ∆ through iterations of the adjustment process.imum of 50 adjustment iterations, we achieve:⎡M =⎢⎣j s j E j λ j c j1 Producer A 100 0.400000 0.402 Producer B 150 0.300000 0.303 Producer C 200 0.550000 0.554 Producer D 180 0.549999 0.505 Producer E 250 0.600000 0.60in 17 iterations and final ∆ = 2.44×10 −10 . This is a Nash equilibriumas we can see in Figures 2, 3 and 4 :⎤⎥⎦Figure 4: Evolution of proposals’s prices through iterations of theadjustment process.Figure 2: Pool Market.Only the producer D can achieve a larger profit by increasing theprice of the proposal up to 0.55 $/MWh, but if it chooses λ 1 = 0.55it would have to divi<strong>de</strong> the sold quantity with producer C. So itchooses a price slightly less than 0.55 $/MWh. Figure 3 shows thatthis process starts converging to the Nash equilibrium very fast,<strong>de</strong>spite the existence of some fluctuations in ∆ due to the numericaloptimization process. The ED of this equilibrium is:⎡⎢M ED = ⎣j s j g j λ j2 Producer B 150.0000 0.3000001 Producer A 100.0000 0.4000004 Producer D 92.8571 0.549999⎤⎥⎦ (2)give different results. We used the adjustment process in this exampleseveral times and observed that the method always foundthis equilibrium, but the number of iterations used ranged between3 and 25. Another relevant comment is that changing the initialmatrix M can achieve different Nash equilibria. In this examplewe always found the same Nash equilibrium on pure strategies.Note that if we modify, for example, bid 4 in Equation (2) fora quantity in the domain [92.8571,180], we keep having a NashEquilibrium. Thus we expected that adjusting prices and quantitiescould lead us to chaotic behaviours and consequently to the nonconvergenceof the process, because usually there is more than oneanswer with the same profit for the optimization step.As a matter of fact, producers can also choose the quantities fortheir bids, hence we applied to this example a more general adjustmentprocess, where the quantities and prices are <strong>de</strong>cision variables.To make that possible, we need to add to our <strong>da</strong>ta, M, m andb, the capacity of each generator. Here, it is assumed that the thirdcolumn of the initial M is concerned with the maximum capacityof each generator.Applying to the matrix of Equation (1) the adjustment process withwith P d = 0.549999$/MWh and Q d = 342.8571MWh.It is important to mention that the optimization method PSwarmused (see [11]) is stochastic, so applying again this method couldALIO-EURO <strong>2011</strong> – 155


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>respect to prices and quantities we achieve with four iterations:⎡⎤j s j E j λ j c j1 Producer A 100.000 0.400000 0.402 Producer B 150.000 0.300000 0.30M =⎢ 3 Producer C 200.000 0.550000 0.55 ⎥⎣4 Producer D 92.8571 0.500000 0.50⎦5 Producer E 250.000 0.600000 0.60and economic dispatch:⎡⎤j s j E j λ j c j2 Producer B 100.000 0.400000 0.40M ED = ⎢ 1 Producer A 150.000 0.300000 0.30⎥⎣4 Producer D 92.8571 0.500000 0.50⎦3 Producer C .354 · 10 −6 0.550000 0.55with ∆ = 0, so there is no doubt that this represent a Nash equilibrium.4. FUTURE WORKThe problem of non-convergence of our process when there ismore than one answer with the same profit for the optimizationstep, can possibly be overcomed if we search for equivalence classes(two proposals are in the same class if both lead to the same profit)and use a single representant for each class. In this context, weare studying the convergence conditions that tell us when the adjustmentprocess works successfully. Another reason that coul<strong>de</strong>xplain the non-convergence of the method is that we may enter acycle or an orbit that diverges. Here it may be preferable to usethe adjustment process based in the estimative of past iterations,because it is more suitable in or<strong>de</strong>r to converge.We are also currently studying the possibility of discretizing thespace of strategies to make it finite (see [6]). In that case, thereis a large number of available methods to find Nash equilibria (includingmixed Nash equilibria). Then, we can a<strong>da</strong>pt the results tothe original game through interpolation. We have to be careful inthis process because sometimes an equilibrium in discrete gamesis not one in the original game. An important step in this processis to recognize strategies that are dominated, meaning that they arenever adopted and played.In conclusion, this work addresses important questions arising inthe pool market, and can contribute to the <strong>de</strong>velopment of algorithmsto find mixed Nash equilibria where the sets of strategiesare continuous.5. ACKNOWLEDGEMENTS6. REFERENCES[1] J. Saraiva, J. P. <strong>da</strong> Silva, and M. P. <strong>de</strong> Leão, Mercados <strong>de</strong>Electrici<strong>da</strong><strong>de</strong> - Regulação <strong>de</strong> Tarifação <strong>de</strong> Uso <strong>da</strong>s Re<strong>de</strong>s.FEUP Edições, 2002.[2] D. Fu<strong>de</strong>nberg and J. Tirole, Game Theory, 5th ed. Cambridge,MA: MIT Press, 1996.[3] D. Pozo, J. Contreras, Ángel Caballero, and A. <strong>de</strong> Andrés,“Long-term Nash equilibria in electricity markets,”Electric Power Systems Research, vol. In Press, CorrectedProof, pp. –, 2010. [Online]. Available: http://www.sciencedirect.com/science/article/B6V30-519VV21-1/2/e7fcc2861b27be46806cd9aaf0aed724[4] M. Pereira, S. Granville, M. Fampa, R. Dix, and L. Barroso,“Strategic bidding un<strong>de</strong>r uncertainty: a binary expansionapproach,” Power Systems, IEEE Transactions on, vol. 20,no. 1, pp. 180 – 188, Feb. 2005.[5] D. Pozo and J. Contreras, “Finding multiple nash equilibriain pool-based markets: A stochastic EPEC approach,” PowerSystems, IEEE Transactions on, vol. PP, no. 99, pp. 1 –9,<strong>2011</strong>.[6] K.-H. Lee and R. Baldick, “Tuning of discretization in bimatrixgame approach to power system market analysis,” PowerSystems, IEEE Transactions on, vol. 18, no. 2, pp. 830 – 836,May 2003.[7] E. Hasan and F. Galiana, “Fast computation of pure strategynash equilibria in electricity markets cleared by merit or<strong>de</strong>r,”Power Systems, IEEE Transactions on, vol. 25, no. 2, pp. 722–728, May 2010.[8] J. P. Pedroso and Y. Smeers, “Equilibria on a game with discretevariables,” in Programs, Proofs, Processes, F. Ferreira,H. Guerra, E. Mayordomo, and J. Rasga, Eds. Azores, Portugal:Computability in Europe 2010, 2010, pp. 326–335.[9] L. Thorlund-Petersen, “Iterative computation of Cournotequilibrium,” Norwegian School of Economics and BusinessAdministration-, Working Papers, 1988. [Online]. Available:http://econpapers.repec.org/RePEc:fth:norgee:1-88[10] J. Contreras, M. Klusch, and J. Krawczyk, “Numerical solutionsto Nash-Cournot equilibria in coupled constraintelectricity markets,” Power Systems, IEEE Transactions on,vol. 19, no. 1, pp. 195 – 206, Feb. 2004.[11] A. Vaz and L. Vicente, “A particle swarm patternsearch method for bound constrained global optimization,”Journal of Global Optimization, vol. 39, pp. 197–219,2007, 10.1007/s10898-007-9133-5. [Online]. Available:http://dx.doi.org/10.1007/s10898-007-9133-5This work was supported by a INESC Porto fellowship in the settingof the Optimization Interunit Line.ALIO-EURO <strong>2011</strong> – 156


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Application of Combinatorial Optimization in Natural Gas System OperationTeresa Nogueira ∗∗ Institute of Engineering, Polytechnic Institute of PortoRua Dr. Antonio Benardino <strong>de</strong> Almei<strong>da</strong>, 431 – 4200-072 Porto, PORTUGALtan@isep.ipp.ptABSTRACTThe best places to locate the Gas Supply Units on natural gas systemsand their optimal allocation to loads are the key factors toorganize an efficient upstream gas infrastructure. In this work weuse the P-median problem to locate the GSUs on a gas networkand the transportation problem to assign gas <strong>de</strong>mand no<strong>de</strong>s to thesource facilities. Due to its mathematical structure, the applicationof P-median problem to large networks needs heuristic techniques.This paper presents two Lagrangean heuristics, tested on a realisticnetwork - the primary Iberian natural gas network. Computationalresults are presented, showing the location arrangement and systemtotal costs.Keywords: Gas supply units – GSUs, Lagrangean heuristic, P-median problem, Relocation heuristic1. INTRODUCTIONTo comply with natural gas <strong>de</strong>mand growth patterns and Europeimport <strong>de</strong>pen<strong>de</strong>ncy, the Iberian natural gas industry needs to organizean efficient upstream infrastructure [1]. Marine terminals,storage facilities and gas injection points, are the source points ofthe natural gas system: the Gas Supply Units – GSUs. The locationof such infrastructures in gas networks, as well as allocated loads,should be carefully planned in or<strong>de</strong>r to minimize overall costs [2].Most of gas loads are connected to GSUs by pipelines, being thenatural gas transported in the gas form at high pressure . Alternatively,when there is no physical pipeline between supply/<strong>de</strong>mandpoints, gas may be transported by virtual pipeline – gas transportedby road trucks in its liquefied form.The aim of this paper is the presentation of two Lagrangean heuristicsto support the <strong>de</strong>cision of GSUs location on a gas network.This location problem studies the best places to locate GSUs onnetwork, minimizing total distances between sources and loads[3].Once <strong>de</strong>fined, GSUs serve load sites with known gas <strong>de</strong>mands,minimizing combined GSUs location and transport costs. Thisquestion is addressed by the transportation problem.For the location problem, we use the P-median problem, that findsthe location of a number of P facilities (in this case, GSUs), soas to minimize the weighted average distance of the system [4].Due to its mathematical structure, the P-median problem is NPhar<strong>da</strong>nd therefore cannot be solved in polynomial time. So, itis necessary to use heuristics methods for large and realistic P-median problems.In [5], was presented a simple Lagrangean heuristic, by using Lagrangeanrelaxation and subgradient optimization to solve the dualproblem. In this paper we improve the solution by adding the Lagrangeanrelocation heuristic. This is done by analyzing somechanges between medians and non-medians locations. With thisexhaustive procedure, we can obtain better solutions, not reachedby simple Lagrangean heuristics.In section two we present the Lagrangean relaxation for P-medianproblems. Section three presents the relocation heuristic, an improvementto the simple Lagrangean heuristic. To conclu<strong>de</strong> aboutthe effectiveness of the Lagrangean relocation heuristic, we compareits computational results to those of the simple Lagrangeanapproach.The location mo<strong>de</strong>lling presented in this work is applied to theIberian natural gas system, to find the best GSUs location and theiroptimal allocation to loads.The Iberian natural gas network is geographically organised with65 <strong>de</strong>mand no<strong>de</strong>s (Fig. 1). Most of these <strong>de</strong>mand points are connectedby physical pipelines (red lines in Fig. 1); the others aresupplied by road trucks with gas in liquefied form – the virtualpipelines. These virtual pipelines are all the connections betweentwo no<strong>de</strong>s without a physical pipeline.Figure 1: Iberian natural gas network.2. THE LAGRANGEAN APPROACHTo exemplify the application of Lagrangean relaxation to the locationP-median problem, we will consi<strong>de</strong>r the following binaryinteger programming problem, with m GSUs potential sites and n<strong>de</strong>mand no<strong>de</strong>s:subject to:m nZ = Min ∑ ∑ α.d ij .X ij (1)i=1 j=1ALIO-EURO <strong>2011</strong> – 157


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Where:Total Costs (Me)GSUs Values α/α v Simple LagrangeanLocated (e/m 3 km) Lagrangean RelocationHeuristic HeuristicP = 14 0,015 / 0,018 286.690,7 281.782,2P = 20 0,019 / 0,022 318.110,3 310.359,3P = 25 0,024 / 0,027 365.457,4 355.393,9P = 28 0,027 / 0,030 382.454,2 372.758,9Table 1: Lagrangean Heuristics Results.m∑ X ij = 1 j = 1,...,n (2)i=1m∑ X ii = P (3)i=1X ij ≤ X ii i = 1,..,m; j = 1,..,n (4)X ij ∈ {0,1} i = 1,..,m; j = 1,..,n (5)α.[d i j ], is the symmetric cost [distance] matrix, with d ii = 0, ∀i;αis the kilometric gas transported cost per gas unit (cubic meter –m 3 ); [X i j ] is the allocation matrix, with X i j = 1 if a no<strong>de</strong> i is allocatedto no<strong>de</strong> j, and X i j = 0, otherwise; X ii = 1 if no<strong>de</strong> i has a GSUand X ii = 0, otherwise; P is the number of GSUs (medians) to belocated.The objective function (1) minimizes the distance of each pair ofno<strong>de</strong>s in network, weighted by α. Constraints (2) ensure that eachno<strong>de</strong> j is allocated to a source no<strong>de</strong> i. Constraint (3) <strong>de</strong>terminesthe number of GSUs to be located (P). Constraint (4) sets that a<strong>de</strong>mand no<strong>de</strong> j is allocated to a no<strong>de</strong> i, if there is a GSU at no<strong>de</strong> i.Constraint (5) states the integer conditions.The parameter α assumes the cost value in physical pipelines, ifa pipe exists between no<strong>de</strong> i and j. If there is no pipe connectionbetween no<strong>de</strong>s, the parameter α is replaced by α v , the cost valuein virtual pipelines (usually, α v is greater than α). These differenttransport cost values are implicitly assumed in the algorithm andhave a great influence on the located GSUs solution.3. RELOCATION HEURISTICThe Lagrangean heuristic presented to solve the location problemoften gives very good results, but it can be improved with the applicationof an additional heuristic – the relocation heuristic – whichattempts to get closer to the optimal solution than the simple Lagrangeanheuristic. The computational results comparing the twoLagrangean approach are presented in this section.The relocation heuristic starts from the simple Lagrangean results.Then, the P clusters are i<strong>de</strong>ntified, C 1 , C 2 , . . . C P , corresponding tothe P medians (GSUs) and their allocated non-medians (gas loadno<strong>de</strong>s). The solution can be improved by searching for new medians,swapping the current medians by non-medians and reallocatingthe loads. For each swap we analyze the solution achieved bynew location and allocation from new source to loads. If the newsolution is better, we keep it. The process is repeated until no moreimprovements are achieved.In table 1 we can see the behavior of the two implemented Lagrangeanheuristics: simple and with relocation. The solution wastaken for different values of α and α v , respectively, kilometric costof the natural gas unit transported by physical and virtual pipeline.The total costs presented in table I are the sum of GSUs implantationcosts and the transport costs.The increment of P means an increment of medians, so more GSUsinstalled, thus, the total costs increase for both approaches. Asobserved in table 1, for each case of α/ α v values, the total coststhat resulted from relocation heuristic are better (lower costs) thanthe simple Lagrangean heuristic.As we increase the α/α v value transportation costs are increased,so the optimization problem minimizes the solution by adding moreGSUs. This is an attempt to minimize transportation cost, but fixedcosts have a big influence, so, the result is a higher total cost.For all solutions of P, we can see in table I the Lagrangean relocationheuristic presents an obvious improvement in system totalcosts.4. CONCLUSIONBased on Lagrangean heuristics, this work supports GSUs location<strong>de</strong>cision-make and their optimal allocation to gas <strong>de</strong>mands. Thesimple Lagrangean heuristic often gives very good results, but notnecessarily the optimal ones. To improve this resolution approach,we <strong>de</strong>veloped the Lagrangean relocation heuristic, which provedits efficiency in total costs function minimization. Its performancewas verified for different location scenarios and with different parametersvalues.The <strong>de</strong>veloped location mo<strong>de</strong>l can be applied to any other gas networktype with the same good performance.5. REFERENCES[1] T. Nogueira, Z. Vale and M. Cor<strong>de</strong>iro. Natural Gas Marketin Europe: Development and Trends. CAIP2005 – 7th InteramericanConference on Computer Analysis of ManufacturerProcesses, Vila Real, Portugal, 2005, pp. 115–118.[2] T. Nogueira, R. Men<strong>de</strong>s, Z. Vale and J. Cardoso. An HeuristicApproach for Optimal Location of Gas Supply Units in TransportationSystem. 22 nd International Scientific Meeting of GasExperts, vol.1, pp. 303-311, May 2007, Opatija, Croatia.[3] L. Lorena and M. Narciso. Relaxation Heuristics for AssignmentProblem. European Journal of Operational Research,91:600–610, 1996.[4] Z. Drezner and H. Hamacher. Faciility Location: applicationsand Theory. 1 st ed. New York: Springer-Verlag, 2004, p. 119-143.[5] T. Nogueira and Z. Vale. Natural Gas System Operation: LagrangeanOptimization Techniques. IGRC 2008 – InternationalGas Union Research Conference, ID 147, Category:Transmission, October 2008, Paris, France.ALIO-EURO <strong>2011</strong> – 158


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>A Multi-objective EPSO for Distributed energy resources planningRenan S. Maciel ‡ Mauro Rosa ∗ Valdimiro Mirando ∗ † Antonio Padilha-Feltrin ‡∗ INESC PortoCampus <strong>da</strong> FEUP, Rua Dr. Roberto Frias 378, Porto, Portugal{mrosa, vmiran<strong>da</strong>}@inescporto.pt‡ Department of Electrical Engineering, Sao Paulo State University (UNESP)Ilha Solteira, Brazilrsmaciel@aluno.feis.unesp.br, padilha@<strong>de</strong>e.feis.unesp.br† FEUP, Faculty of Engineering of the University of PortoPorto, PortugalABSTRACTThere is an increasing interest in Multi-objective optimization metaheuristicsto solve complex problems in many different areas. InPower Systems, Multi-objective optimization is also un<strong>de</strong>r intensiveresearch applied to traditional problems and mainly to themost recent trends such as Distributed Energy Resources integrationconsi<strong>de</strong>ring SmartGrids paradigm. Therefore, this paper isproposing a Multi-objective approach to the hybrid EPSO method.The Multi-objective EPSO method, called MEPSO, is applied to adiscrete problem of DER impact evaluation on electric distributionnetwork. It was observed, through the several runs, a better performanceof MEPSO when compared to the NSGA-II method. Despiteof being an initial evaluation, the results encourage to exploitthe best of EPSO characteristics in the Multi-objective domain.Keywords: Multi-objective optimization, Meta-heuristics, EPSO,NSGA-II, DER planning1. INTRODUCTIONThe ever-increasing interest to apply the concepts of Multi-objectiveoptimization (MO) in real-world problems, and the intense exploitationof the meta-heuristics as computational solutions to copewith complex optimization problems are fostering the <strong>de</strong>velopmentof many well known search techniques to the multi-objectivedomain. In power systems planning area, methodologies basedon MO meta-heuristics are un<strong>de</strong>r research, especially in the DistributedEnergy Resources (DER) integration [1]. In general, inthe earlier works, it was used the so-called classic MO methodssuch as Weighted Sum or ε-Constraint based on Genetic Algorithms(GA). As they have presented some limitations [2], newproposals based on Pareto optimality concepts, have being constantlyemployed. Despite the GA-based methods, such as NSGA-II and SPEA2, being more often used on DER planning, there aredifferent MO techniques based on meta-heuristics (e.g. SimulatedAnnealing, Tabu Search, PSO [3], [4], and so on) that take advantageof some specific mechanisms of each meta-heuristic.The Evolutionary Particle Swarm Optimization (EPSO) successfullycombines evolutionary strategies with the PSO method. Acomplete view about performance improvements carried out onEPSO algorithm is reported in the literature, including on powersystem problems [5], [6].This work proposes a Multi-objective EPSO called MEPSO, whichis applied to a discrete problem of DER integration in electricaldistribution networks. First of all, it is thoroughly presented anddiscussed the MEPSO algorithm. After that, an example followedby some remarks, results and discussions is showed.2. THE MULTI-OBJECTIVE EPSO PROPOSAL: MEPSOThe EPSO method merges the efficient PSO movement equationand overall structure with evolutionary strategies, namely self-a<strong>da</strong>ptivemutation and an explicit selection procedure [7]. For theMEPSO approach some steps of the EPSO algorithm are preserved,whereas others are strongly changed in or<strong>de</strong>r to incorporate MOconcepts. Consi<strong>de</strong>ring the general algorithm of EPSO presented in[8], the mutation and replication procedures were fully preservedin the MEPSO. On the other hand, reproduction, evaluation, andselection steps were remo<strong>de</strong>led using some of the MO proceduresintroduced by the NSGA-II method [9]. The general algorithm forMEPSO can be <strong>de</strong>scribed as follow:Parameters and variables initialization;Do i = 0;Do while i < IterMax;Rank and sort the swarm based on the concept of dominance;Up<strong>da</strong>te the Pareto List (PL), an external list that keeps theoptimal solution set;Assign the Global Best (Gb) to each particle of the swarm;Do for each particle:Replicate the particle;Execute on the replicated particle:Mutation of the strategic parameters;Execute on both original and replicated particles:Reproduction (based on PSO movement equation);Assign the Personal best (Pb);Add the replicated particle to Replica List (RL) thatkeeps whole set of replicated particles;Combine the original swarm with the RL;Perform Selection over the combined list of particles;Do i = i + 1;Print PL.The rank and sort of the swarm using the concept of dominancecan be performed in different ways. In this paper, the Fast NondominatedSort (FNS) algorithm [9] was employed. The nondominatedsolutions are the best ranked, belonging to the Paretofront 1. They are followed by the dominated solutions of only onesolution in the front 2. The process follows in the same way untilthe last front.ALIO-EURO <strong>2011</strong> – 159


not being used the Star Communication approach, what personal and global best assignment and in elitism.means that the particles do not will have the same Gbassigned. Except for the front 1, a particle belonging to the 3. DER INTEGRATION PLANNING PROBLEMfront f receives as Gb a solution randomly chosen from the It is wi<strong>de</strong>ly recognized and reported on literature that highfront (f 1). The Gb for front 1 solutions are randomly penetration of DER on distribution networks may offersProc. of the VII ALIO–EURO taken from – a Workshop reduced set onof Applied the PL, Combinatorial called Gb List. Optimization, The together Porto, benefits Portugal, and negative May 4–6, consequences <strong>2011</strong> [11]. Bothsolutions are chosen for Gb List aiming to favor diversity, i.e., looking for better exploration of poorly crow<strong>de</strong>d regionsThe PL up<strong>da</strong>te consists of the ofsearch the merger space. between For this purpose currentthe PLcrowding with the distance to <strong>de</strong>al operation with DER and control as well strategies as capacity to <strong>de</strong>al andwith placement DER as well on network. asfront one. Thus, themetricelimination[9] wasofusedtheinrepeatedor<strong>de</strong>r toandintroducedominate<strong>da</strong> measure ofMO optimizationcapacity and placementstands foronannetwork.interestingMO optimizationway to copestandswith DERdispersion of solutions in the objective function space. The for an interesting way to cope with DER integrationsolutions in the combined set is performed in or<strong>de</strong>r to conclu<strong>de</strong> integration problem, mainly due to suitable property of combineGb List may have a fixed size, as in this approach (was problem, mainly due to suitable property of combinethe up<strong>da</strong>te process. chosen The Gbthe assignment 5 less crow<strong>de</strong>d in MEPSO solutions wasin <strong>de</strong>eply PL), remo<strong>de</strong>led.The swarmpercentage will notof have the PL thesize same [10]. Gb assigned for eachor to be objectives a objectives from from different different natures natures over over aa discrete discrete manner.In or<strong>de</strong>r toIn show or<strong>de</strong>r the to potentialities show the potentialities of the proposed of the method, proposed it is performe<strong>da</strong> simple set of tests involving MEPSO and NSGA-II meth-particle, as occurs with The the Star Pb Communication is assigned after approach the particle [8]. Exceptfor the front 1, movement. where a particle The Pb belonging of a particle tois the the front last non-dominatedf willMEPSO and NSGA-II methods. A simplified mo<strong>de</strong>l of theperforms a method, it is performed a simple set of tests involvingreceive, as Gb, a solution position randomly visited by chosen the particle fromin the its path frontuntil f – the 1. currentods. Aproblemsimplifiedis assumed.mo<strong>de</strong>lItofconsiststhe problemof studyingis assumed.the impactIt consistsoverofThe Gb for the front iteration.studying1 are randomly taken from a reduced set ofthe network the impact losses over and the short network circuit losses level accordingly and short circuit with levelThe selection stage do not consists in a simple accordingly the position withand thesize position of generation and sizeunits. of generation A fixed number units. of A fixedthe PL, called Gb List. The solutions for Gb List are chosen aimingto favor diversity,comparison among the particle and the fitness function of its number generators of generators be to connected be connected is <strong>de</strong>fined. are <strong>de</strong>fined. Also a Also single a singlereplicas.i.e., lookingThe originalfor betterswarmexplorationand the listof poorlyof replicas are generation generation and and load load scenario is is used used in in or<strong>de</strong>r to to prepare theproblemin problem a discrete in a manner discrete with manner a finite with number a finite ofnumber solutions. of Thecrow<strong>de</strong>d regions of combined the searchand space. ranked Hence, using theFNS. crowding Then, distance it applied anmetric [9] was use<strong>de</strong>litist in or<strong>de</strong>r strategy to introduce [9], wherein a measure the swarm of of dispersion the next iteration is methods solutions. codification The methods is <strong>de</strong>tailed codification in [12] is and <strong>de</strong>tailed an example in [12] and is shownof the solutions, into firstly objective composed function by the space. best ranked Thesolutions Gb List and mayafterwar<strong>da</strong>n example is shown in Fig. 1. Each vector positionin Fig. 1. Each vector position indicates an available Distributedsized fixed. In thisusing paper, the the diversity five less criterion crow<strong>de</strong>d based solutions on the crowding PL distance indicates an available Distributed Generation (DG) unit andGeneration (DG) unit and the value assumed in each position indicatesno<strong>de</strong> where the generator is connected.was chosen. However, metric. another alternative can be a percentage ofthe value assumed in each position indicates the no<strong>de</strong> wherethe PL size [10].the generator is connected.2.1. DiscussionAfter the particle performs There are amany movement, features Pb in this assigned. MO EPSO The approach Pb of that canDG units 1 2a particle is the lastbe non-dominated altered and position tested visited seeking by thefor particle performance806 818itself in its path, until improvements. the current iteration.Compare solutions in MO is not trivial like in singleFig. 1. Codification example of the problem.The selection stageobjective doesn’t consist optimization. in a simple Mechanisms comparison have among to be <strong>de</strong>finedFigure 1: Codification example of the problem.the fitness functionpermitting of a particle to <strong>de</strong>al and with its multiple replicas. objectives, The original to accommo<strong>da</strong>te swarm and the list of Pareto replicas optimality are combined concepts and and ranked to obtain using a diversified FNS. and Then, it is applied an close elitist to strategy the true [9], Pareto wherein front the solution swarmset. ofHere the it was In Fig. 1, for instance, the DG unit “1” is connected on no<strong>de</strong> “806”,utilized the dominance-based rank and the crowdingnext iteration is firstly composed by the best ranked solutions accordinglywith dominance method improvements and afterwardover using its the former diversity version. crite-Thus, other short circuit level, and whose tra<strong>de</strong>off relationship is <strong>de</strong>sire<strong>da</strong>nd the 3.1. generator Problem formulation unit “2” on no<strong>de</strong> “818”.distance metric from [9], that corresponds to NSGA-II The two objectives to be minimized, real power loss andrion based on the crowding strategies distance may be matter metric. of investigation.3.1. to Problem be observed, formulation are represented through the two indices ILpThe Gb assignment structure has huge influence on and ISC3 written as follows.convergence. In this approach the main i<strong>de</strong>a is to stimulate i. Total Real Power Losses in<strong>de</strong>x (ILp) [13]: in2.1. DiscussionThe two objectives to be minimized are real power loss and shortdiversity and exploration of the search space. For front 1 it(1) it is evaluated the DG impact over the realis inten<strong>de</strong>d to intensify the search along the PF. However,circuit level. Theypowercouldlossesbe viewedby calculatingthroughthetheratiousebetweenof some indices,where the tra<strong>de</strong>off between both is observed. The indicesThere are many features in this MEPSO approach that can be change<strong>da</strong>nd tested towards performance improvements.ILp and ISC3 are written as follows:To compare solutions in MO problems isn’t quite trivial like in singleobjective optimization. Several mechanisms should be <strong>de</strong>finedin or<strong>de</strong>r to <strong>de</strong>al with multiple objectives, to accommo<strong>da</strong>te Paretooptimality concepts, and to obtain a diversified Pareto front solutionset. Here it was utilized the dominance-based rank and thecrowding distance metric from [9], which corresponds to NSGA-IImethod improvements over its former version. Thus, other strategiesmay be matter of investigation.The Gb assignment structure has huge influence on convergence.In this approach the main i<strong>de</strong>a is to stimulate diversity and explorationof the search space. For front 1 it is inten<strong>de</strong>d to intensify thesearch along the PF. However, changes may be performed on howthe Gb is chosen in a front. In this work it is a randomly procedure,or even the whole assignment procedure.The selection stage has also a high importance. The current proposalexploits elitist procedure from NSGA-II. Nevertheless, sometimesparticle’s information may be lost since a particle and itsreplica can be chosen. Tests can be ma<strong>de</strong> in or<strong>de</strong>r to check theinfluence of this behavior on the method performance.In [10] there is an example of MO PSO based on the NSGA-IImechanisms. However, it is different from this latter approach,because it doesn’t incorporating the ES of EPSO, in the personaland global best assignment and in elitism.3. DER INTEGRATION PLANNING PROBLEMpositive and negative impacts <strong>de</strong>pend on many technicalcharacteristics such as technology used, size of units,1. Total Real Power Losses in<strong>de</strong>x (ILp) [13]: in (1) it is evaluatedthe DG impact over the real power losses by calculatingthe ratio between the network total real power loss fora DG configuration (Loss DG ) and the total real power losswithout DG (Loss 0 ).ILp = LossDGLoss 0 (1)1. Three-Phase Short-Circuit Level in<strong>de</strong>x (ISC3) [13]: this in<strong>de</strong>x,<strong>de</strong>fined in (2), contributes to the DG impacts evaluationconcerning the network fault protection strategies.( )ISC abc DGiISC3 = maxi=1,NN I SC abc 0 iwhere: I SC abc DGi represents three-phase fault current value in no<strong>de</strong>i for a given DG configuration on the network; I SC abc 0 i representsthree-phase fault current value in no<strong>de</strong> i for the network withoutDG; NN is the number of no<strong>de</strong>s.Both indices follow the distribution utilities requirements in termsof DG unit connections. In some cases, losses and short-circuit arethe most important variables for connecting DG.The problem formulation is presented in equations (3) to (8).(2)It is wi<strong>de</strong>ly recognized and reported on literature that high penetrationof DER on distribution networks may offers together benefitsand negative consequences [11] to the systems. Both positive andnegative impacts <strong>de</strong>pend on many technical characteristics suchas technology used, size of units, operation and control strategiesMin ILp (3)Min ISC3 (4)ALIO-EURO <strong>2011</strong> – 160


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Subject to:∣0.95 ·V S/S ≤ ∣ViDG∣ ≤ 1.05 ·V S/S (5)∣∣I DGj ∣ ≤ I maxj (6)n DGi ≤ 1 (7)4. RESULTSTable 2 shows the cardinality of the true PF for constrained andunconstrained problems consi<strong>de</strong>ring both networks.IEEE123IEEE34CONS UNCONS CONS UNCONS|PF true | 29 91 52 76Table 2: Number of Pareto Front points for different cases.N DGnetwork = NDG available (8)where V S/S is rated voltage at the Substation; V DGi is the voltage atthe no<strong>de</strong> i for a given DG configuration; I DGj is the current throughthe branch j for a given DG configuration; I maxj is the maximumrated current for branch j; n DGi is the number of DG units connectedin no<strong>de</strong> i; N DGnetwork is the total number of DG units connectedin the network; and N DGavailable is the total number of DGunits available.This formulation gui<strong>de</strong>s the distribution companies that own DGunits, allowed in some places as a way to invest in the network[14], to take advantage of the connection by means of a tra<strong>de</strong>offanalysis. The study may also represent a scenario of DER notowned by the utility where the information provi<strong>de</strong>d give a portraitof the impact over the network technical performance, thendriving a policy of incentive or not the DER connection on certainlocations.Sometimes even the solutions that violate constraints may haverelevance in the tra<strong>de</strong>off analysis if the gain in some objective justifiesthe investment on turning feasible solutions, <strong>de</strong>pending onthe extent of the violation and the violated constraint. For this reasonand in or<strong>de</strong>r to observe the performance of the methods fora larger number of PF, the tests will be performed for both constrained(CONS) and unconstrained (UNCONS) problems.Two radial electric distribution networks with different features areused: the IEEE-34 and IEEE-123 [15]. The DG units to be allocatedin the networks were <strong>de</strong>fined in such a way to produce asimilar penetration level in both grids. Two generators were chosento each network: one of rated power of 200 kW and anotherof 400 kW for the IEEE-123 network; and one of rated power of100 kW and another of 200 kW for the IEEE-34 network. Theinformation about each network and the search space is shown inTable 1. The substation no<strong>de</strong> is not a candi<strong>da</strong>te no<strong>de</strong> to receive agenerator. It is also presented in the last Table 1 column how muchMaxEvals represents related to the search space size.Network Vs/s No<strong>de</strong>s DG Solutions in Max Max(pu) (except units the search Evals EvalsS/S) space (%)IEEE-123 1.0 113 2 12656 5000 39.5IEEE-34 1.05 32 2 992 400 40.3Table 1: Summary of the networks, tests and search space features.Finally, the methods performance is compared here by the numberof points found in the calculated PF (PF calc ) that belongs to thetrue PF (PF true ). For this purpose it is observed the cardinality ofthe PF calc , the number of dominated solutions (DS) in this set, an<strong>da</strong>lso the metric PF ratio (PFR) is used, <strong>de</strong>fined by (4), which givesthe percentage of PF true found.PFR = |PF calc ∩ PF true |× 100 (9)|PF true |The results are presented in Tables 3 and 4 for both electric networksconsi<strong>de</strong>ring constrained and unconstrained problem.CONS34UNCONS34NSGA-II MEPSO NSGA-II MEPSO|PF calc | 50 50 67 70DS - - - -PFR 96.2 96.2 88.2 92.1Table 3: Summary of results for the CONS34 and UNCONS34test cases.CONS123UNCONS123NSGA-II MEPSO NSGA-II MEPSO|PF calc | 26 28 85 91DS - - - -PFR 89.6 96.6 93.4 100.0Table 4: Summary of results for the CONS123 and UNCONS123test cases.In all test cases the final solution obtained by each method does nothave points dominated by the PF true . Then, both methods <strong>de</strong>monstratedgood convergence to the true PF. However, they generallywere not able to <strong>de</strong>fine the whole PF true set.Comparing the methods performance, except for CONS 34 testcase where both methods had the same PFR, MEPSO found moresolutions of the true PF than NSGA-II, keeping always a PFRhigher than 90%. Additionally, only MEPSO found the whole truePF for the UNCONS123 test case.It is important to remark that although MEPSO presented PFRequal or higher than NSGA-II, it does not means that NSGA-IIsolution set is contained in the MEPSO solution set, as can be seenin the Fig. 2.The EPSO method presents performance improvements and featuresthat can be exploited in MO. This paper shows in <strong>de</strong>tails amulti-objective proposal for EPSO and an example of applicationin Power System research field, where MO is being increasinglyused. The results <strong>de</strong>monstrate that MEPSO is comparable or evenbetter than NSGA-II, a method largely employed in the proposedproblem. MEPSO also preserved a simple framework and a userfriendlyparameter setting.However, MEPSO must be applied to problems with different sizesand features in or<strong>de</strong>r to clearly <strong>de</strong>fine its behavior.5. ACKNOWLEDGEMENTSThis work was supported by the FAPESP (grant no. 2006/06758-9), CNPq (grant no. 303741/2009-0) and CAPES (grant no. 0694/09-6).6. REFERENCES[1] A. Alarcon-Rodriguez, G. Ault, and S. Galloway. Multiobjectiveplanning of distributed energy resources: a reviewALIO-EURO <strong>2011</strong> – 161


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Figure 2: Pareto Front for MEPSO and NSGA-II methods consi<strong>de</strong>ringthe UNCONS34 test case.of the state-of-the-art. Renewable and sustainable energy reviews,vol. 14, pp. 1353-1366, 2010.[2] K. Deb. Multi-objective optimization using evolutionary algorithms.U.K.: John Wiley & Sons Ltd, 2004.[3] M. Reyes-Sierra, and C.A. Coello Coello. Multi-objective particleswarm optimizers: a survey of the state-of-the-art. Internationaljournal of computational intelligence research, vol.2, n. 3, pp. 287-308, 2006.[4] C.A. Coello Coello, G.B. Lamont, D.A.V. Veldhuizen. Evolutionaryalgorithms for solving multi-objetive problem. 2n<strong>de</strong>d., New York: Springer, 2007.[5] V. Miran<strong>da</strong> and N. Fonseca. EPSO – best-of-two-world metaheuristicapplied to power systems problems. in Proc. 2002IEEE congress on evolutionary computation (CEC), pp. 1847-1851.[6] M. Eghbal, E.E. El-Araby, N. Yorino, and Y. Zoka. Applicationof metaheuristic methods to reactive power planning:a comparative study for GA, PSO and EPSO. in Proc. 2007ISIC – IEEE International conference on systems, man andcybernetics, pp. 3755-3760.[7] V. Miran<strong>da</strong>. Hybrid systems. in Mo<strong>de</strong>rn heuristic optimizationtechniques, K. Y. Lee, M. A. El-Sharkawi, Ed. New Jersey:John Wiley & Sons, 2008, pp. 524-562.[8] V. Miran<strong>da</strong>, H. Keko and A.J. Duque. Stochastic star communicationtopology in evolutionary particle swarm (EPSO).International journal of computational intelligence research,vol. 4, n. 2, pp. 105-116, 2008.[9] K. Deb, A. Pratap, S. Agarwal, and T. Meyarivan. A fastand elitist multiobjective genetic algorithm: NSGA-II. IEEETrans. on Evolutionary Computation, vol. 6, n. 2, pp. 182-197,2002.[10] X. Li. A non-dominated sorting particle swarm optimizerfor multiobjective optimization. in Proc. 2003 Genetic an<strong>de</strong>volutionary computation conference (GECCO’2003), pp. 37-48.[11] J.A. Peas-Lopes, N. Hatziargyriou, J. Mutale, P. Djapic andN. Jenkins. Integrating distributed generation into electricpower systems: a review of drivers, challenges and opportunities.Electric power systems research, vol. 77, n. 9, pp.1189-1203, 2007.[12] R.S. Maciel and A. Padilha-Feltrin. Distributed generationimpact evaluation using a multi-objective Tabu Search. inProc. 2009 International conference on intelligent systems applicationsto power systems, pp. 37-48.[13] L.F. Ochoa, A. Padilha-Feltrin, and G. Harrison. Timeseries-basedmaximization of distributed wind power generationintegration. IEEE Trans. on Energy Conversion, vol. 23,n. 3, pp. 968-974, 2008.[14] P. Siano, L.F. Ochoa, G.P. Harrison and A. Piccolo. Assessingthe strategic benefits of distributed generation ownershipfor DNOs. IET Gen., Trans. & Distr., vol. 3, n. 3, pp. 225-236,2009.[15] W.H. Kersting. Radial distribution test fee<strong>de</strong>rs. in Proc. 2001IEEE Power Engineering Society Winter Meeting, vol. 2, pp.908-912.ALIO-EURO <strong>2011</strong> – 162


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>On Using Preprocessing Cuts: I<strong>de</strong>ntification and Probing Schemes in StochasticMixed 0-1 and Combinatorial OptimizationL.F. Escu<strong>de</strong>ro ∗ M.A. Garín † M. Merino ‡ G. Pérez ‡∗ Dpto. Estadística e Investigación OperativaUniversi<strong>da</strong>d Rey Juan Carlos, Móstoles (Madrid), Spainlaureano.escu<strong>de</strong>ro@urjc.es† Dpto. <strong>de</strong> Economía Aplica<strong>da</strong> IIIUniversi<strong>da</strong>d <strong>de</strong>l País Vasco, Bilbao (Vizcaya), Spainmariaaraceli.garin@ehu.es‡ Dpto. <strong>de</strong> Matemática Aplica<strong>da</strong>, Estadística e Investigación OperativaUniversi<strong>da</strong>d <strong>de</strong>l País Vasco, Leioa (Vizcaya), Spain{maria.merino, gloria.perez}@ehu.esABSTRACTWe present a Branch and Fix Coordination algorithm for solvingmedium and large scale multi-stage mixed 0-1 & combinatorialoptimization problems un<strong>de</strong>r uncertainty. The uncertainty is representedvia a nonsymmetric scenario tree. The basic i<strong>de</strong>a consistsof explicitly rewriting the nonanticipativity constraints (NAC)of the 0-1 and continuous variables in the stages with commoninformation. As a result an assignment of the constraint matrixblocks into in<strong>de</strong>pen<strong>de</strong>nt scenario cluster submo<strong>de</strong>ls is performedby a compact representation. This partitioning allows to generatea new information structure to express the NAC which link the relatedclusters, such that the explicit NAC linking the submo<strong>de</strong>ls togetheris performed by a splitting variable representation The newalgorithm has been implemented in a C++ experimental co<strong>de</strong> thatuses the open source optimization engine COIN-OR, for solvingthe auxiliary LP and mixed 0-1 submo<strong>de</strong>ls. Some computationalexperience is reported to vali<strong>da</strong>te the new proposed approach. Wegive computational evi<strong>de</strong>nce of the mo<strong>de</strong>l tightening effect thathave preprocessing techniques in stochastic integer optimization aswell, by using the probing and Gomory and clique cuts i<strong>de</strong>ntificationand appending schemes of the optimization engine of choice.Keywords: Integer Programming, Mathematical Programming,Stochastic integer optimization1. INTRODUCTIONStochastic Optimization is actually one of the most robust toolsfor <strong>de</strong>cision making. It is broadly used in real-world applicationsin a wi<strong>de</strong> range of problems from different areas such as finance,scheduling, production planning, industrial engineering, capacityallocation, energy, air traffic, logistics, etc. The integer problemsun<strong>de</strong>r uncertainty have been studied in [1], [2] and [3], just forciting a few references. An exten<strong>de</strong>d bibliography of StochasticInteger Programming (SIP) has been collected in [4].It is well known that a mixed 0-1 & combinatorial optimizationproblem un<strong>de</strong>r uncertainty with a finite number of possible futurescenarios has a mixed 0-1 Deterministic Equivalent Mo<strong>de</strong>l (DEM),where the risk of providing a wrong solution is inclu<strong>de</strong>d in themo<strong>de</strong>l via a set of representative scenarios. However, as any graphrepresentation of this type of multi-stage mo<strong>de</strong>ls can suggest,the scenario information structuring for this type of problems ismore complex than for the approximation ma<strong>de</strong> by consi<strong>de</strong>ringtwo-stage stochastic mixed 0-1 & combinatorial mo<strong>de</strong>ls. Weshould point out that the scenario tree in real-life problems is veryfrequently a nonsymmetric one and then, the traditional splittingvariable representation for the nonanticipativity constraints (forshort, NAC), see [1, 5], on the 0-1 and continuous variables doesnot appear readily accessible to manipulations that are required bythe <strong>de</strong>composition strategies. A new type of strategies is necessaryfor solving medium and large scale instances of the problem.The <strong>de</strong>composition approaches that appear most promising arebased on some forms of branching selection, and scenario clusterpartitioning and bounding that <strong>de</strong>finitively use the informationabout the separability of the problem, see our work in [6].In full version of this work [7] we present a stochastic mixed 0-1optimization mo<strong>de</strong>ling approach and a parallelizable Branch andFix Coordination (BFC) algorithm for solving general mixed 0-1& combinatorial optimization problems un<strong>de</strong>r uncertainty, whereit is represented by nonsymmetric scenario trees. Given the structuringof the scenario clusters, the approach generates in<strong>de</strong>pen<strong>de</strong>ntcluster submo<strong>de</strong>ls, then, allowing parallel computation for obtaininglower bounds to the optimal solution value as well as feasiblesolutions for the problem until getting the optimal one. We presenta splitting variable representation with explicit NAC for linking thesubmo<strong>de</strong>ls together, and a compact representation for each submo<strong>de</strong>lto treat the implicit NAC related to each of the scenario clusters.Then, the algorithm that we propose uses the Twin No<strong>de</strong> Family(TNF) concept, see [6], and it is specially <strong>de</strong>signed for coordinatingand reinforcing the branching no<strong>de</strong>s and the branching 0-1variable selection strategies at each Branch-and-Fix (BF) tree. Thenonsymmetric scenario tree which will be partitioned into smallerscenario cluster subtrees. The new proposal is <strong>de</strong>noted NonsymmetricBFC-MS algorithm. We report some computational experienceto vali<strong>da</strong>te the new approach by using a testbed of mediumand large scale instances.2. SPLITTING VARIABLE REPRESENTATION INSTOCHASTIC OPTIMIZATIONLet us consi<strong>de</strong>r the following multi-stage <strong>de</strong>terministic mixed 0-1mo<strong>de</strong>lALIO-EURO <strong>2011</strong> – 163


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>min ∑ a t x t + c t y tt∈Ts.t.A 1 x 1 + B 1 y 1 = b 1A tx ′ t−1 + A t x t + B ty ′ t−1 + B t y t = b t ∀t ∈ T − {1}x t ∈ {0,1} nx t, y t ∈ R +ny t,∀t ∈ Twhere T is the set of stages (without loss of generality, let usconsi<strong>de</strong>r that a stage is only inclu<strong>de</strong>d by one time period), suchthat T = |T |, x t and y t are the nx t and ny t dimensional vectorsof the 0-1 and continuous variables, respectively, a t and c t are thevectors of the objective function coefficients, and A t and B t are theconstraint matrices for stage t.This mo<strong>de</strong>l can be exten<strong>de</strong>d to consi<strong>de</strong>r uncertainty in some ofthe main parameters, in our case, the objective function, the rhsand the constraint matrix coefficients. To introduce the uncertaintyin the parameters, we will use a scenario analysis approach. Ascenario consists of a realization of all random variables in allstages, that is, a path through the scenario tree. In this sense, Ωwill <strong>de</strong>note the set of scenarios, ω ∈ Ω will represent a specificscenario, and w ω will <strong>de</strong>note the likelihood or probability assignedby the mo<strong>de</strong>ler to scenario ω, such that ∑ ω∈Ω w ω = 1. We say thattwo scenarios belong to the same group in a given stage provi<strong>de</strong>dthat they have the same realizations of the uncertain parameters upto the stage. Following the nonanticipativity principle, see [1, 5],among others, both scenarios should have the same value for therelated variables with the time in<strong>de</strong>x up to the given stage. Let alsoG <strong>de</strong>note the set of scenario groups (i.e., no<strong>de</strong>s in the un<strong>de</strong>rlyingscenario tree), and G t <strong>de</strong>note the subset of scenario groups thatbelong to stage t ∈ T , such that G = ∪ t∈T G t . Ω g <strong>de</strong>notes the setof scenarios in group g, for g ∈ G .The splitting variable representation of the DEM of the fullrecourse stochastic version related to the multi-stage <strong>de</strong>terministicproblem (1) can be expressed as follows,z MIP = min ∑ ∑ w ω( at ω xtω + ct ω y ω )tω∈Ωt∈Ts.t.A 1 x1 ω + B 1y ω 1 = b 1 ∀ω ∈ ΩA ′ω t x ω t−1 + Aω t x ω t + B ′ω t y ω t−1 + Bω t y ω t = b ω t , ∀ω ∈ Ω, t ≥ 2x ω t − x ω′ t = 0, ∀ω,ω ′ ∈ Ω g : ω ≠ ω ′ , g ∈ G t , t ≤ T − 1y ω t − y ω′ t = 0, ∀ω,ω ′ ∈ Ω g : ω ≠ ω ′ , g ∈ G t , t ≤ T − 1x ω t ∈ {0,1} nxω t , y ω t ∈ R +nyω t , ∀ω ∈ Ω, t ∈ T .Observe that for a given stage t, A ′ ω t and At ω are the technologyand recourse matrices for the x t variables and B ′ ω t and Bt ω are thecorresponding ones for the y t variables. Notice that xt ω − xt ω′ = 0and yt ω − yt ω′ = 0 are the NAC. Finally, nxt ω and nyt ω <strong>de</strong>note thedimensions of the vectors of the variables x and y, respectively,related to stage t un<strong>de</strong>r scenario ω.3. SCENARIO CLUSTERING IN SCENARIO TREESIt is clear that the explicit representation of the NAC is not requiredfor all pairs of scenarios in or<strong>de</strong>r to reduce the dimensions ofmo<strong>de</strong>l. In fact, we can represent implicitly the NAC for some pairsof scenarios in or<strong>de</strong>r to gain computational efficiency.Definition 1. A scenario cluster is a set of scenarios whose NACare implicitly consi<strong>de</strong>red in mo<strong>de</strong>l (2).We will <strong>de</strong>compose the scenario tree into a subset of scenarioclusters, where P = {1,...,q} <strong>de</strong>notes the set of clusters and(1)(2)q = |P|. Let Ω p <strong>de</strong>note the set of scenarios that belongs to ageneric cluster p, where p ∈ P and ∑ q p=1 |Ωp | = |Ω|. It is clearthat the criterion for scenario clustering in the sets, say, Ω 1 ,...,Ω qis instance <strong>de</strong>pen<strong>de</strong>nt. Moreover, we favor the approach that showshigher scenario clustering for greater number of scenario groups incommon. In any case, notice that Ω p ⋂ Ω p′ = /0, p, p ′ = 1,...,q :p ≠ p ′ and Ω = ∪ q p=1 Ωp . Let also G p ⊂ G <strong>de</strong>note the set ofscenario groups for cluster p, such that Ω g ∩ Ω p ≠ /0 means thatg ∈ G p , Gtp = G t ∩G p <strong>de</strong>notes the set of scenario groups for clusterp ∈ P in stage t ∈ T .Definition 2. The break stage t ∗ is the stage t such that thenumber of scenario clusters is q = |G t ∗ +1|, where t ∗ + 1 ∈ T .Observe that cluster p ∈ P inclu<strong>de</strong>s the scenarios that belong togroup g ∈ G t ∗ +1, i.e., Ω p = Ω g .Notice that the choice of t ∗ = 0 corresponds to the full mo<strong>de</strong>l andt ∗ = T − 1 corresponds to the scenario partitioning.4. COMPUTATIONAL EXPERIENCEThe approach has been implemented in a C++ experimental co<strong>de</strong>.It uses the open source optimization engine COIN-OR for solvingthe LP relaxation and mixed 0-1 submo<strong>de</strong>ls, in particular, we haveused the functions: Clp (LP solver), Cbc (MIP solver), Cgl (Cutgenerator), Osi, OsiClp, OsiCbc and CoinUtils.The computational experiments were conducted in a WorkstationDebian Linux (kernel v2.6.26 with 64 bits), 2 processors Xeon5355 (Quad Core with 2x4 cores), 2.664 Ghz and 16 Gb of RAM.Table 1 gives the dimensions of the DEM of the full stochasticmo<strong>de</strong>l in compact representation for difficult medium and largescale problems. Table 2 gives µ, the mean and σ, stan<strong>da</strong>rd<strong>de</strong>viation for dimensions of the cluster submo<strong>de</strong>ls; so, we canobserve the variability of the nonsymmetric clusters. The headingsare as follows: m, number of constraints; nx, number of 0-1variables; ny, number of continuous variables; nel, number ofnonzero coefficients in the constraint matrix; and <strong>de</strong>ns, constraintmatrix <strong>de</strong>nsity (in %).Inst. m nx ny nel <strong>de</strong>nsP1 696 160 376 1550 0.42P2 1202 530 241 3053 0.33P3 7282 1878 4152 20818 0.05P4 16172 4270 9340 53257 0.02P5 23907 5560 11675 68937 0.02P6 32914 6672 14010 105854 0.02P7 2085 450 1155 9105 0.27P8 4696 1090 2516 9935 0.06P9 11298 2668 5962 25262 0.03P10 16870 4600 10430 42015 0.02P11 31648 7984 17676 83252 0.01P12 40020 8847 19377 100680 0.01P13 5256 1176 2904 12861 0.06P14 11121 2538 6045 27315 0.03P15 14570 3370 7830 32508 0.02P16 28176 6584 15008 62934 0.01P17 45844 10794 24256 102480 0.01P18 76424 18108 40208 170954 0.00Table 1: Testbed problem dimensionsTable 3 shows some results of our computational experimentation.The headings are as follows: |P|, number of clusters; |Ω|, numberof scenarios; |G |, number of scenario groups; Z LP , solutionvalue of the LP relaxation of the original DEM in compactALIO-EURO <strong>2011</strong> – 164


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Inst. µ m (σ m ) µ nx (σ nx ) µ nel (σ nel ) µ <strong>de</strong>ns (σ <strong>de</strong>ns )P1 133 (30) 28 (7) 275 (62) 2.31 (0.68)P2 496 (68) 230 (20) 1227 (171) 0.76 (0.08)P3 869 (305) 193 (70) 2145 (767) 0.46 (0.20)P4 1788 (579) 397 (131) 4961 (1617) 0.24 (0.09)P5 2815 (21) 561 (4) 6953 (51) 0.14 (0.00)P6 3823 (28) 673 (5) 10675 (78) 0.13 (0.00)P7 750 (187) 160 (43) 3236 (859) 0.80 (0.20)P8 643 (191) 138 (41) 1259 (372) 0.49 (0.21)P9 1241 (537) 269 (117) 2544 (1097) 0.30 (0.18)P10 2007 (454) 516 (146) 4711 (1333) 0.15 (0.03)P11 3322 (1208) 729 (266) 7608 (2759) 0.11 (0.05)P12 3748 (1455) 740 (288) 8423 (3265) 0.12 (0.06)P13 950 (260) 199 (56) 2171 (610) 0.37 (0.14)P14 1751 (544) 365 (114) 3930 (1217) 0.20 (0.06)P15 1973 (617) 423 (133) 4081 (1275) 0.17 (0.09)P16 3403 (984) 733 (212) 7010 (2025) 0.09 (0.03)P17 5000 (2216) 1081 (480) 10266 (4549) 0.08 (0.05)P18 5126 (1967) 824 (317) 8604 (3300) 0.07 (0.03)Table 2: Testbed cluster-subproblem dimensionsrepresentation; Z 0 , optimal expected solution value obtained bysolving in<strong>de</strong>pen<strong>de</strong>ntly the mixed 0-1 cluster submo<strong>de</strong>ls; z MIP ,optimal solution value of the original DEM.We can observethe very good lower bounds Z 0 , that can allow to improve theconvergence speed of the algorithm.Inst. |P| |Ω| |G | Z LP Z 0 z MIPP1 6 52 80 4395695 4654305 4654305P2 3 6 12 75103.6 58589.1 58585.1P3 10 247 313 5691.3 442336 573848P4 11 347 427 11601.4 725490 903367P5 10 1001 1112 4977.8 385471 468277P6 10 1001 1112 6116.5 540241 653638P7 3 13 30 20210.9 964395 973038P8 8 377 545 3156.8 156064 156064P9 10 1021 1334 3829.5 239683 239683P10 9 674 920 5757.0 394469 505729P11 11 1569 1996 5474.1 401435 401435P12 9 674 920 5757.0 394469 505729P13 6 208 392 8071.8 371498 372296P14 7 523 846 6157.3 339381 339381P15 8 1140 1685 3941.7 212593 212593P16 9 2372 3292 3521.9 258977 258977P17 10 4063 5397 2629.0 303900 303900P18 11 7058 9054 3824.7 318958 318958Table 3: Computational results. Stochastic solutionIt is well known that one of the most important contributions tothe advancement of the theory and applications of <strong>de</strong>terministicinteger & combinatorial optimization has been the <strong>de</strong>velopment ofthe preprocessing techniques for solving large scale instances inaffor<strong>da</strong>ble computing effort, due to the tightening of the mo<strong>de</strong>lsand, so, reducing the LP feasible space without eliminating anyfeasible integer solution that potentially could become the optimalone. Some of the key ingredients in preprocessing are theprobing techniques [8, 9, 10] and schemes for i<strong>de</strong>ntifying an<strong>da</strong>ppending Gomory cuts [11, 12] and clique cuts [13], among otherimportant schemes. So, our algorithm for solving large instancesof the mixed integer DEM takes benefit from the preprocessingtechniques of the optimization engine of choice. They are usedfor solving the auxiliary mixed integer submo<strong>de</strong>ls related to thescenario clusters. The difference in computing time by usingpreprocessing compared with the alternative that does not use itNonsymmetric BFC-MS B&BInst. T nT NF tt tt C tt tt CP1 4 1 0.4 0.3 4000.2 0.8P2 4 114 198.1 138.8 1304.2 1304.2P3 4 8 21.8 1.7 41.4 1.7P4 4 16 171.6 11.7 1530.8 19.4P5 4 8 162.1 8.8 448.5 13.4P6 4 10 229.5 8.5 889.7 48.4P7 5 81 142.5 41.8 188.3 35.9P8 5 1 2.7 0.9 272.3 6.1P9 5 1 9.1 1.4 100.0 4.5P10 5 10 206.6 45.8 7992.7 296.4P12 5 1 80.2 14.9 12113.1 126.8P12 5 7 513.8 66.8 3566.2(*) 867.5P13 6 3 13.2 2.6 1304.2 10.2P14 6 1 14.2 3.5 — 22.9P15 6 1 19.7 4.9 7226.3(*) 19.2P16 6 1 81.2 26.4 628.5(*) 48.5(*)P17 6 1 152.8 8.7 1897.3 67.3P18 6 1 377.0 24.1 — 202.9—: Time limit excee<strong>de</strong>d (6 hours)(*): Time for obtaining quasioptimum (0.05)Table 4: Nonsymmetric BFC-MS performance vs B&Bis crucial in solving large scale instances. Table 4 shows theefficiency and stability of the Nonsymmetric BFC-MS algorithmproposed in the full version [7] of the paper. The headings areas follows: T , number of stages; nT NF, number of TNFs; B&B,plain use of the Branch-and-Bound procedure for the full mo<strong>de</strong>l byusing the Cbc function of COIN-OR; and tt and tt C , total elapsedtime (in seconds) without and with preprocessing. Although otherbreak stages have been consi<strong>de</strong>red, we have obtained the bestresults with the break stage t ∗ = 1 and, then, q = |G 2 | for boththe Nonsymmetric BFC-MS algorithm and the plain use of the Cbcfunction of COIN-OR.5. CONCLUSIONSA mo<strong>de</strong>ling approach and an exact Branch-and-Fix Coordinationalgorithmic framework, so-called Nonsymmetric BFC-MS, isbeing proposed in the full version of the paper for solving multistagemixed 0-1 & combinatorial problems un<strong>de</strong>r uncertainty inthe parameters. The 0-1 and continuous variables can also appearat any stage. The approach treats the uncertainty by scenariocluster analysis, allowing the scenario tree to be nonsymmetric.This last feature has not been consi<strong>de</strong>red in the literature thatwe are aware of. However, in our opinion, it is crucial forsolving medium and large scale problems, since the real-life mixedinteger optimization problems un<strong>de</strong>r uncertainty that, at least, wehave encountered have very frequently nonsymmetric scenariosto represent the uncertainty. We can observe (1) the efficiencyof using the preprocessing techniques (i.e., probing and Gomoryand clique cuts i<strong>de</strong>ntification and appending schemes) and (2)the astonishing small computing time required by the propose<strong>da</strong>lgorithm, such that it clearly outperforms the plain use of theoptimization engine of choice.6. ACKNOWLEDGEMENTSThis research has been partially supported by the projectsECO2008-00777 ECON from the Ministry of Education and Science,Grupo <strong>de</strong> Investigación IT-347-10 from the Basque Government,URJC-CM-2008-CET-3703 and RIESGOS CM fromALIO-EURO <strong>2011</strong> – 165


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Comuni<strong>da</strong>d <strong>de</strong> Madrid, and PLANIN MTM2009-14087-C04-01from Ministry of Science and Innovation, Spain. The full paper isto be submitted for publication in a regular journal.7. REFERENCES[1] J. Birge and F. Louveaux, Introduction to Stochastic Programming.Springer, 1997.[2] R. Schultz, “Stochastic programming with integer variables,”Mathematical Programming Ser. B, vol. 97, pp. 285–309,2003.[3] R. Schultz and S. Tie<strong>de</strong>mann, “Conditional value-at-risk instochastic programs with mixed integer recourse,” MathematicalProgramming Ser. B, vol. 105, pp. 365–386, 2006.[4] M. H. van <strong>de</strong>r Vlerk, “Stochastic integer programming bibliography,”World Wi<strong>de</strong> Web, http://www.eco.rug.nl/mally/biblio/sip.html, 1996-2007.[5] R. Rockafellar and R.-B. Wets, “Scenario and policy aggregationin optimisation un<strong>de</strong>r uncertainty,” Mathematics ofOperations Research, vol. 16, pp. 119–147, 1991.[6] M. M. L.F. Escu<strong>de</strong>ro, A. Garín and G. Pérez, “On bfcmsmipstrategies for scenario cluster partitioning, and twinno<strong>de</strong> family branching selection and bounding for multistagestochastic mixed integer programming,” Computers & OperationsResearch, vol. 37, pp. 738–753, 2010.[7] ——, “An algorithmic framework for solving large scalemulti-stage stochastic mixed 0-1 problems with nonsymmetricscenario trees,” To be submmited for publication, <strong>2011</strong>.[8] E. J. M. Guignard and K. Spielberg, “Logical processing ininteger programming,” Annals of Operations Research, vol.140, pp. 263–304, 2005.[9] M. Guignard and K. Spielberg, “Logical reduction methodsin zero-one programming. minimal preferred variables,”Operations Research, vol. 29, pp. 49–74, 1981.[10] M. Savelsbergh, “Preprocessing and probing techniques formixed integer programming problems,” ORSA Jornal ofComputing, vol. 6, pp. 445–454, 1994.[11] G. Cornuejols, “Revival of the gomory cuts in the 1990s,”Operations Research, vol. 149, pp. 63–66, 2007.[12] R. Gomory, Recent Advances in Mathematical Programming.R.L. Graves and P. Wolfe, Eds. McGraw-Hill, 1963,ch. An algorithm for integer solutions to linear programs, pp.269–302.[13] E. J. H. Crow<strong>de</strong>r and M. Padberg, “Solving large-scale zeroonelinear programming problems,” Annals of OperationsResearch, vol. 31, pp. 803–834, 1983.ALIO-EURO <strong>2011</strong> – 166


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Scenario cluster lagrangean <strong>de</strong>composition in stochastic mixed integerprogrammingL.F. Escu<strong>de</strong>ro ∗ M.A. Garín † G. Pérez ‡ A. Unzueta †∗ Dpto. Estadística e Investigación OperativaUniversi<strong>da</strong>d Rey Juan Carlos, Móstoles (Madrid), Spainlaureano.escu<strong>de</strong>ro@urjc.es† Dpto. <strong>de</strong> Economía Aplica<strong>da</strong> IIIUniversi<strong>da</strong>d <strong>de</strong>l País Vasco, Bilbao (Vizcaya), Spain{mariaaraceli.garin, aitziber.unzueta}@ehu.es‡ Dpto. <strong>de</strong> Matemática Aplica<strong>da</strong>, Estadística e Investigación OperativaUniversi<strong>da</strong>d <strong>de</strong>l País Vasco, Leioa (Vizcaya), Spaingloria.perez@ehu.esABSTRACTIn this paper we introduce a scenario cluster based Lagrangean Decomposition(LD) scheme for obtaining strong lower bounds to theoptimal solution of two-stage stochastic mixed 0-1 problems. Ateach iteration of the Lagrangean based procedures, the traditionalaim consists of obtaining the optimal solution value of the correspondingLagrangean dual via solving scenario submo<strong>de</strong>ls oncethe nonanticipativity constraints have been dualized. Instead ofconsi<strong>de</strong>ring a splitting variable representation over the set of scenarios,we propose to <strong>de</strong>compose the mo<strong>de</strong>l into a set of scenarioclusters. We compare the computational performance of severalLagrangean dual schemes, as the Subgradient Method, the VolumeAlgorithm and the Progressive Hedging Algorithm for differentnumber of the scenario clusters and different dimensions of theoriginal problem. Our computational experience shows how thebound value and its computational effort <strong>de</strong>pend on the number ofscenario clusters to consi<strong>de</strong>r. In any case, the computational experiencereported in this exten<strong>de</strong>d abstract (as well as the extensiveone reported in the full paper) shows that the scenario cluster LDscheme outperforms the traditional LD scheme for single scenariosboth in lower bounds’s quality and computing effort. All the procedureshave been implemented in a C++ experimental co<strong>de</strong> thatuses the open source optimization engine COIN-OR, for solvingthe auxiliary LP and mixed 0-1 cluster submo<strong>de</strong>ls. We also givecomputational evi<strong>de</strong>nce of the mo<strong>de</strong>l tightening effect that preprocessingtechniques have in stochastic integer optimization as well,by using the probing and Gomory and clique cuts i<strong>de</strong>ntificationand appending schemes of the optimization engine of choice.Keywords: Stochastic integer programming, Lagrangean <strong>de</strong>composition,Subgradient, Volume, Progressive hedging algorithm, Scenarioclusters1. INTRODUCTIONIn this work we consi<strong>de</strong>r a general two-stage stochastic mixed 0-1problem. The uncertainty is mo<strong>de</strong>led via a finite set of scenariosω = 1,...,|Ω|, each with an associated probability of occurrencew ω , ω ∈ Ω. The traditional aim in this type of problems is tosolve the so-called Deterministic Equivalent Mo<strong>de</strong>l (DEM), whichis a mixed 0-1 problem with a special structure, see e.g. [1] fora good survey on some mayor results in the area obtained duringthe last <strong>de</strong>ca<strong>de</strong>. A Branch-and-Bound algorithm for problemshaving mixed-integer variables in both stages is <strong>de</strong>signed in [2],among others, by using Lagrangean relaxation for obtaining lowerbounds to the optimal solution of the original problem. A Branchand-FixCoordination (BFC) methodology for solving such DEMin production planning un<strong>de</strong>r uncertainty is given in [3, 4], butthe approach does not allow continuous first stage variables or 0-1 second stage variables. We propose in [5, 6] a BFC algorithmicframework for obtaining the optimal solution of the two-stagestochastic mixed 0-1 integer problem, where the uncertainty appearsanywhere in the coefficients of the 0-1 and continuous variablesin both stages. Recently, a general algorithm for two-stageproblems is <strong>de</strong>scribed in [7]. We study in [8] several solution methodsfor solving the dual problem corresponding to the LagrangeanDecomposition (LD) of two-stage stochastic mixed 0-1 mo<strong>de</strong>ls.At each iteration of these Lagrangean based procedures, the traditionalaim consists of obtaining the optimal solution value ofthe corresponding parametric mixed 0-1 Lagrangean dual problemvia solving scenario submo<strong>de</strong>ls once the nonanticipativity constraints(NAC) have been dualized, and the parameters (i.e., theLagrangean multipliers) are up<strong>da</strong>ted by using different subgradientbased methodologies.Instead of consi<strong>de</strong>ring a splitting variable representation over theset of scenarios, in this paper we propose to <strong>de</strong>compose the mo<strong>de</strong>linto a set of scenario clusters. For different choices of the numberof scenario clusters we computationally compare the solutiongiven by the plain use of the optimization engine COIN-OR, see[9], against various schemes for solving the Lagrangean dual problems.After this comparison we observe that very frequently thenew bounds give the optimal solution to the original problem. Moreover,the performance of the scenario cluster LD scheme outperformsthe LD scheme based on single scenarios in both the bounds’squality and computing effort. These successful results may openthe possibility for tightening the lower bounds of the solution atthe candi<strong>da</strong>te Twin No<strong>de</strong> Families in the exact BFC scheme forboth two-stage and multistage types of problems.2. TWO-STAGE STOCHASTIC MIXED 0-1 PROBLEMLet us consi<strong>de</strong>r the compact representation of the DEM of a twostagestochastic mixed integer problem (MIP),ALIO-EURO <strong>2011</strong> – 167


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>(MIP) :s.t.z MIP = minc T 1 δ + cT 2 x + ∑ [w ω q ωT1 γ ω + w ω q ωT2 y ω ]( ) ω∈Ωδb 1 ≤ A ≤ bx 2( ) ( )h ω 1 ≤ T ω δ+Wxω γ ωy ω ≤ h ω 2 ,ω ∈ Ωδ,γ ω ∈ {0,1},x,y ω ≥ 0,∀ω ∈ Ω,where the uncertainty in the parameters is introduced by using ascenario analysis approach, such that a scenario consists of a realizationof all random variables in both stages through a scenariotree. Notice that there are two types of <strong>de</strong>cision variables at eachstage, namely, the set of δ 0-1 and x continuous variables for thefirst stage, and the set of γ ω 0-1 and y ω continuous variables forthe second stage. Notice also that for simplifying reasons, the objectivefunction to optimize in the mo<strong>de</strong>ls <strong>de</strong>alt with in this paperis the expected value over the set of scenarios Ω.Let us suppose that we have selected a set of scenario clustersfor the second stage, whose number is say ˆp. In general, givena scenario tree, ˆp can be chosen as any value between 1 and |Ω|.Now, we can represent the MIP mo<strong>de</strong>l (1) by a splitting variablerepresentation, see [10, 11] among others, where the full mo<strong>de</strong>lis inclu<strong>de</strong>d by the ˆp cluster submo<strong>de</strong>ls and their related linkingNAC. Additionally, we consi<strong>de</strong>r a compact representation for theΩ p scenarios into each cluster submo<strong>de</strong>l p, where p ∈ {1,..., ˆp},and |Ω p | <strong>de</strong>fines the size of scenario cluster, p, i.e., the number ofscenarios that belong to the corresponding cluster, for p = 1,..., ˆp.The scenario clusters are <strong>de</strong>fined in terms of consecutive scenarios,Ω 1 = {1,...,|Ω 1 |}, Ω 2 = {|Ω 1 | + 1,...,|Ω 1 | + |Ω 2 |},..., Ω ˆp ={|Ω 1 |+...+|Ω ˆp−1 |+1,...,|Ω|}. The mixed 0-1 submo<strong>de</strong>l to consi<strong>de</strong>rfor each scenario cluster p can be expressed by the compactrepresentation,(MIP p ) : z p = minw p c T 1 δ p + w p c T 2 xp + ∑ w ω [q ωT1 γ ω + q ωT2 y ω ]( ) ω∈Ω pδ ps.t. b 1 ≤ Ax p ≤ b 2( ) ( )h ω 1 ≤ T ω δ px p +W ω γ ωy ω ≤ h ω 2 ,ω ∈ Ωpx p ≥ 0,δ p ∈ {0,1},γ ω ∈ {0,1},y ω ≥ 0,∀ω ∈ Ω p ,(2)where w p = ∑ ω∈Ω p w ω <strong>de</strong>notes the likelihood for scenario clusterp, and δ p and x p are the variable vectors δ and x for scenariocluster p. Moreover, the ˆp submo<strong>de</strong>ls (2) are linked by the NAC(1)δ p − δ p′ = 0 (3)x p − x p′ = 0, (4)for p, p ′ = 1,..., ˆp : p ≠ p ′ . So, the mixed 0-1 DEM (1) is equivalentto the splitting variable representation over the set of scenarioclusters,(MIP) : z MIP = ∑ ˆp p=1 zps.t.δ p − δ p+1 ≤ 0, ∀p = 1,..., ˆp − 1,δ ˆp ≤ δ 1x p − x p+1 ≤ 0, ∀p = 1,..., ˆp − 1,.x ˆp ≤ x 1 .Observe that the NAC (3)-(4) have been represented as the set ofinequalities (6), in or<strong>de</strong>r to avoid the use of non-signed vectorsof Lagrangean multipliers in the dualization of such constraints,see below. Additionally, notice that for ˆp = 1, the mo<strong>de</strong>l (5)-(6)coinci<strong>de</strong>s with the mixed 0-1 DEM in the compact representation(1), and for ˆp = |Ω| we obtain the splitting variable representationvia scenarios.3. SCENARIO CLUSTERING IN SCENARIO TREESThe scenario cluster Lagrangean Decomposition (LD) of the mixed0-1 DEM, (MIP) mo<strong>de</strong>l (5)-(6), for a given set of scenario clustersand a given nonnegative vector of Lagrangean multipliers µ =(5)(6)(µ δ , µ x ), is the µ-parametric mixed 0-1 minimization mo<strong>de</strong>l (7) in(δ,x,γ,y) with objective function value z LD (µ, ˆp). Let us <strong>de</strong>notethis mo<strong>de</strong>l as (MIP ˆp LD (µ)).(MIP ˆp LD (µ)) :s.t.ˆpz LD(µ, ˆp) = min ∑ [w p c T 1 δ p + w p c T 2 x p +p=1+ ∑ w ω [q ωT1 γ ω + q ωT2 y ω ]]+ω∈Ω p+ ˆp−1∑ µ p δ (δ p − δ p+1 ) + µ ˆp δ (δ ˆp − δ 1 )+p=1+ ˆp−1∑ µ x p (x p − x p+1 ) + µ x ˆp (x ˆp − x 1 )p=1( )δ pb 1 ≤ Ax p ≤ b 2 , p = 1,..., ˆph ω 1 ≤ T ω (δ px p )+W ω (γ ωy ω )≤ h ω 2 ,ω ∈ Ωp , p = 1,..., ˆpx p ≥ 0,δ p ∈ {0,1},∀p = 1,..., ˆpy ω ≥ 0,γ ω ∈ {0,1},∀ω ∈ Ω p , p = 1,..., ˆpIt is well known that (MIP ˆp LD (µ)) is a relaxation of (MIP), since (i)the feasible set of (MIP ˆp LD (µ)) contains the feasible set of (MIP),and (ii) for any (δ,x,γ,y) feasible for (MIP) and any µ ≥ 0 and1 < ˆp ≤ |Ω|, it results that z LD (µ, ˆp) ≤ z MIP . Notice that if ˆp = 1,for any µ ≥ 0, z LD (µ,1) = z MIP by <strong>de</strong>finition of the compact representation.Then, it follows that the optimal value z LD (µ, ˆp), which<strong>de</strong>pends on µ is a lower bound of the optimal value of (MIP), z MIPfor any choice of ˆp, with 1 < ˆp ≤ |Ω|.Definition 1. For any choice of ˆp, with 1 < ˆp ≤ |Ω|, the problemof finding the tightest Lagrangean lower bound on z MIP is(MIP LD ) : z LD = maxµ≥0 z LD(µ, ˆp).It is called Lagrangean dual of (MIP) relative to the (complicating)NAC (6), and ˆp <strong>de</strong>notes the number of scenario clusters.It can be shown, see [16], that the Lagrangean <strong>de</strong>composition givesequal or stronger bounds of the solution value of the original problemthan the Lagrangean relaxation of the constraints related toany of the scenario clusters to be <strong>de</strong>composed. See also [14].Given a choice of the set of ˆp scenario clusters, the µ-parametric(MIP ˆp LD (µ)) mo<strong>de</strong>l (7) must be solved, where the parametric vector(µ) = (µ δ , µ x ) is given. Moreover, the corresponding objectivefunction in (7) can be rewritten as the sum of the objective functionvalues of smaller submo<strong>de</strong>ls, one for each scenario cluster.s.t.z LD (µ, ˆp) = min∑ ˆp p=2 [[wp c T 1 + (µ p δ − µ p−1δ )]δ p ++ [w p c T 2 + (µ x p − µ x p−1 )]x p ++ ∑ ω∈Ω p w ω [q ωT1 γ ω + q ωT2 y ω ]]++ [w 1 c T 1 + (µ1 δ − µ ˆp δ )]δ 1 ++ [w 1 c T 2 + (µ1 x − µ x ˆp )]x 1 ++ ∑ ω∈Ω 1 w ω [q ωT1 γ ω + q ωT2 y ω ]( )δ pb 1 ≤ Ax p ≤ b 2 ,p = 1,..., ˆph ω 1 ≤ T ω (δ px p )+W ω (γ ωy ω )≤ h ω 2 ,x p ≥ 0,δ p ∈ {0,1},y ω ≥ 0,γ ω ∈ {0,1},∀p = 1,..., ˆp∀ω ∈ Ω p , p = 1,..., ˆp(7)(8)ω ∈ Ωp , p = 1,..., ˆpThat is, the MIP ˆp LD (µ) mo<strong>de</strong>l can be <strong>de</strong>composed in ˆp smallersubmo<strong>de</strong>ls, and its optimal solution value be calculated as the sumof the related z p LD (µ p ) values, i.e., the optimal solution value ofeach pth scenario cluster mo<strong>de</strong>l, and p = 1,..., ˆp.ALIO-EURO <strong>2011</strong> – 168


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>4. COMPUTATIONAL EXPERIENCEWe have implemented the three procedures: Subgradient method[17], Volume algorithm [12] and Progresive Hedging Algorithm[11] in a C++ experimental co<strong>de</strong>. The free optimization engineCOIN-OR is used for solving the linear and mixed 0-1 auxiliarysubmo<strong>de</strong>ls and the whole mo<strong>de</strong>l as well. The computational experimentswere conducted in a Workstation Debian Linux (kernelv2.6.26 with 64 bits), 2 processors Xeon 5355 (Quad Core with2x4 cores), 2.664 Ghz and 16 Gb of RAM. Table 1 gives the dimensionsof the mixed 0-1 DEM, in compact representation. Theheadings are as follows: m, number of constraints; n δ+γ , numberof 0-1 variables; n x+y , number of continuous variables; nel,number of nonzero coefficients in the constraint matrix; <strong>de</strong>ns, constraintmatrix <strong>de</strong>nsity (in %); and |Ω|, number of scenarios. Thetestbed used for the reported experimentation is available from theauthors un<strong>de</strong>r request.Table 1: Testbed problem dimensionsCase m n δ+γ n x+y nel <strong>de</strong>ns |Ω|P1 136 132 132 2112 5.88 32P2 148 138 138 3984 9.75 32P3 324 483 327 6440 2.45 80P4 520 516 516 8256 1.54 128P5 520 516 516 8256 1.54 128P6 516 771 519 10280 1.54 128P7 532 522 522 14736 2.65 128P8 1290 1290 1290 51400 1.54 128P9 712 612 612 146496 16.81 128Table 2: Stochastic SolutionCase z MIP z LP GAP T COIN T LP z LDP1 -80.48 -81.14 0.81 0.71 0.01 -73.02P2 -99.89 -100.42 0.52 1.12 0.02 -90.38P3 -45.61 -47.48 3.95 256.30 0.03 -4.74P4 -23.86 -27.19 12.23 10.55 0.05 -13.59P5 -28.75 -31.71 9.33 16115.80 0.04 -3.17P6 — -52.77 — — 0.03 -5.27P7 -218.67 -277.95 21.33 6.33 0.07 -27.79P8 — -63.76 — — 0.13 -6.37P9 -1937.85 -2070.47 6.40 2.01 0.36 -5.27—: Time limit excee<strong>de</strong>d (7 hours)Table 2 shows some results of our computational experimentation.See an extensive computational experience in the full paper [19].The headings of table 2 are as follows: z MIP and z LP , solutionvalues of the original stochastic mixed 0-1 problem and its LP relaxation,respectively; GAP, optimality gap <strong>de</strong>fined as z MIP−z LPz LP(in%); T COIN and T LP , elapsed times (in seconds) to obtain the z MIPand z LP solution values, respectively, by plain use of COIN-OR;and z LD , upper bound of the optimal solution value of the originalproblem.Table 3 shows some of our main computational results. We presentthe Lagrangen bounds that we obtain, with ˆp = 4 scenario clusters,see the results for other choices of the number of clusters in the fullpaper. The headings are as follows: z SUB , z VOL , and z PHA , lowerbounds of the optimal solution for the original problem obtainedby the Subgradient Method (SUB), Volume Algorithm (VOL) andProgressive Hedging Algorithm (PHA), respectively; T SUB , T VOLand T PHA elapsed times (in seconds) to compute the related Lagrangeanbounds; and, finally, nit SUB , nit VOL and nit PHA , numberof iterations to compute the corresponding bounds.The results in bold font are those where the Lagrangean boundcoinci<strong>de</strong>s with the optimal solution value of the original stochasticTable 3: Lagrangean bounds with ˆp = 4 scenario clustersz SUB T SUB nit SUBP1 -80.48 7.42 35P2 -99.89 3.72 20z VOL T VOL nit VOLP1 -80.48 7.54 37P2 -99.94 0.99 5z PHA T PHA nit PHAP1 -80.48 15.25 72P2 -99.89 5.66 30z SUB T SUB nit SUBP3 -45.61 22.09 35z VOL T VOL nit VOLP3 -45.61 28.04 41z PHA T PHA nit PHAP3 -45.64 31.44 53z SUB T SUB nit SUBP4 -23.86 23.65 21P5 -28.76 21.86 10P6 -49.79 16.10 0P7 -218.67 0.66 0P8 -61.63 10593.90 29P9 -1937.85 2.19 0z VOL T VOL nit VOLP4 -23.86 235.52 202P5 -28.75 926.75 362P6 -49.79 16.54 0P7 -218.67 0.66 0P8 -61.76 1212.43 3P9 -1937.85 0.53 0z PHA T PHA nit PHAP4 -23.86 32.89 32P5 -28.76 68.24 33P6 -49.79 16.26 0P7 -218.67 0.60 0P8 -61.60 22071.50 59P9 -1937.85 0.42 0integer problem. Notice that in general all the Lagrangean boundsobtained are very close to the optimal solution value, so, they arevery good bounds, and are obtained after few iterations (zero, inmany cases).We propose to use preprocessing and probing techniques [15] andschemes for i<strong>de</strong>ntifying and appending Gomory cuts and cliquecuts [18] before solving the scenario cluster mixed 0-1 submo<strong>de</strong>ls.As it is well known, due to the tightening of these mo<strong>de</strong>ls, it ispossible to reduce the LP feasible space without eliminating anyfeasible integer solution that potentially could become the optimalone. The difference in computing time by using preprocessingcompared with the alternative that does not use it can be crucialin the whole procedure for obtaining good bounds for large scaleinstances with affor<strong>da</strong>ble computing effort. However, a very goodperformance of the Volume Algorithm has been reported in [13]for a highly combinatorial problem as it is the stochastic set packingproblem.5. CONCLUSIONSIn this paper we have presented a scenario cluster based LagrangeanDecomposition (LD) scheme for obtaining strong lower bounds tothe optimal solution of two-stage stochastic mixed integer problems,where the uncertainty appears anywhere in the coefficientsof the 0-1 and continuous variables in the objective function andALIO-EURO <strong>2011</strong> – 169


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>constraints in both stages. For obtaining the bounds we have usedthree popular subgradient based schemes, namely, the traditionalSubgradient Method, the Volume Algorithm and the ProgressiveHedging Algorithm. Based on the computational results that wehave presented (and the extensive computational experience reportedin the full paper), we can draw some conclusions: (1) Veryfrequently the new bounds give the optimal solution to the originalproblem; (2) The performance of the scenario cluster LD schemeoutperforms the LD scheme based on single scenarios in both thebounds’s quality and computing effort; and (3) it is difficult for thenew Lagrangean multipliers up<strong>da</strong>ting schemes to outperform thetraditional Subgradient Method in this type of problems.6. ACKNOWLEDGEMENTSThis research has been partially supported by the projects ECO2008-00777 ECON from the Ministry of Education and Science, Grupo<strong>de</strong> Investigación IT-347-10 from the Basque Government, grantFPU ECO2006 from the Ministry of Education and Science, URJC-CM-2008-CET-3703 and RIESGOS CM from Comuni<strong>da</strong>d <strong>de</strong>Madrid, and PLANIN MTM2009-14087-C04-01 from Ministry ofScience and Innovation, Spain. The full paper is to be submittedfor publication in a regular journal.7. REFERENCES[1] R. Schultz, “Stochastic programming with integer variables,”Mathematical Programming Ser. B, vol. 97, 2003.[2] C. Carøe and R. Schultz, “Dual <strong>de</strong>composition in stochasticinteger programming,” Operations Research Letters, vol. 24,1999.[3] A. Alonso-Ayuso, L. Escu<strong>de</strong>ro, and M. Ortuño, “Branchand-fixcoordination algorithmic framework for solvingsome types of stochastic pure and mixed 0-1 programs,” EuropeanJournal of Operational Research, vol. 151, 2003.[4] A. Alonso-Ayuso, L. Escu<strong>de</strong>ro, M. Garín, M. Ortuño, andG. Pérez, “An approach for strategic supply chain planningbased on stochastic 0–1 programming,” Journal of GlobalOptimization, vol. 26, 2003.[5] L. Escu<strong>de</strong>ro, M. Garín, M. Merino, and G. Pérez, “A generalalgorithm for solving two-stage stochastic mixed 0-1first stage problems,” Computers and Operations Research,vol. 36, 2009.[6] ——, “An exact algorithm for solving large-scale two-stagestochastic mixed integer problems: some theoretical and experimentalaspects,” European Journal of Operational Research,vol. 204, 2010.[7] H. Sherali and J. Smith, “Two-stage hierarchical multiplerisk problems: Mo<strong>de</strong>ls and algorithms,” Mathematical ProgrammingS. A, vol. 120, 2009.[8] L. Escu<strong>de</strong>ro, M. Garín, G. Pérez, and A. Unzueta, “Lagrangean<strong>de</strong>composition for large-scale two-stage stochasticmixed 0-1 problems,” Working paper serie BiltokiDT.2010.07. http://econpapers.repec.org/paper/ehubiltok/201007.htm, UPV/EHU. Also submmited to TOP, 2010.[9] INFORMS, “Coin-or: Computational infrastructure for operationsresearch,” www.coin-or.org, 2010.[10] J. Birge and F. Louveaux, Introduction to Stochastic Programming.Springer, 1997.[11] R. Rockafellar and R.-B. Wets, “Scenario and policy aggregationin optimisation un<strong>de</strong>r uncertainty,” Mathematics ofOperations Research, vol. 16, 1991.[12] F. Barahona and R. Anbil, “The volume algorithm: Producingprimal solutions with a subgradient method,” MathematicalProgramming, vol. 87, 2000.[13] L. Escu<strong>de</strong>ro, M. Lan<strong>de</strong>te, and A. Rodriguez-Chia, “Stochasticset packing problem,” European Journal of OperationalResearch, accepted for publication, 2010.[14] M. Guignard, “Lagrangean relaxation,” TOP, vol. 11, 2003.[15] M. Guignard and K. Spielberg, “Logical reduction methodsin zero-one programming. minimal preferred variables,” OperationsResearch, vol. 29, 2003.[16] M. Guignard and S. Kim, “Lagrangean <strong>de</strong>composition. amo<strong>de</strong>l yielding stronger lagrangean bounds,” Mathematicalprogramming, vol. 39, 1987.[17] M. Held and R. M. Karp, “The traveling salesman problemand minimum spanning trees: part ii,” Mathematical programming,vol. 1, 1971.[18] G. Cornuejols, “Revival of the gomory cuts in the 1990s,”Operations Research, vol. 149, 2007.[19] L. Escu<strong>de</strong>ro, M. Garín, G. Pérez, and A. Unzueta, “Clusterbased <strong>de</strong>composition of lagrangean duals,” To be submmitedfor publication.ALIO-EURO <strong>2011</strong> – 170


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Positive Edge: A Pricing Criterion for the I<strong>de</strong>ntification of Non-<strong>de</strong>generateSimplex PivotsVincent Raymond † Francois Soumis ∗ † Ab<strong>de</strong>lmoutalib Metrane ∗ Mehdi Towhidi ∗Jacques Desrosiers ∗ ‡∗ GERADMontreal, Cana<strong>da</strong> H3T 2A7{francois.soumis, ab<strong>de</strong>lmoutalib.metrane, mehdi.towhidi}@gerad.ca† Ecole Polytechnique <strong>de</strong> MontrealMontreal, Cana<strong>da</strong> H3C 3A7vincent.raymond@polymtl.ca‡ HEC MontrealMontreal, Cana<strong>da</strong> H3T 2A7jacques.<strong>de</strong>srosiers@hec.caABSTRACTThe Positive Edge is a new pricing rule for the Primal Simplex: iti<strong>de</strong>ntifies, with a probability error less than or equal to 2 −62 in doubleprecision binary floating-point format, variables allowing fornon-<strong>de</strong>generate pivots. These are i<strong>de</strong>ntified directly from a shortcalculation on the original coefficients of the constraint matrix. Ifsuch a variable has a negative reduced cost, it strictly improves theobjective function value when entered into the basis. Preliminarycomputational experiments ma<strong>de</strong> with CPLEX and COIN-OR showits high potential.Keywords: Linear programming, Simplex, Degeneracy1. INTRODUCTIONConsi<strong>de</strong>r the following linear programming problem (LP) in stan<strong>da</strong>rdformminimize c ⊤ x subject to: Ax = b, x ≥ 0, (1)where x,c ∈ R n , A ∈ R m × R n , and b ∈ R m . We are interested inproblems for which the basic solutions are highly <strong>de</strong>generate, thatis, for which the number of non-zero variables is much less thanm, the size of the basis. In that case, the Primal Simplex algorithmis likely to encounter <strong>de</strong>generate pivots and possibly to cycle. Toavoid cycling, several pivot rules and right-hand si<strong>de</strong> perturbationmethods have been proposed, e.g., [2, 10, 1, 6, 9]. However, thesedo not strongly improve the performance of the Primal Simplexalgorithm. Another way is by using the steepest edge criterion [5]which computes the improvement of the cost function for possibleentering variables. Hence, if one exists, it selects a variable with anon-<strong>de</strong>generate pivot. However, this requires a significant amountof CPU time.Pan [7] proposes the use of a reduced problem with a smaller numberof constraints and variables. The method starts with an initialbasic solution and i<strong>de</strong>ntifies its p non-zero basic variables. Constraintsare split in two: set P where the basic variables takes apositive value and set Z where the basic variables are zero. Variablesare also split in two sets. Compatible variables are thosefor which all values are zero in the up<strong>da</strong>ted simplex tableau forconstraint indices in Z, other variables are said to be incompatible.The m − p constraints in Z are temporarily removed to leavea smaller constraint matrix with only p rows. To preserve feasibility,incompatible variables are also removed to form the reducedproblem. Since the p × p basis of the reduced problem is non<strong>de</strong>generate,the next pivot is automatically non-<strong>de</strong>generate. Theresulting reduced problem is solved to optimality over the compatiblevariables and the reduced costs are computed by means ofits dual variables. In Pan’s method, dual variables of LP correspondingto the m − p eliminated constraints are arbitrarily set tozero. Next, incompatible variables are consi<strong>de</strong>red. If such a variableis to become basic, some of the eliminated constraints mustbe reintroduced in the reduced problem. When compared to hisown implementation of the primal simplex algorithm, [7] reportsspeed-up factors of 4 to 5.In a column-generation framework (which can be seen as a PrimalSimplex approach), authors of [3] propose a Dynamic ConstraintAggregation (DCA) method for the solution of the linear relaxationof set partitioning problems. Consi<strong>de</strong>ring only the p non-zero basicvariables, the DCA method i<strong>de</strong>ntifies i<strong>de</strong>ntical rows (composedof zeros and ones) of the corresponding columns. In the constraintaggregation phase, a single constraint per row-group remains inthe reduced problem. Authors show that, once the reduced problemhas been solved, the dual variable of a kept constraint is equalto the sum of the dual variables of the corresponding row-group. Afull set of dual variables is recovered by distributing a<strong>de</strong>quately thevalues of the dual variables of the reduced problem. For set partitioningproblems, this is done by solving a shortest-path problem.These dual variables are used to price out the generated columns,allowing for up<strong>da</strong>tes of the constraint aggregation. On a set oflarge-scale bus driver scheduling problems, DCA reduces the solutiontime by a factor of more than 23 over the classical columngenerationmethod.The Improved Primal Simplex (IPS) method of [4] combines i<strong>de</strong>asfrom the reduced problem of Pan [7] and from DCA [3] to solvelinear programming problems. Consi<strong>de</strong>ring only the p non-zerobasic variables, IPS i<strong>de</strong>ntifies a set of p rows that are linearly in<strong>de</strong>pen<strong>de</strong>ntand removes from the reduced problem all the other m− pconstraints. As in [7], the reduced problem is solved using thecompatible variables only. Next, a complementary problem is constructe<strong>da</strong>nd solved to prove that the current solution of the reducedproblem is optimal for LP, otherwise it selects a set of incompatiblevariables to be introduced into the current reduced problem.Authors of [4] show that when the solution of the reduced prob-ALIO-EURO <strong>2011</strong> – 171


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>lem is not optimal for LP, the re-optimization after adding all theincompatible variables of the chosen set strictly <strong>de</strong>creases the objectivefunction value. In<strong>de</strong>ed, they show that in that case, thereexists a convex combination of the selected incompatible variablesthat is compatible with respect to the reduced problem, hence witha strictly positive step size.The complementary problem contains all the incompatible variablesand its coefficient matrix is created at the same time as thereduced problem. Both problems are built according to a modificationof the original constraint matrix. In<strong>de</strong>ed, this modified matrix(which is the result of an up<strong>da</strong>ted simplex tableau) is obtained bymultiplying A by the current inverse of the basis matrix. The complexityof computing this modified matrix for the i<strong>de</strong>ntification ofthe compatible variables is O(m 2 n). The computational results of[8] show that, on medium-sized instances (m ≈ 5000, n ≈ 25 000),IPS is faster than the primal simplex algorithm of CPLEX by factorsranging from 5 to 20. However, on large-scaled problems (m ≈100 000, n ≈ 450 000), constructing the reduced and the complementaryproblems is too costly compared to the Primal Simplexalgorithm itself.2. THE POSITIVE EDGE RULEAs in IPS, the Positive Edge rule gives priority to non-<strong>de</strong>generatepivots. However, compatible variables that form the reduced problemare i<strong>de</strong>ntified directly from the original constraint matrix Ainstead of from the modified matrix obtained by multiplying it bythe inverse of the basis. Determining which variables are compatibleis done in O(mn), i.e., O(m) for each variable, the samecomplexity as for the reduced cost computation of such a variable.Obviously, as in IPS, one might have to execute some <strong>de</strong>generatepivots to reach optimality.3. COMPUTATIONAL EXPERIMENTSPreliminary computational experiments ma<strong>de</strong> with CPLEX showits high potential. We <strong>de</strong>signed a simple algorithm using two externalprocedures: one i<strong>de</strong>ntifies variables that allow for non-<strong>de</strong>generatepivots while the other i<strong>de</strong>ntifies variables with negative reducedcost. These are sent to the Primal Simplex algorithm ofCPLEX. It has been tested on fourteen medium-sized aircraft fleetassignment instances (5000 constraints and 25 000 variables), twolarge-scale manpower planning problems (100 000 constraints and450 000 variables), and nine PDS instances from the Mittelmannlibrary. All these problems are highly <strong>de</strong>generate. On the firstgroup, our algorithm is 7.4 times faster than CPLEX on averageand the number of pivots is almost reduced by a factor 2. On thesecond and third groups, it is 50% faster and the number of pivotsis <strong>de</strong>creased by 2.4 and 3.6, respectively. It has also been tested onFome12 and Fome13 from the Mittelmann library. For these twohighly <strong>de</strong>nse problems, our simple implementation failed.The recent integration of the positive edge rule within the primalsimplex co<strong>de</strong> of COIN-OR prevents such cases by eliminating theexternal procedures and taking advantage of partial pricing strategies.Computational experiments show that the positive edge canhelp in solving difficult LPs in about half of the time required withthe Devex rule.4. REFERENCES[1] Bland, R. G. 1977. New Finite Pivoting Rules for the SimplexMethod. Mathematics of Operations Research, 2(2), 103–107.[2] Charnes, A. 1952. Optimality and Degeneracy in Linear Programming.Econometrica, 20, 160–170.[3] Elhallaoui, I., D. Villeneuve, F. Soumis, and G. Desaulniers.2005. Dynamic Aggregation of Set Partitioning Constraints inColumn Generation. Operations Research 53(4), 632–645.[4] Elhallaoui, I., A. Metrane, G. Desaulniers, and F. Soumis.2007. An Improved Primal Simplex Algorithm for DegenerateLinear Programs. Forthcoming: INFORMS Journal on Computing.[5] Forrest, J.J., and D. Goldfarb. 1992. Steepest-Edge SimplexAlgorithms for Linear Programming. Mathematical Programming57(3), 341-374.[6] Fuku<strong>da</strong>, K. 1982. Oriented Matroid Programming. Ph.D. Dissertation,University of Waterloo, Cana<strong>da</strong>.[7] Pan, P.-Q. 1998. A Basis Deficiency-Allowing Variation ofthe Simplex Method for Linear Programming. Computers andMathematics with Applications, 36(3), 33–53.[8] Raymond, V., F. Soumis, and D. Orban. 2010. An New Versionof the Improved Primal Simplex Algorithm for DegenerateLinear Programs. Computers and Operations Research,37(1), 91–98.[9] Ryan, D. M. and Osborne, M. 1988. On the Solution to HighlyDegenerate Linear Programmes. Mathematical Programming,41, 385–392.[10] Wolfe, P. 1963. A Technique for Resolving Degeneracy inLP. SIAM Journal, 11(2), 205–211.ALIO-EURO <strong>2011</strong> – 172


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>On the transition from fluence map optimization to fluence map <strong>de</strong>livery inintensity modulated radiation therapy treatment planningHumberto Rocha ∗ Joana M. Dias ∗ † Brígi<strong>da</strong> C. Ferreira ‡ §Maria do Carmo Lopes §∗ INESC-CoimbraRua Antero <strong>de</strong> Quental 199, 3000-033 Coimbra, Portugalhrocha@mat.uc.pt† <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> Economia, Universi<strong>da</strong><strong>de</strong> <strong>de</strong> CoimbraAv. Dias <strong>da</strong> Silva 165, 3004–512 Coimbra, Portugaljoana@fe.uc.pt‡ I3N, Departamento <strong>de</strong> Física, Universi<strong>da</strong><strong>de</strong> <strong>de</strong> AveiroCampus Universitário <strong>de</strong> Santiago, 3810–193 Aveiro, Portugalbrigi<strong>da</strong>@ua.pt§ Serviço <strong>de</strong> Física Médica, IPOC-FG, EPEAv. Bissaya Barreto 98, 3000–075 Coimbra, Portugalmclopes@ipocoimbra.min-sau<strong>de</strong>.ptABSTRACTThe intensity modulated radiation therapy (IMRT) treatment planningproblem is usually divi<strong>de</strong>d in three smaller problems that aresolved sequentially: geometry problem, intensity problem, and realizationproblem. There are many mo<strong>de</strong>ls and algorithms to addresseach of the problems satisfactorily. However, the last twoproblems can not be seen separately, because strong links existbetween them. In practice, the linkage between these problems isdone, most of the time, by rounding, which can lead to a significant<strong>de</strong>terioration of the treatment plan quality. We propose a combinatorialoptimization approach and use a binary genetic algorithmto enable an improved transition from optimized to <strong>de</strong>livery fluencemaps in IMRT treatment planning. A clinical example of ahead and neck cancer case is used to highlight the benefits of usinga combinatorial optimization approach when linking the intensityproblem and the realization problem.Keywords: Radiotherapy, IMRT, Fluence Map Optimization, CombinatorialOptimization1. INTRODUCTIONThe goal of radiation therapy is to <strong>de</strong>liver a dose of radiation to thecancerous region to sterilize the tumor minimizing the <strong>da</strong>mageson the surrounding healthy organs and tissues. In the inverse planningof radiation therapy, for a prescribed treatment plan, a correspon<strong>de</strong>ntset of parameters (beams and fluences) is algorithmicallycomputed in or<strong>de</strong>r to fulfil the prescribed doses and restrictions.Inverse treatment planning allows the mo<strong>de</strong>ling of highly complextreatment planning problems and optimization has a fun<strong>da</strong>mentalrole in the success of this procedure. An important type of inversetreatment planning is IMRT where the radiation beam is modulatedby a multileaf collimator (MLC) that enables the transformation ofthe beam into a grid of smaller beamlets of in<strong>de</strong>pen<strong>de</strong>nt intensities(see Figure 1). Despite the illustration of Figure 1, beamlets donot exist physically. Their existence is generated by the movementof the leaves of the MLC that block part of the beam during portionsof the <strong>de</strong>livery time. The MLC has movable leaves on bothsi<strong>de</strong>s that can be positioned at any beamlet grid boun<strong>da</strong>ry. In the“step and shoot mo<strong>de</strong>", consi<strong>de</strong>red here, the leaves are set to opena <strong>de</strong>sired aperture during each segment of the <strong>de</strong>livery and radiationis on for a specific fluence time or intensity. This proceduregenerates a discrete set (the set of chosen beam angles) of intensitymaps like in Figure 1.Figure 1: Illustration of a beamlet intensity map.A common way to solve the inverse planning in IMRT optimizationproblems is to use a beamlet-based approach. This approachleads to a large-scale programming problem with thousands ofvariables and hundreds of thousands of constraints. Due to thecomplexity of the whole optimization problem, many times thetreatment planning is divi<strong>de</strong>d into three smaller problems whichcan be solved separately: geometry problem, intensity problem,and realization problem. The geometry problem consists of findingthe minimum number of beams and corresponding directionsthat satisfy the treatment goals using optimization algorithms (see,e.g., [1]). After <strong>de</strong>ciding which beam angles should be used, apatient will be treated using an optimal plan obtained by solvingthe intensity problem - the problem of <strong>de</strong>termining the optimalbeamlet weights for the fixed beam angles. Many mathematicaloptimization mo<strong>de</strong>ls and algorithms have been proposed for theintensity problem, including linear mo<strong>de</strong>ls (e.g., [2]), mixed integerlinear mo<strong>de</strong>ls (e.g., [3]), nonlinear mo<strong>de</strong>ls (e.g., [4]), and multiobjectivemo<strong>de</strong>ls (e.g., [5]). After an acceptable set of intensityALIO-EURO <strong>2011</strong> – 173


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Structure Mean Dose Max Dose Prescribed DoseSpinal cord – 45 Gy –Brainstem – 54 Gy –Left parotid 26 Gy – –Right parotid 26 Gy – –PTV left – – 59.4 GyPTV right – – 50.4 GyBody – 70 Gy –Table 1: Prescription dose for the target volumes and tolerancedoses for the organs at risk.maps is produced, one must find a suitable way for <strong>de</strong>livery (realizationproblem). Typically, beamlet intensities are discretizedover a range of values (0 to 7, e.g.) and one of the many existingtechniques (see, e.g., [6]) is used to construct the apertures and intensitiesthat approximately match the intensity maps previously<strong>de</strong>termined.Most of the research published relates to each of the above problemsseparately. However, there is the need to link well the threeproblems. While the linkage between the geometry problem andthe intensity problem is straightforward, the linkage between theintensity problem and the realization problem is all but simple andmay lead to significant <strong>de</strong>terioration of plan quality.The outcome of the intensity problem is a set of optimal fluencemaps (one for each fixed beam) that can be represented by real matriceswhose entries correspond to each beamlet intensity. Thesematrices, solutions of the intensity problem, cannot be directly implemented,because of hardware constraints. The matrices have tobe transformed to accommo<strong>da</strong>te hardware settings, with a resulting<strong>de</strong>gra<strong>da</strong>tion of the plan quality. The process of converting anoptimal fluence map into a set of MLC segments is called segmentation.Segmentation needs to receive as input integer matrices,that are obtained by the discretization of each beamlet intensityover a range of values. This discretization, typically done by simplerounding of the optimized beamlets, is one of the main causesfor <strong>de</strong>terioration of plan quality. This subject is poorly documentedin literature (see [7], e.g.) and the general i<strong>de</strong>a transmitted is that<strong>de</strong>terioration is mainly caused by segmentation issues. The lack ofcriteria to increase or reduce a beamlet intensity should be avoi<strong>de</strong>dsince all optimization effort is jeopardized by doing so. Using aclinical example of a head and neck cancer case, numerical evi<strong>de</strong>nceof the resulting <strong>de</strong>terioration of plan quality is presentednext.2. ILLUSTRATION OF PLAN QUALITYDETERIORATION USING A HEAD & NECK CLINICALEXAMPLEA clinical example of a head and neck case is used to verify the<strong>de</strong>terioration caused by the rounding of the optimal fluence maps.In general, the head and neck region is a complex area to treat withradiotherapy due to the large number of sensitive organs in this region(e.g. eyes, mandible, larynx, oral cavity, etc.). For simplicity,in this study, the OARs used for treatment optimization were limitedto the spinal cord, the brainstem and the parotid glands. Thetumor to be treated plus some safety margins is called planningtarget volume (PTV). For the head and neck case in study it wasseparated in two parts: PTV left and PTV right (see Figure 2). Theprescribed doses for all the structures consi<strong>de</strong>red in the optimizationare presented in Table 1.In or<strong>de</strong>r to facilitate convenient access, visualization and analysisof patient treatment planning <strong>da</strong>ta, the computational tools <strong>de</strong>velopedwithin Matlab [8] and CERR [9] (computational environmentfor radiotherapy research) were used as the main software platformFigure 2: Structures consi<strong>de</strong>red in the IMRT optimization visualizedin CERR.Level level intensity beamlet intensity range0 0.0000 [0.0000 ; 1.2857)1 2.5714 [1.2857 ; 3.8571)2 5.1429 [3.8571 ; 6.4286)3 7.7143 [6.4286 ; 9.0000)4 10.285 [9.0000 ; 11.571)5 12.857 [11.571 ; 14.142)6 15.428 [14.142 ; 16.714)7 18.000 [16.714 ; 18.000]Table 2: Beamlet distribution to correspon<strong>de</strong>nt intensity level for7 levels.to embody our optimization research and provi<strong>de</strong> the necessarydosimetry <strong>da</strong>ta to perform optimization in IMRT.A linear mo<strong>de</strong>l was used to perform IMRT optimization on thiscase [7]. Our tests were performed on a 2.66Ghz Intel Core DuoPC with 3 GB RAM. We used CERR 3.2.2 version and MATLAB7.4.0 (R2007a). The dose was computed using CERR’s pencilbeam algorithm (QIB) with seven equispaced beams in a coplanararrangement, with angles 0 o , 51 o , 103 o , 154 o , 206 o , 257 o and309 o , and with 0 o collimator angle. To address the linear problemwe used one of the most effective commercial tools to solve largescale linear programs – Cplex[10]. We used a barrier algorithm(baropt solver of Cplex 10.0) to tackle our linear problem.In or<strong>de</strong>r to acknowledge the <strong>de</strong>gree of plan quality <strong>de</strong>terioration,results obtained for the optimal fluence maps were compared withthe fluence maps obtained after rounding optimal intensities using7 levels and 5 levels. In Tables 2 and 3 we have the beamlet intensityrange for each intensity level. By <strong>de</strong>creasing the numberof levels, the segmentation problem will be simplified resulting inmore efficient <strong>de</strong>livery. However, by <strong>de</strong>creasing the number of levelsthe beamlet intensity range will increase, potentiating a moreexpressive <strong>de</strong>terioration of results. In the best case scenario forboth levels there are no differences between the optimal intensitiesand the roun<strong>de</strong>d intensities. However, for the worst case scenario,for 7 levels the difference between the optimal and the roun<strong>de</strong>d intensityfor each beamlet is 1.2857 and for 5 levels that differenceis 1.8.The quality of the results can be perceived consi<strong>de</strong>ring a variety ofmetrics and can change from patient to patient. Typically, resultsare judged by their cumulative dose-volume histogram (DVH).The DVH displays the fraction of a structure’s volume that receivesat least a given dose. An i<strong>de</strong>al DVH for the tumor wouldpresent 100% volume for all dose values ranging from zero to theprescribed dose value and then drop immediately to zero, indicatingthat the whole target volume is treated exactly as prescribed.I<strong>de</strong>ally, the curves for the organs at risk would instead drop immediatelyto zero, meaning that no volume receives radiation.ALIO-EURO <strong>2011</strong> – 174


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Level level intensity beamlet intensity range0 0.0000 [0.0000 ; 1.8000)1 3.6000 [1.8000 ; 5.4000)2 7.2000 [5.4000 ; 9.0000)3 10.800 [9.0000 ; 12.600)4 14.400 [12.600 ; 16.200)5 18.000 [16.200 ; 18.000]Table 3: Beamlet distribution to correspon<strong>de</strong>nt intensity level for5 levels.In Figure 3, DVH curves for tumor volumes and parotids are presentedfor optimal fluences obtained by the linear mo<strong>de</strong>l and forthe roun<strong>de</strong>d optimal intensities when using 5 and 7 levels. DVHcurves for OARs other than parotids only suffer residual changeswith the rounding procedure. By simple inspection of Figure 3 wecan observe the <strong>de</strong>terioration of the results from the transition ofthe optimal fluence maps to the roun<strong>de</strong>d ones. That <strong>de</strong>teriorationaffect mostly the PTVs and is aggravated when fewer levels areconsi<strong>de</strong>red, i.e., when faster <strong>de</strong>livery is aimed.Another metric usually used consi<strong>de</strong>rs prescribed dose that 95%of the volume of the PTV receives (D 95 ). Typically, 95% of theprescribed dose is required. D 95 is represented in Figure 3 with anasterisk and we can observe that the roun<strong>de</strong>d fluences fail to meetthat quality criteria. Note that no segmentation was done and theobserved <strong>de</strong>terioration is exclusively caused by the rounding of theoptimal fluence maps.(a)3. COMBINATORIAL OPTIMIZATION APPROACHAfter obtaining an optimal fluence map, and <strong>de</strong>fining the numberof levels of intensity to consi<strong>de</strong>r, we need to <strong>de</strong>ci<strong>de</strong> to whichlevel of intensity each beamlet should be assigned to. The typicalapproach is to <strong>de</strong>ci<strong>de</strong> based on smaller distance and assign thelevel intensity closer to the optimal fluence (rounding). However,that <strong>de</strong>cision criteria can put two beamlets with very close optimalintensities in distinct levels of intensity. Moreover, in sucha complex large-scale optimization process, with such inter<strong>de</strong>pen<strong>de</strong>ncebetween beamlet intensity values, increasing or reducing theintensity of a beamlet should not be based on distance to closest intensitylevel. An alternative <strong>de</strong>cision criteria is to <strong>de</strong>ci<strong>de</strong> betweenthe two boun<strong>da</strong>ry levels of the optimal beamlet intensity, based ona dose-volume response, rather than on a distance criteria.The combinatorial optimization problem of <strong>de</strong>ciding, based on adose-volume criteria, to which neighbor intensity level a beamletintensity should be assigned to, can be stated as a binary optimizationproblem. Let x opt <strong>de</strong>note the vector of the optimal beamletintensities obtained in the end of the intensity problem. Let x round<strong>de</strong>note the vector of the usual roun<strong>de</strong>d intensities and let x trunc<strong>de</strong>note the vector of the truncated intensities, i.e., the vector ofthe intensities corresponding to the smaller intensity value of thetwo neighbor level intensities. The difference of intensity levelsbetween x round and x trunc is a binary vector, where each 1 representsa choice of an upper level of intensity, and each 0 represent achoice of an un<strong>de</strong>r intensity level. The combinatorial optimizationproblem can be stated asmin f (x)x binary,where f (x) is a penalty function of the distances between the DVHcurves for x opt and DVH curves for x trunc + x × (levels range).This formulation originates a large combinatorial optimization problem.For the head and neck problem introduced in the previoussection, the number of beamlets is 1613, which means we have2 1613 = 3.6423E485 possibilities to consi<strong>de</strong>r. The magnitu<strong>de</strong> of(b)Figure 3: DVH for i<strong>de</strong>al optimal fluence using LP vs 7 and 5 levelfor right PTV and PRT – (a) and DVH for roun<strong>de</strong>d fluence usingLP vs 7 and 5 level for left PTV and PRT– (b).those numbers implies that both an exhaustive approach and anexact approach (branch and bound) are inviable.There exists a number of heuristics to address successfully thisproblem. Here, we used a tailored version of binary genetic algorithms(using the Genetic Algorithm Optimization Toolbox ofMATLAB).In Figure 4, DVH curves for PTV’s and parotids are presented foroptimal fluences obtained by the linear mo<strong>de</strong>l, for the roun<strong>de</strong>d optimalfluences using 7 levels of intensity, and the fluences obtainedby the resolution of the combinatorial optimization problem usingthe binary genetic algorithm. Again, DVH curves for OARsother than parotids only suffer residual changes. Even for parotids,DVH differences are not significative. However, looking at theDVH curves for PTV’s we can see the benefit of the combinatorialoptimization approach on the improvement of the roun<strong>de</strong>d solution.That improvement is particularly notorious in Figure 4(a),for DVH curve of PTV right, since the DVH curve of the roun<strong>de</strong>dLP fluences failed to meet the criteria of having 95% of the volumeof the PTV receiving 95% of the prescribed dose. Not onlythe DVH curves of the optimal CO fluences for PTV right meetthat criteria (DVH curve is over the asterisk) but they are almost asgood as the DVH curves for the optimal LP fluences. The benefitof the combinatorial optimization approach on the improvement ofthe roun<strong>de</strong>d solution is amplified when using less intensity levels.ALIO-EURO <strong>2011</strong> – 175


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Hybrid large neighborhood search for the dial-a-ri<strong>de</strong> problemSophie N. Parragh ∗ Verena Schmid †∗ INESC Porto / IBM CAS PortugalRua Dr. Roberto Frias, 378, 4200-465 Porto (Portugal)sophie.parragh@inescporto.pt† Department of Business Administration, University of ViennaBruenner Strasse 72, 1210 Vienna (Austria)verena.schmid@univie.ac.atABSTRACTDemographic change towards an ever aging population entails anincreasing <strong>de</strong>mand for specialized transportation systems to complimentthe traditional public means of transportation. Typically,users place transportation requests specifying a pickup and a dropoff location and a fleet of minibuses or taxis is used to serve theserequests. Those systems are usually referred to as <strong>de</strong>mand responsivetransportation systems. The un<strong>de</strong>rlying optimization problemcan be mo<strong>de</strong>led in terms of a dial-a-ri<strong>de</strong> problem. In the dial-ari<strong>de</strong>problem consi<strong>de</strong>red in this article, total routing costs are minimizedwhile respecting time window, maximum user ri<strong>de</strong> time,maximum route duration, and vehicle capacity restrictions. Wepropose a hybrid large neighborhood search algorithm and comparedifferent hybridization strategies on a set of benchmark instancesfrom the literature.Keywords: Dial-a-ri<strong>de</strong>, Large neighborhood search, Hybrid1. INTRODUCTIONDemand responsive transportation services are requested, e.g. inremote rural areas, where no general public transportation systemsexist, as a complementary service to available public transportationsystems for the el<strong>de</strong>rly or disabled, or in the area of patienttransportation to and from hospitals or other medical facilities.All these services involve the transportation of persons who placetransportation requests, specifying an origin and a <strong>de</strong>stination location.The un<strong>de</strong>rlying optimization problem is usually mo<strong>de</strong>ledin terms of a dial-a-ri<strong>de</strong> problem (DARP). The field of dial-ari<strong>de</strong>problems has received consi<strong>de</strong>rable attention in the literature.However, due to the application oriented character of this problem,the objectives consi<strong>de</strong>red as well as the constraints imposed varyconsi<strong>de</strong>rably. Rather recent surveys covering dial-a-ri<strong>de</strong> problemsand <strong>de</strong>mand responsive transportation are due to Cor<strong>de</strong>au and Laporte[1] and Parragh et al. [2].In the DARP consi<strong>de</strong>red in this article, the objective correspondsto the minimization of the total routing costs. A homogeneousfleet of vehicles of size m has to serve a given set of transportationrequests n. These are all known in advance of the planning. Inthe following we will refer to the origin or pickup location of arequest i by i, and to its <strong>de</strong>stination or drop off location by n + i.Users specify time windows for either the origin or the <strong>de</strong>stination.In a addition, maximum user ri<strong>de</strong> times, route duration limits,and vehicle capacity constraints have to be consi<strong>de</strong>red in the planning.This version of the DARP has been consi<strong>de</strong>red by Cor<strong>de</strong>auand Laporte [3], who propose a tabu search heuristic and a set of20 benchmark instances, and by Parragh et al. [4], who <strong>de</strong>velop acompetitive variable neighborhood search heuristic. A formal <strong>de</strong>finitionof the problem can be found in [5], where a branch-a-cutalgorithm is proposed that solves instances with up to 36 requests.In recent years, the field of hybrid metaheuristics, and matheuristicsin particular, has received more and more attention [6, 7]. Inthe field of vehicle routing, metaheuristic and column generationhybrids have shown to be especially successful: Prescott-Gagnonet al. [8] propose a branch-and-price based large neighborhoodsearch algorithm for the vehicle routing problem with time windows;heuristic <strong>de</strong>stroy operators are complimented by a branchand-pricebased repair algorithm. Muter et al. [9], on the otherhand, propose a hybrid tabu search heuristic, where the columnpool is filled with feasible routes i<strong>de</strong>ntified by the tabu search. Thesearch is then gui<strong>de</strong>d by the current best lower and upper bound;the current best lower bound is obtained from solving the linearrelaxation of a set covering type formulation on the current columnpool; the current best upper bound is computed by imposingintegrality on the <strong>de</strong>cision variables. The resulting method isalso tested on benchmark instances for the vehicle routing problemwith time windows.Given its success for several vehicle routing problems [10], and thepickup and <strong>de</strong>livery problem with time windows in particular [11],we investigate and compare different hybridization strategies oflarge neighborhood search (LNS) and column generation (CG) inthe context of the DARP.2. SOLUTION METHODIn the following we first <strong>de</strong>scribe LNS and CG. Thereafter, weintroduce different hybridization schemes.2.1. Large neighborhood searchLNS has been introduced by Shaw [12]. Its principle is relativelysimple: in each iteration the incumbent solution is partially <strong>de</strong>stroye<strong>da</strong>nd then it is repaired again; that is, first a given numberof elements are removed and these elements are then reinserted.Every time these operations lead to an improved solution, the newsolution replaces the incumbent solution, otherwise it is discar<strong>de</strong>d.Ropke and Pisinger [11] propose to use a number of different <strong>de</strong>stroyand repair operators; in this article, we use the following(they are all based on [11]): random removal, worst removal, relatedremoval, greedy insertion, and k-regret insertion.Before a removal operator is applied to the incumbent solution,the number of requests to be removed q has to be <strong>de</strong>termined. Inour case, in each iteration, q is chosen randomly between 0.1nand 0.5n. Then, one of the <strong>de</strong>stroy operators is randomly selected.The random removal operator randomly removes q re-ALIO-EURO <strong>2011</strong> – 177


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>quests from the incumbent solution. The worst removal operatorrandomly removes requests while biasing the selection towardsrequests whose removal would improve the objective functionvalue the most. Finally, the related removal operator removesrelated requests. Two requests i and j are said to be related if(|B i − B j | + |B n+i − B n+ j | +t i j +t n+i,n+ j ) is small; t i j <strong>de</strong>notes thedistance between location i and j; and B i the beginning of serviceat i.In a next step, a solution that has been partially <strong>de</strong>stroyed is repaire<strong>da</strong>gain. We randomly choose a repair operator among greedyinsertion, 2-regret insertion, 3-regret insertion, 4-regret insertionand m-regret insertion; see [11] for further <strong>de</strong>tails.In or<strong>de</strong>r to further diversify the search we allow solutions that <strong>de</strong>terioratethe incumbent solution by at most 3% to be accepted witha probability of 1%. In or<strong>de</strong>r to facilitate switching between LNSand exact components, in a first step, we <strong>de</strong>ci<strong>de</strong>d to refrain fromusing more sophisticated acceptance schemes.Furthermore, following the findings of [11], in each iteration, werandomly choose if the selected repair operator is used in its <strong>de</strong>terministicor in its randomized version. If the randomized version isselected, every time the evaluation function is called, it randomlychooses a noise factor in [0.5,1.5] and multiplies the original insertioncosts by it.Finally, like in [4], every time a new solution is generated and itis at most 5% worse than the current best solution, the new solutionun<strong>de</strong>rgoes local search based improvement; we refer to [4] for<strong>de</strong>tails on this procedure.2.2. Column generationIn or<strong>de</strong>r to use column generation based components, we formulatethe DARP in terms of a set covering problem. Let Ω <strong>de</strong>notethe set of feasible routes and let P <strong>de</strong>note the set of requests. Theparameter m <strong>de</strong>notes the number of available vehicles. For eachroute ω ∈ Ω, let c ω be the cost of the route and let the constantb iω represent the number of times vertex i ∈ P is traversed by ω.Binary variable y ω takes value 1 if and only if route ω is used inthe solution. The problem can thus be formulated as the followingset-covering problem (SCP):subject toReplacing (4) by,min ∑ c ω y ω (1)ω∈Ω∑ b iω y ω ≥ 1 ∀i ∈ P, (2)ω∈Ω∑ y ω ≤ m, (3)ω∈Ωy ω ∈ {0,1} ∀ω ∈ Ω. (4)y ω ≥ 0 ∀ω ∈ Ω, (5)we obtain the linear relaxation of SCP <strong>de</strong>noted as LSCP.Due to the large size of Ω, LSCP cannot be solved directly. Instead,a restricted version of LSCP, <strong>de</strong>noted as RLSCP, consi<strong>de</strong>ringonly a small subset of columns Ω ′ ⊂ Ω, is solved. Usually, the setΩ ′ is generated using column generation. In column generation,in each iteration, the column or route that is associated with thesmallest negative reduced cost value is searched. The accordingproblem is usually referred to as the subproblem whereas RLSCPis <strong>de</strong>noted as the master problem. In our case, the reduced cost ofa given column is computed as follows:¯c ω = c ω − ∑ b iω π i − σ, (6)i∈Pwhere π i <strong>de</strong>notes the dual variable associated with constraint (2)for in<strong>de</strong>x i, and σ the dual variable associated with constraint (3).The solution of the master and the subproblem are iterated until nomore negative reduced cost columns can be found. In this case, theoptimal solution of LSCP has been found. The column generationconcept can also be exploited in a heuristic way. In the following,we <strong>de</strong>scribe how we intend to use it.2.3. Hybridization schemesWe investigate three hybridization schemes. In all schemes, allfeasible routes i<strong>de</strong>ntified by LNS are ad<strong>de</strong>d to the common columnpool Ω ′ . We apply the following column pool management. In thecase where a column already exists, the new column replaces theold one if the new column is associated with lower routing costs;otherwise, the old column is kept and the new column is discar<strong>de</strong>d.In the first hybridization scheme (<strong>de</strong>noted as h1), in addition to theabove <strong>de</strong>scribed <strong>de</strong>stroy and repair operators, a <strong>de</strong>stroy and a repairoperator taking into account dual information are introduced.Before executing either of the two operators, the RLSCP is solvedon the current column pool. The <strong>de</strong>stroy operator works in a similarway as the worst removal operator; but instead of the differencein cost, selection is biased towards requests with a high π i value.The repair operator also uses this i<strong>de</strong>a. It sequentially inserts allcurrently not routed requests or<strong>de</strong>red with respect to their π i values,at the the best possible position.In the second hybridization scheme (<strong>de</strong>noted as h2) we followi<strong>de</strong>as of [13]. Every 1000 iterations we interrupt the LNS and wesolve RSCP, that is the restricted integer set covering problem, onthe current column pool. Since we solve a set covering problemand not a set partitioning problem, requests might appear on morethan only one route. In this case, duplicated requests are sequentiallyremoved in a greedy way and LNS resumes the search fromthis solution.The third hybridization scheme (<strong>de</strong>noted as h3) is always combinedwith h2. Here we propose to use additional heuristic columngenerators that take into account dual information to populatethe column pool. These column generators are called in thesame frequency as RSCP. Since they use dual information in or<strong>de</strong>rto find routes of negative reduced cost, RLSCP has to be solve<strong>da</strong>s well. We use several different column generators. The simplestone works in a similar way as the new repair operator and generatesa new route from scratch. A second more sophisticated generatoruses i<strong>de</strong>as from variable neighborhood search [14]. It consi<strong>de</strong>rsone route at a time. The size of a neighborhood is <strong>de</strong>fine<strong>da</strong>s the percentage share of requests that is removed, inserted from,or swapped with requests currently not on this route. It starts fromeach route part of the current optimal solution of RLSCP; in addition,it consi<strong>de</strong>rs the new route that was generated from scratch,an empty route, and a randomly generated route. The third columngenerator uses i<strong>de</strong>as from LNS and works in a similar way as theLNS based column generator introduced in [15]. It is only calledif the first two column generators are not able to find columns ofnegative reduced cost.In Algorithm 1, we outline the LNS and the different hybridizationschemes.3. PRELIMINARY RESULTSThe algorithm was implemented in C++ and for the solution of theset covering problems CPLEX 12.1 together with Concert Technology2.9 was used. All tests were carried out on a Xeon CPU atALIO-EURO <strong>2011</strong> – 178


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Algorithm 1: LNS hybridization schemes1: generate a feasible starting solution s2: s best := s3: initialize column pool Ω ′4: repeat5: randomly choose a <strong>de</strong>stroy H d and a repair heuristic H r6: [h1:also choose from heuristics based on dual information]7: apply first H d and then H r to s yielding s ′8: add columns to Ω ′9: if s ′ is better than s best then10: s best := s ′11: s := s ′12: else if s ′ meets the acceptance criteria then13: set s := s ′14: end if15: [h3: every 1000 iterations solve RLSCP on Ω ′16: generate columns by heuristic column generators17: add columns to Ω ′ ]18: [h2: every 1000 iterations solve RSCP on Ω ′ yielding s ∗19: s best := s ∗20: s := s ∗ ]21: until some stopping criterion is met22: return s best2.67 GHz with 24GB of RAM (shared with 7 other CPUs). Inthe following we <strong>de</strong>scribe the test <strong>da</strong>ta set and then the resultsobtained. Note that the results reported in this article, althoughpromising, are of preliminary nature. Additional parameter tuningtests and improvements of the implementation are still necessary.3.1. Test instancesCor<strong>de</strong>au and Laporte [3] proposed a <strong>da</strong>ta set of 20 randomly generatedinstances. They contain 24 to 144 requests. In each instance,the first n/2 requests have a time window on the <strong>de</strong>stination, whilethe remaining n/2 instances have a time window on the origin. Foreach vertex a service time d i = 10 was set and the number of personstransported per request was set to 1. Routing costs and traveltimes from a vertex i to a vertex j are equal to the Eucli<strong>de</strong>an distancebetween these two vertices. The instances are split into twoparts. In the first 10 instances, narrow time windows are consi<strong>de</strong>red.For the second 10 instances, wi<strong>de</strong>r time windows are given.The route duration limit was set to 480, the vehicle capacity to 6,and the maximum ri<strong>de</strong> time to 90, in all instances.3.2. Comparison of the different hybridization schemesWe test all of the different hybridization strategies and benchmarkthem against pure LNS as <strong>de</strong>scribed above. In all experiments,LNS is run for 25000 iterations. In setting LNS + h1, in additionto the heuristic <strong>de</strong>stroy and repair operators, the dual informationbased ones are used. In setting LNS + h2, every 1000 iterationsRSCP is solved. In setting LNS + h1 + h2, both of the above hybridizationschemes are used and in setting LNS + h1 + h2 + h3,all three hybridization schemes are employed.In Table 1, the different results on a per instance level are given.The best values per column and instance are marked in bold. Theresults displayed are average values over five random runs per instance.Comparing LNS and LNS+h1, it is not clear if the dual informationbased operators, proposed in h1, are contributing to thesearch in a positive way; purely heuristic LNS seems to be the betteroption. Hybridization scheme h2, on the other hand, <strong>de</strong>finitelyhas a positive impact on the overall performance; it obtains mostof the per instance average best results. On a total average level, itties with LNS+h1+h2. This seems to indicate that in combinationwith h2, h1 does not have a negative impact on the overall performanceof the method. It even has a slightly positive impact in thecase of the largest instances of the first half of the <strong>da</strong>ta set. Lastbut not least, we also tested a combination of all three hybridizationschemes. In the table it is <strong>de</strong>noted as LNS+h1+h2+h3. Thetotal average value associated with this method is comparable tothe total average values of LNS + h2 and LNS + h1 + h2. On a perinstance level, it obtains the best per instance average results forthe largest instances in the second part of the <strong>da</strong>ta set.In comparison to the variable neighborhood search proposed in [4],both solution quality as well as run times are comparable. The<strong>de</strong>viations of the latter three settings from the average results ofthe variable neighborhood search are less then 0.5% on average.So far, we were able to improve one best known solution.4. CONCLUSIONS AND OUTLOOKAs noted above, the presented results are of very preliminary character.They do, however, indicate that the different hybridizationschemes have a positive impact on the overall performanceof the solution method. At the moment it seems as if hybridizationscheme h2 increases the performance the most; but also the othertwo schemes show positive potential.Further tests with different parameter settings are still nee<strong>de</strong>d inor<strong>de</strong>r to fully un<strong>de</strong>rstand the interplay between the heuristic andthe different column generation based components. In addition,an a<strong>da</strong>ptive layer as in [11] and a simulated annealing [16] base<strong>da</strong>cceptance scheme should be incorporated into the LNS, to furtherimprove its performance as a stand alone method. Finally,using i<strong>de</strong>as of [9], we also plan to investigate different guidingmechanisms that allow the search to switch between the differentcomponents as nee<strong>de</strong>d.5. ACKNOWLEDGMENTSWe wish to thank the Austrian Research Promotion Agency (FFG,grant #826151, program IV2Splus) for sponsoring this work.6. REFERENCES[1] J.-F. Cor<strong>de</strong>au and G. Laporte, “The dial-a-ri<strong>de</strong> problem:Mo<strong>de</strong>ls and algorithms,” Ann Oper Res, vol. 153, pp. 29–46,2007.[2] S. N. Parragh, K. F. Doerner, and R. F. Hartl, “Demand responsivetransportation,” Wiley Encyclopedia of OperationsResearch and the Management Sciences, to appear.[3] J.-F. Cor<strong>de</strong>au and G. Laporte, “A tabu search heuristic forthe static multi-vehicle dial-a-ri<strong>de</strong> problem.” Transport ResB-Meth, vol. 37, pp. 579–594, 2003.[4] S. N. Parragh, K. F. Doerner, and R. F. Hartl, “Variable neighborhoodsearch for the dial-a-ri<strong>de</strong> problem,” Comput OperRes, vol. 37, pp. 1129–1138, 2010.[5] J.-F. Cor<strong>de</strong>au, “A branch-and-cut algorithm for the dial-ari<strong>de</strong>problem.” Oper Res, vol. 54, pp. 573–586, 2006.[6] G. R. Raidl, J. Puchinger, and C. Blum, “Metaheuristic hybrids,”in Handbook of Metaheuristics, 2nd edition, M. Gendreauand J. Y. Potvin, Eds. Springer, 2010, pp. 469–496.[7] C. Blum, M. J. Blesa Aguilera, A. Roli, and M. Sampels,Eds., Hybrid Metaheuristics, ser. Studies in ComputationalIntelligence, vol. 114. Berlin: Springer, 2008.ALIO-EURO <strong>2011</strong> – 179


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>m n LNS LNS + h1 LNS + h2 LNS + h1 + h2 LNS + h1 + h2 + h3R1a 3 24 190.02 190.02 190.02 190.02 190.02R2a 5 48 303.59 304.64 303.34 303.92 302.50R3a 7 72 544.88 542.84 537.89 538.96 540.32R4a 9 96 584.02 590.71 573.78 582.48 585.35R5a 11 120 651.58 650.67 639.74 643.74 641.36R6a 13 144 814.64 823.15 813.30 801.02 804.66R7a 4 36 294.38 295.56 294.02 294.46 294.37R8a 6 72 510.32 505.35 496.73 499.72 497.36R9a 8 108 695.68 695.34 679.58 677.98 682.97R10a 10 144 896.93 905.36 883.48 881.50 885.09R1b 3 24 166.58 166.19 165.79 166.19 165.52R2b 5 48 302.51 303.58 299.16 296.71 299.61R3b 7 72 498.85 501.18 492.78 493.41 495.70R4b 9 96 553.73 553.83 539.31 543.97 541.97R5b 11 120 598.52 603.59 591.59 591.97 589.77R6b 13 144 764.67 768.47 752.40 750.61 749.65R7b 4 36 249.06 249.58 248.72 248.72 248.72R8b 6 72 474.68 475.40 473.36 469.93 469.99R9b 8 108 627.53 623.69 612.20 616.62 614.67R10b 10 144 833.88 833.56 821.44 816.14 813.42Avg 527.80 529.13 520.43 520.40 520.65Table 1: Comparison of the different hybridization schemes[8] E. Prescott-Gagnon, G. Desaulniers, and L.-M. Rousseau,“A branch-and-price-based large neighborhood search algorithmfor the vehicle routing problem with time windows,”Networks, vol. 54, no. 4, pp. 190–204, 2009.[9] I. Muter, S. I. Birbil, and G. Sahin, “Combination of metaheuristicand exact algorithms for solving set coveringtypeoptimization problems,” INFORMS J Comput, 2010,published online before print March 23, 2010, DOI:10.1287/ijoc.1090.0376.[10] D. Pisinger and S. Ropke, “Large neighborhood search,” inHandbook of Metaheuristics, 2nd edition, M. Gendreau andJ.-Y. Potvin, Eds. Springer, 2010, pp. 399–419.[11] S. Ropke and D. Pisinger, “An a<strong>da</strong>ptive large neighborhoodsearch heuristic for the pickup and <strong>de</strong>livery problem withtime windows.” Transport Sci, vol. 40, pp. 455–472, 2006.[12] P. Shaw, “Using constraint programming and local searchmethods to solve vehicle routing problems.” in <strong>Proceedings</strong>CP-98 (Fourth International Conference on Principles andPractice of Constraint Programming), 1998.[13] S. Pirkwieser and G. Raidl, “Boosting a variable neighborhoodsearch for the periodic vehicle routing problem withtime windows by ILP techniques,” in Preprints of the 8thMetaheuristic International Conference (MIC 2009), Hamburg,Germany, 2009.[14] N. Mla<strong>de</strong>novic and P. Hansen, “Variable neighborhoodsearch.” Comput Oper Res, vol. 24, pp. 1097–1100, 1997.[15] S. Ropke and J.-F. Cor<strong>de</strong>au, “Branch-and-cut-and-price forthe pickup and <strong>de</strong>livery problem with time windows,” TransportSci, vol. 43, pp. 267–286, 2009.[16] S. Kirkpatrick, C. D. Gelatt Jr., and M. P. Vecchi, “Optimizationby simulated annealing,” Science, vol. 220, pp. 671–680,1983.ALIO-EURO <strong>2011</strong> – 180


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>An integer programming approach for elective surgery scheduling in a LisbonhospitalInês Marques ∗ † Maria Eugénia Captivo ∗ ‡ Margari<strong>da</strong> Vaz Pato ∗ §∗ Centro <strong>de</strong> Investigação Operacional, <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> Ciências, Universi<strong>da</strong><strong>de</strong> <strong>de</strong> LisboaBloco C6, Piso 4, sala 6.4.16, Campo Gran<strong>de</strong>, 1749-016 Lisboa, Portugal† Universi<strong>da</strong><strong>de</strong> Lusófona <strong>de</strong> Humani<strong>da</strong><strong>de</strong>s e TecnologiasFCTS/FEG, Campo Gran<strong>de</strong>, 376, 1749-024 Lisboa, Portugalines.marques@fc.ul.pt‡ Universi<strong>da</strong><strong>de</strong> <strong>de</strong> Lisboa, <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> CiênciasDEIO, Edifício C6 - Piso 4, Campo Gran<strong>de</strong>, 1749-016 Lisboa, Portugalmecaptivo@fc.ul.pt§ Instituto Superior <strong>de</strong> Economia e Gestão, Universi<strong>da</strong><strong>de</strong> Técnica <strong>de</strong> LisboaDept. Matemática, ISEG, Rua do Quelhas, 6, 1200-781 Lisboa, Portugalmpato@iseg.utl.ptABSTRACTElective surgery planning is an important problem for any hospital.In particular, in Portugal, this problem reaches a level ofgreat importance as it has direct relation with an efficient use ofthe operating theater, which also results on reducing waiting listsfor surgery. Thus, a better surgical suite planning has economicand social impact. Both outcomes appear as gui<strong>de</strong>lines of the PortugueseNational Health Plan for 2004-2010. The authors presentan integer linear programming mo<strong>de</strong>l approach <strong>de</strong>veloped to addressthe elective surgery planning problem of a hospital in Lisbon,as well as results obtained with real <strong>da</strong>ta from the hospital. The resultsare analyzed in view of the impact on productivity indicatorsof the surgical suite and, as a consequence, on the hospital’s waitinglist for surgery.Keywords: Health Care, Operating rooms, Elective case scheduling,Integer Programming1. INTRODUCTIONThe health sector has been progressively affected by increasinglyrestrictive budgets that not only call for an urgent need to promotea resource rationalization practice among hospitals but, above all,the <strong>de</strong>mand for greater efficiency in the use of resources and in theperformance of each service. The surgical suite is wi<strong>de</strong>ly regar<strong>de</strong><strong>da</strong>s hospital’s central engine as it has a direct impact in many otherhospital <strong>de</strong>partments, such as surgical wards and recovery units[1, 2]. As such, it is <strong>de</strong>emed a priority to improve the efficiency ofthis component. On the other hand, improvement of the surgicalsuite’s efficiency may lead to increased productivity, in terms ofthe number of surgeries un<strong>de</strong>rtaken, thus contributing to a reductionin surgery waiting lists. Costs involved in keeping a patienton the waiting list for surgery are high, at both prevention andmaintenance levels, even more so as consi<strong>de</strong>ring the user’s qualityof live. In addition, according to Portugal’s General Direction ofHealth [3], reducing surgery waiting list is one of the priorities ofthe National Health Service (SNS). Cutting down waiting lists forsurgery is beneficial in many respects, at human and scientific, aswell as economic levels.This work focuses on a general, central and university hospital inLisbon, incorporated within the Portuguese National Health Service.It has no maternity or outpatient emergency service andperforms about 5 000 surgeries per year. The hospital has fivesurgical specialties. Its surgical suite has six operating theaters,one of which is reserved for ambulatory surgeries. Although allrooms in the surgical suite are equipped with the same basic equipment,the practice of this hospital is to <strong>da</strong>ily assign the rooms forconventional surgeries to surgical specialties. Between two surgeriesperformed in the same room, cleaning and disinfecting protocols,performed by auxiliary staff and taking about 30 minutesmust take place. Each operating room has a fixed and permanentnursing team assigned throughout the surgical suite’s regular time.Each patient is assigned to a surgeon at waiting list booking timeand, therefore, when planning, patient and surgeons are alreadyassigned. Currently, the surgical suite’s regular work schedule isbetween 8 am and 8 pm, from Mon<strong>da</strong>y to Fri<strong>da</strong>y. Surgery planningis performed on a weekly base and is finalized on Fri<strong>da</strong>y forthe following week. The problem consi<strong>de</strong>rs <strong>da</strong>ily and weekly operatingtime limits for each surgeon, different priorities related tothe surgeries and surgeons’ unavailability.In the literature, there are other integer programming approaches tosurgeries’ scheduling [4, 5, 6, 7, 8] and some heuristic approachesto the same problem [9, 10]. The specificities of the different casesun<strong>de</strong>r study are the most relevant factor contributing to the diversityof each work in this area.2. MATHEMATICAL MODELThe problem <strong>de</strong>scribed in the previous section consists of schedulingelective surgeries for a <strong>da</strong>y, a room and a starting time period,with a weekly planning horizon. Since surgeries are nonpreemptivejobs, starting time variables were consi<strong>de</strong>red in formulatingthe problem [11]. Thus, the <strong>de</strong>cision variables used in themo<strong>de</strong>l are: x srtd = 1, if surgery s starts at the beginning of periodt on <strong>da</strong>y d in room r. Additional variables were also consi<strong>de</strong>redto register, on a <strong>da</strong>ily basis, the surgical specialty assigned to eachoperating room: y jrd = 1, if a surgery of specialty j starts in roomr on <strong>da</strong>y d.ALIO-EURO <strong>2011</strong> – 181


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>In response to the urgent need of improving efficiency in resourcesutilization of the operating theater, the mo<strong>de</strong>l objective functionmaximizes surgical suite occupation. The mo<strong>de</strong>l constraints reflectthe structure of the problem as presented in the previous section.There are constraints forcing the higher priority surgeries tobe scheduled on Mon<strong>da</strong>y. Other constraints oblige surgeries ofthe following level of priority to be scheduled during the planningweek, and the remaining surgeries may be scheduled or not duringthe planning week. There are constraints assuring that differentsurgeries do not overlap in the same room. These constraintsalso impose empty periods for room cleaning at the end of eachsurgery. An additional set of constraints provi<strong>de</strong>s the possibility toconsi<strong>de</strong>r surgeons’ or patients’ unavailability periods. Constraintspreventing assignment of more than one surgery specialty to eachroom and <strong>da</strong>y are also inclu<strong>de</strong>d. Therefore, it is not permitted toexchange surgery specialty in the room during the <strong>da</strong>y. It is alsoensured that surgeons do not overlap between rooms in the sametime period and <strong>da</strong>y. In the real situation of the hospital involved,surgeons may exchange operating rooms. On the one hand, this exchangeis feasible as the rooms are physically si<strong>de</strong> by si<strong>de</strong>. On theother hand, permission to exchange operating rooms by surgeonsallows them to work in another operating room during hygieneperiods in the previous room (about 30 minutes idle). Daily andweekly operating time limits for each surgeon are also consi<strong>de</strong>red.If the hospital objective was to reduce the waiting list for surgery,the same set of constraints could be used. Only the objective functionshould be changed to the maximization of the number of surgeriesplanned.The mo<strong>de</strong>l objective function and constraints are all linear, thusresulting in a binary integer programming mo<strong>de</strong>l.3. SOLVING APPROACHThe problem is highly complex and attains a large dimension inhospital real instances [12]. Hence, the elective surgeries’ schedulingproblem was <strong>de</strong>composed into two hierarchical phases accordingto the nature of surgeries: conventional surgeries are planned inthe first phase and ambulatory surgeries are planned in the secondphase. The first planning phase generates a high dimension problem,while the second one is of rather reduced dimension. Theoutput of the conventional planning phase is inclu<strong>de</strong>d as input forthe ambulatory surgery planning to ensure feasibility of the wholeweek’s planning due to common resources (surgeons) between thetwo planning phases.In each planning phase, an integer linear programming solver isused with limited time. If solver times out without optimality, thebest feasible integer solution obtained is improved using a simpleimprovement heuristic.4. COMPUTATIONAL RESULTSThe solution approach was tested with real <strong>da</strong>ta from the hospital,for seven planning weeks of 2007. The binary integer programmingmo<strong>de</strong>ls were solved using CPLEX 11.0 with CONCERT 2.5[13, 14]. The improvement heuristic was co<strong>de</strong>d in C++ language.All tests were performed in a Core2 Duo, 2.53 GHz computer with4GB of RAM. Time limit to run the mo<strong>de</strong>l with Cplex was set to30 000 seconds.The proposed approach originated a valid surgical plan for all testedweeks, producing a potential surgical suite occupation rate in regulartime superior to 75%, well above the corresponding rate currentlyobtained in hospital planning (un<strong>de</strong>r 40%). Furthermore,this approach also achieves to improve the waiting list reductionrate of the hospital surgical plans, which shows that the hospitalsurgical plans are clearly dominated by the corresponding proposedsurgical plans with respect to these two conflicting criteria.A similar approach can be used if we consi<strong>de</strong>r the objective to bethe waiting list reduction. In this case, we also improve the waitinglist reduction rate (above 11%) of the hospital surgical plans (un<strong>de</strong>r6%), with a potential surgical suite occupation rate in regular timesuperior to 64%.The previous results clearly show that the two criteria consi<strong>de</strong>re<strong>da</strong>re conflituous. In the first case, longer surgeries are planned (withless cleaning periods) contrary to what happens in the second casewhere shorter surgeries are chosen.Detailed results will be presented and discussed at the talk.5. FINAL REMARKSThe approach employed allowed the authors to obtain an operatingplan for each one of the seven weeks tested. The operatingplans obtained are feasible and meet the necessary requirementsimposed by the hospital in question. This approach enables thehospital surgical suite to be more efficiently used, thus achievingthe purpose of the study un<strong>de</strong>rtaken and responding to the hospitalmanagement’s interest. Moreover, the methodologies <strong>de</strong>velopedhave also an impact on reducing the waiting list for surgery.6. ACKNOWLEDGEMENTSThe authors would like to thank Dr. Manuel Delgado for his enthusiasticinterest in this work, Dra. Margari<strong>da</strong> Baltazar for herpatient provision of all necessary <strong>da</strong>ta and nurse Fátima Menezesfor the friendly and patient <strong>de</strong>scription of the entire process.This research is partially supported by Portuguese Foun<strong>da</strong>tion forScience and Technology (FCT) un<strong>de</strong>r project POCTI/ISFL-1/152.7. REFERENCES[1] Health Care Financial Management Association, “Achievingoperating room efficiency through process integration,”Health Care Financial Management Association, Tech. Rep.,2005.[2] E. Litvak and M. C. Long, “Cost and quality un<strong>de</strong>r managedcare: irreconcilable differences?” The American Journal ofManaged Care, vol. 6, no. 3, pp. 305–312, 2000.[3] General Direction of Health, Ed., Plano Nacional <strong>de</strong> Saú<strong>de</strong>2004-2010: mais saú<strong>de</strong> para todos. Lisbon: General Directionof Health, 2004, (in Portuguese).[4] B. Cardoen, E. Demeulemeester, and J. Beliën, “Optimizinga multiple objective surgical case sequencing problem,”International Journal of Production Economics, vol. 119,no. 2, pp. 354–366, 2009.[5] ——, “Sequencing surgical cases in a <strong>da</strong>y-care environment:an exact branch-and-price approach,” Computers & OperationsResearch, vol. 36, no. 9, pp. 2660–2669, 2009.[6] R. Velásquez and M. T. Melo, “A set packing approachfor scheduling elective surgical procedures,” in OperationsResearch <strong>Proceedings</strong> 2005. Springer Berlin Hei<strong>de</strong>lberg,2006, pp. 425–430.[7] H. Fei, C. Chu, and N. Meskens, “Solving a tactical operatingroom planning problem by a column-generation-basedheuristic procedure with four criteria,” Annals of OperationsResearch, vol. 166, no. 1, pp. 91–108, 2009.ALIO-EURO <strong>2011</strong> – 182


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>[8] E. Marcon, S. Kharraja, and G. Simonnet, “The operatingtheatre planning by the follow-up of the risk of no realization,”International Journal of Production Economics,vol. 85, no. 1, pp. 83–90, 2003.[9] E. Hans, G. Wullink, M. van Hou<strong>de</strong>nhoven, andG. Kazemier, “Robust surgery loading,” European Journal ofOperational Research, vol. 185, no. 3, pp. 1038–1050, 2008.[10] M. van <strong>de</strong>r Lans, E. W. Hans, J. L. Hurink, G. Wullink,M. van Hou<strong>de</strong>nhoven, and G. Kazemier, “Anticipating urgentsurgery in operating room <strong>de</strong>partments,” University ofTwente, Tech. Rep. WP-158, 2006.[11] J. P. Sousa and L. A. Wolsey, “A time in<strong>de</strong>xed formulation ofnon-preemptive single machine scheduling problems,” MathematicalProgramming, vol. 54, pp. 353–367, 1992.[12] I. Marques, “Planeamento <strong>de</strong> cirurgias electivas - abor<strong>da</strong>gensem programação inteira,” Ph.D. dissertation, <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong>CiÍncias, Universi<strong>da</strong><strong>de</strong> <strong>de</strong> Lisboa, 2010.[13] ILOG CPLEX 11.0 User’s Manual, ILOG, Incline Village,Neva<strong>da</strong>, 2007.[14] ILOG CONCERT 2.0 User’s Manual, ILOG, Incline Village,Neva<strong>da</strong>, 2004.ALIO-EURO <strong>2011</strong> – 183


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Tackling Freshness in Supply Chain Planning of Perishable ProductsPedro Amorim ∗ Hans-Otto Günther † Bernardo Alma<strong>da</strong>-Lobo ∗∗ DEIG, Faculty of Engineering, University of PortoRua Dr. Roberto Frias, s/n, 4600-001 Porto, Portugal{amorim.pedro, alma<strong>da</strong>.lobo}@fe.up.pt† Department of Production Management, Technical University of BerlinStrasse <strong>de</strong>s 17. Juni 135, 10623hans-otto.guenther@tu-berlin.<strong>de</strong>ABSTRACTIntegrated production and distribution planning has received a lotof attention throughout the years and its economic advantages overa <strong>de</strong>coupled approach is well documented. However, for highlyperishable products this integrated approach has to inclu<strong>de</strong>, furtherthan the economic aspects, the intangible value of customers’willingness to pay, which is related to product freshness. Hence,in this work we explore, through a multi-objective framework, thepotential advantages of integrating these two intertwined planningproblems at an operational level for this kind of products. We formulateintegrated and <strong>de</strong>coupled mo<strong>de</strong>ls for the case where perishablegoods have a fixed and a loose shelf-life in or<strong>de</strong>r to test ourhypothesis. An illustrative example is used to interpret the mo<strong>de</strong>lsand the results show that the economic benefits <strong>de</strong>rived from usingan integrated approach are much <strong>de</strong>pen<strong>de</strong>nt on the freshness levelof products <strong>de</strong>livered that the planner is aiming at as well as on thetype and <strong>de</strong>gree of perishability the product is subject to.Keywords: Suppy chain planning, Multi-objective, Perishability1. INTRODUCTIONRapidly <strong>de</strong>teriorating perishable goods, such as fruits, vegetables,yoghurt and fresh milk, have to take into account the perishabilityphenomenon even for the operational level of production and distributionplanning, which has a timespan ranging from one weekto one month. Usually these products start <strong>de</strong>teriorating from themoment they are produced on. Therefore, without proper care,inventories may rapidly get spoiled before their final use makingthe stakehol<strong>de</strong>rs incur on avoi<strong>da</strong>ble costs. The customers of theseproducts are aware of the intense perishability they are subject to,and they attribute an intangible value to the relative freshness ofthe goods [1]. To evaluate freshness customers rely on visual cueswhich may differ among the broad class of perishable products.Nahmias [2] dichotomized <strong>de</strong>teriorating goods in two categoriesaccording to their shelf-life: (1) fixed lifetime: items’ lifetime ispre-specified and therefore the impact of the <strong>de</strong>teriorating factorsis taken into account when fixing it. In fact, the utility of theseitems may <strong>de</strong>crease during its lifetime, and when passing its lifetime,the item will perish completely and become of no value, e.g.,milk, inventory in a blood bank, and yoghurt, etc. (2) random lifetime:there is no specified lifetime for these items. The lifetimefor these items is assumed as a random variable, and its probabilitydistribution may take on various forms. Examples of items thatkeep <strong>de</strong>teriorating with some probability distribution are electroniccomponents, chemicals, and vegetables, etc.When the shelf-life is fixed the most common visual cue that customersrely on is the best-before-<strong>da</strong>te (BBD). The BBD can be<strong>de</strong>fined as the end of the period, un<strong>de</strong>r any stated storage conditions,during which the product will remain fully marketable andretain any specific qualities for which tacit or express claims havebeen ma<strong>de</strong>. In this case, customers will a<strong>da</strong>pt their willingness topay for a product based on how far away the BBD is. On the otherhand, when the expiry <strong>da</strong>te of a product is not printed and thenthe shelf-life is loose, customers have to rely on their senses or externalsources of information to estimate the remaining shelf-lifeof the good. For example, if a banana has black spots or if flowerslook wilted, then customers know that these products will bespoiled rather soon.In the case of loose shelf-life, especially in the fresh food industry,manufacturers can make use of predictive microbiology to estimatethe shelf-life of these kind of products based on external controllablefactors, such as humidity and temperature [3]. To makeconcepts clearer, shelf-life is <strong>de</strong>fined as the time period for theproduct to become of no value for the customer due to the lackof the tacit initial characteristics that the product is supposed tohave. Thus, in our case, this period starts on the <strong>da</strong>y the productis produced. The <strong>de</strong>termination of shelf-life as a function ofvariable environmental conditions has been the focus of many researchactivities in this field and a consi<strong>de</strong>rable number of reliablemo<strong>de</strong>ls exist, such as the Arrhenius mo<strong>de</strong>l, the Davey mo<strong>de</strong>l andthe square-root mo<strong>de</strong>l. These mo<strong>de</strong>ls take into account the knowledgeabout microbial growth in <strong>de</strong>caying food goods un<strong>de</strong>r differenttemperature and humidity conditions.Regarding production and distribution planning, many authors haveshown the economic advantages of using an integrated <strong>de</strong>cisionmo<strong>de</strong>l over a <strong>de</strong>coupled approach [4, 5]. These advantages arebelieved to be leveraged when the product suffers a rapid <strong>de</strong>teriorationprocess and hence pushes towards a more connected view ofthese intertwined problems. For perishable goods the final productsinventory that is usually used to buffer and <strong>de</strong>couple thesetwo planning <strong>de</strong>cisions have to be questioned since customers distinguishbetween different <strong>de</strong>grees of freshness and there is an actualrisk of spoilage. In this work, we want to study the potentialadvantages of using an integrated approach for operational productionand distribution planning of perishable goods compared witha <strong>de</strong>coupled one. These advantages will be analysed through theeconomic and product freshness perspective. We focus on highlyperishable consumer goods industries, with a special emphasis onfood processing, which have to cope with complex challenges,such as the integration of lot sizing and scheduling, the <strong>de</strong>finitionof setup families consi<strong>de</strong>ring major and minor setup timesand costs, and multiple non-i<strong>de</strong>ntical production lines [6]. We arealso interested in un<strong>de</strong>rstanding if these potential advantages differamong the two distinct perishable goods classes that we have mentionedbefore: with fixed shelf-life and with loose shelf-life. Forboth cases, since we are interested in rapidly <strong>de</strong>teriorating goods,ALIO-EURO <strong>2011</strong> – 184


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>we consi<strong>de</strong>r a customer who prefers products with a higher freshnesslevel. To tackle explicitly this customer satisfaction issue weembed<strong>de</strong>d our integrated operational production and distributionplanning problem in a multi-objective framework distinguishingtwo very different and conflicting objectives of the planner. Thefirst objective is concerned with minimizing the total costs overthe supply chain covering transportation, production, setup andspoilage costs. The second one aims at maximizing the freshnessof the products <strong>de</strong>livered to distribution centres and, therefore,maximize customers’ willingness to pay [7].2. PROBLEM STATEMENTThe production and distribution planning problem consi<strong>de</strong>red inthis paper consists of a number of plants having <strong>de</strong>dicated lineswhich produce multiple perishable items with a limited capacityto be <strong>de</strong>livered to distribution centres. It is relevant to un<strong>de</strong>rstandthe importance of the <strong>de</strong>sign choice of having such a complex supplychain instead of just consi<strong>de</strong>ring one plant and multiple distributioncentres. As said before, we focus on perishable consumergoods industries which are known for <strong>de</strong>manding increasing flexibilityin the supply chain planning processes. Thus, to consi<strong>de</strong>ra network of production plants which can add increased flexibilityand reliability to hedge against the complex dynamics of suchindustries is crucial. Therefore, although we are tackling an operationallevel of <strong>de</strong>cision making for these two planning tasks weassume a central organizational unit which makes <strong>de</strong>cisions thatare followed directly at a local level. The length of the planninghorizon for such planning problem ranges from one week to onemonth.All product variants belonging to the same family form a block.Therefore a product can only be assigned to one block. Blocksare to be scheduled on parallel production lines over a finite planninghorizon consisting of macro-periods with a given length. Thescheduling takes into account that the setup time and cost betweenblocks is <strong>de</strong>pen<strong>de</strong>nt on the sequence of production (major setup).The sequence of products in a block is set a priori due to naturalconstraints in this kind of industries. Hence, when changing theproduction between two products of the same block only a minorsetup is nee<strong>de</strong>d that is not <strong>de</strong>pen<strong>de</strong>nt on the sequence, but only onthe product to be produced.In or<strong>de</strong>r to consi<strong>de</strong>r the initial stock that might be used to fulfilcurrent <strong>de</strong>mand it is important to have an overview of the inventorybuilt up in each macro-period due to perishability concerns. Thelength of the horizon that needs to be consi<strong>de</strong>red is related to theproduct with the longest shelf-life. One shall consi<strong>de</strong>r an integermultiple X of past planning horizons that is enough to cover thelongest shelf-life, i.e. X = ⌈ maxũ kT ⌉, where ũ k is a conservativevalue for shelf-life of product k.A macro-period is divi<strong>de</strong>d into a fixed number of non-overlappingmicro-periods with variable length. Since the production linescan be scheduled in<strong>de</strong>pen<strong>de</strong>ntly, this is done for each line separately.It is important to notice that each line is assigned to a plant.The length of a micro-period is a <strong>de</strong>cision variable, expressed bythe production of several products of one block in the respectivemicro-period on a line and by the time to set up the block in caseit is necessary. A sequence of consecutive micro-periods, wherethe same block is produced on the same line, <strong>de</strong>fines the size ofa lot of a block through the quantity of products produced duringthese micro-periods. Therefore, a lot may aggregate several productsfrom a given block and may continue over several micro andmacro-periods. Moreover, a lot is in<strong>de</strong>pen<strong>de</strong>nt of the discrete timestructure of the macro-periods. The number of micro-periods ofeach <strong>da</strong>y <strong>de</strong>fines the upper bound on the number of blocks to beproduced <strong>da</strong>ily on each line.There is no inventory held at production plants. Thus, at the endof each <strong>da</strong>y the production output is <strong>de</strong>livered to distribution centres(DCs), which have an unlimited storage capacity. The <strong>de</strong>liveryfunction is assured by a third-party logistics (3PL), and weassume that it charges a flat rate per pallet transported between aplant and a DC. Moreover, it is assumed that the 3PL is able tocope with whatever distribution planning was <strong>de</strong>ci<strong>de</strong>d beforehan<strong>da</strong>nd, hence, there is no capacity restriction for transportation. Thedistances between production plants and distribution centres aresmall enough so that the product is <strong>de</strong>livered on the same <strong>da</strong>y itis produced. Therefore, the <strong>de</strong>crease of freshness during the transportationprocess is consi<strong>de</strong>red to be negligible. The small distanceassumption is quite realistic in supply chains of highly perishablegoods where the distribution centres are not very far away fromthe production plants. For our purposes these assumptions shallnot pose a problem since we are still consi<strong>de</strong>ring directly the mostimportant cost drivers for transportation services: distance, quantityand service level. The <strong>de</strong>mand for an item in a macro-period ata distribution centre is assumed to be dynamic and <strong>de</strong>terministic.The problem is to plan production and distribution so as to minimizetotal cost and maximize mean remaining shelf-life of productsat the distribution centres over a planning horizon.3. RESULTSTo un<strong>de</strong>rstand the tra<strong>de</strong>-off present in the two <strong>de</strong>veloped mo<strong>de</strong>ls(fixed and loose shelf-life) regarding total costs and product freshnessas well as the differences between using an integrated overa <strong>de</strong>coupled approach for production and distribution planning anillustrative example was <strong>de</strong>veloped.In this instance there are four products to be scheduled and producedon two production lines that are located in two differentproduction plants. Each of these products belongs to a differentblock and therefore there is always sequence <strong>de</strong>pen<strong>de</strong>nt setup timeand cost to consi<strong>de</strong>r when changing from one product to another.Moreover, although the first line is able to produce every product,the second one is not able to produce all of the products.The production lines are consi<strong>de</strong>red similar and, therefore, variableproduction costs are neglected. The number of micro-periodsper macro-period was set at the constant value of four allowing theproduction of all products in a macro-period. The capacity of eachline is the same in all macro-periods and every production plant.The planning horizon is ten <strong>da</strong>ys (macro-periods) and the shelf-lifeof products varies consi<strong>de</strong>rably among them, from highly perishableones (one <strong>da</strong>y) to others which can last throughout the entireplanning horizon. Demand has to be satisfied in two different DCsand products can be transported between any pair production plant– DC. Initial stock was set to zero in both DCs. In case shelf-life isnot fixed, there are three different temperature levels possible to bechosen at each DC influencing its duration. Finally, a sensitivityanalysis regarding the perishability impact was conducted and differentscenarios where shelf-lives and <strong>de</strong>cay rates are varied wereanalysed.3.1. Fixed Shelf-Life (Case 1)In this section results for the case where the shelf-life is fixed arepresented. In Figure 1 the Base scenario solutions of the Paretooptimalfronts for both the integrated and <strong>de</strong>coupled approach arepresented.It is rather clear from the comparison of the Pareto fronts that theintegrated approach strongly dominates the <strong>de</strong>coupled one. Bothcurves have a similar behaviour, which means that for the lowervalues of freshness just a small increase in costs fosters significantlythe remaining shelf-life of <strong>de</strong>livered products. Nevertheless,ALIO-EURO <strong>2011</strong> – 185


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>potential production mistakes and the integrated approach has lesscomparative advantage.3.2. Loose Shelf-Life (Case 2)In this section we focus on the case where the shelf-life is loose.In Figure 3 the results of the Pareto fronts for both integrated and<strong>de</strong>coupled approach are presented. These solutions concern theBase scenario.Figure 1: Pareto-optimal fronts of the illustrative example whenusing an integrated and a <strong>de</strong>coupled approach (Case 1).when we are approaching a strict Just-in-Time (JIT) accomplishmentof the <strong>de</strong>mand, touching very high freshness stan<strong>da</strong>rds, thecosts start to increase in a more important way. Furthermore, it isinteresting to notice that the savings in costs when using an integrate<strong>da</strong>pproach over a <strong>de</strong>coupled one tend to fa<strong>de</strong> when we aimat an increased freshness. This may be explained by the fact thatto achieve very high freshness stan<strong>da</strong>rds almost no inventory isallowed since we are working un<strong>de</strong>r in a JIT policy, this will constrainso much the solution space that the integrated and coupledsolutions are rather the same. Finally, in Figure 2 we perform asensitivity analysis regarding the perishability settings. The percentagesaving of using an integrated approach over a <strong>de</strong>coupledone is plotted for the three scenarios. In or<strong>de</strong>r to calculate the savingboth Pareto fronts (integrated and <strong>de</strong>couple approach) were estimatedthrough a second-or<strong>de</strong>r polynomial regression which hasa good fit to the experimental <strong>da</strong>ta with all R 2 above 90%.Figure 3: Pareto-optimal fronts of the illustrative example whenusing an integrated and a <strong>de</strong>coupled approach (Case 2).As it happened with the results of Case 1, the Pareto front related tothe integrated approach strongly dominates the one correspondingto the <strong>de</strong>coupled approach. It is interesting to note that both Paretofronts are non-convex. The other reasoning ma<strong>de</strong> before for Case1 regarding the behaviour of the fronts also applies to this case.As done before for Case 1, in Figure 4 the results of the sensitivityanalysis to un<strong>de</strong>rstand the effect of different perishability settingsis presented. The percentage saving of using an integrate<strong>da</strong>pproach over a <strong>de</strong>coupled one is plotted for the three scenarios.Figure 2: Total percentage saving when using a integrated approachover a <strong>de</strong>coupled one for three scenarios (Case 1).The potential savings of using an integrated approach over a <strong>de</strong>coupledone are rather consi<strong>de</strong>rable for the fixed-shelf-life caseand, in<strong>de</strong>pen<strong>de</strong>ntly of the scenario, the behaviour over the remainingshelf-life is quite similar. For the scenario with highly perishableproducts the savings can ascend up to 42% when aiming at70% of remaining shelf-life.When comparing the three scenarios it is observable that the advantagesof using an integrated approach are leveraged by the <strong>de</strong>greeof perishability the goods are subject to. In fact, when we areplanning using a <strong>de</strong>coupled approach and the products are subjectto intense perishability, the myopic mistakes incurred in productionplanning will be hardly corrected by the distribution processbecause the buffer between those activities is reduced by the smallamount of time that goods can stay stored. Therefore, the advantagesof using an integrated approach are boosted consi<strong>de</strong>rably forthis scenario. On the other hand, when <strong>de</strong>aling with products withlow perishability the buffer enables the possibility of correcting theFigure 4: Total percentage saving when using a integrated approachover a <strong>de</strong>coupled one for three scenarios (Case 2).Unlike Case 1 the savings are not as bold and the maximum savingascends to 20% for an average remaining shelf-life of about 65%in the Base scenario, which is still rather remarkable. Nevertheless,the behaviour of both saving curves (from Case 1 and Case 2)is very similar. The explanation for the difference in the amount ofsavings between the fixed and loose shelf-life case may lie in thefact that for the loose shelf-life case the distribution process hasmuch more freedom to influence both costs and specially productfreshness. Hence, for the <strong>de</strong>coupled approach even after the productionprocess has fixed the production quantities, the distributionprocess is still able to compensate potential mistakes through the<strong>de</strong>cisions on temperature of storage.Looking to the differences between the three scenarios it is interestingto notice that in this case the reasoning is not as straightfor-ALIO-EURO <strong>2011</strong> – 186


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>ward as in Case 1. Here, the two extreme scenarios have a similarbehaviour for different reasons. The scenario with products subjectto low perishability has a rather humble saving when usingan integrated approach for the same reasons as in Case 1. Hence,since the time buffer between production and distribution is ratherlarge the advantages of using an integrated approach are hin<strong>de</strong>red.In the scenario having products with a high perishability the explanationfor the relative low saving is related to the possibility ofcorrecting freshness problems coming from a myopic productionplanning in the <strong>de</strong>coupled approach through controlling the temperatureof storage in the distribution planning. When productsare highly perishable a small <strong>de</strong>crease in the storage temperaturewill entail a strong percentage augmentation of shelf-life. Hence,fi in a product with 7 <strong>da</strong>ys of shelf-life we are able to augmentit to 8 <strong>da</strong>ys through storing it at cooler temperature, then the percentageincreasing of shelf-life is not very significant. But, if theproduct is highly perishable, then an absolute increase of one <strong>da</strong>ywill reflect a strong percentage increase. Therefore, the scenariowith products subject to intermediate perishability (Base scenario)is the one which gains more from an integrated approach.4. CONCLUSIONSIn this paper, we have discussed the importance of integrating theanalysis for a production and distribution planning problem <strong>de</strong>alingwith perishable products. The logistic setting of our operationalproblem is multi-product, multi-plant, multi-DC and multiperiod.We have <strong>de</strong>veloped mo<strong>de</strong>ls for two types of perishableproducts: with fixed shelf-life and with loose shelf-life, alwaystaking into account that customers attribute <strong>de</strong>creasing value toproducts while they are aging until they completely perish. Thenovel formulations allow a comprehensive and realistic un<strong>de</strong>rstandingof these intertwined planning problems. Furthermore, the looseshelf-lifemo<strong>de</strong>l was able to incorporate the possibility of <strong>de</strong>alingwith the un<strong>de</strong>rlying uncertainty of a random spoilage process withthe help of predictive microbiology. To un<strong>de</strong>rstand the impact ofthe integrated approach in both the economic and the freshnessperspective a multi-objective framework was used. Since the formulationsfor the loose shelf-life case were not possible to solvewith stan<strong>da</strong>rd solvers, even for a small example, a simple heuristicwas <strong>de</strong>veloped for these cases.Computational results for an illustrative example show that thePareto front of the integrated approach strongly dominates the Paretofront of the <strong>de</strong>coupled one for both classes of perishable products.The economic savings that this coupled analysis entail is smoothe<strong>da</strong>s we aim to <strong>de</strong>liver fresher products. Nevertheless, in the fixedshelf-life case for a 70% mean remaining shelf-life of <strong>de</strong>liveredproducts we may reach savings around 42%. The explanation regardingthe fact that the gap between the integrated and the <strong>de</strong>couple<strong>da</strong>pproach tends to smooth for very high freshness stan<strong>da</strong>rds,may be due to the reason that in the latter case no inventory is allowedsince we are working completely un<strong>de</strong>r a JIT policy, turningthe problem at hand so constrained that the integrated and coupledsolutions are rather the same. The multi-objective frameworkproved to be essential to draw these multi-perspective conclusions.5. REFERENCES[1] M. Tsiros and C. Heilman, “The effect of expiration <strong>da</strong>tes andperceived risk on purchasing behavior in grocery store perishablecategories,” Journal of Marketing, vol. 69, pp. 114–129,2005.[2] S. Nahmias, “Perishable inventory theory: a review,” OperationalResearch, vol. 30, no. 4, pp. 680–708, 1982.[3] B. Fu and T. Labuza, “Shelf-life prediction: theory an<strong>da</strong>pplication,” Food Control, vol. 4, no. 3, pp. 125–133, 1993.[Online]. Available: http://linkinghub.elsevier.com/retrieve/pii/0956713593902983[4] C. Martin, D. Dent, and J. Eckhart, “Integrated Production,Distribution, and Inventory Planning at Libbey-Owens-Ford,”Interfaces, vol. 23, no. 3, pp. 68–78, 1993.[5] D. Thomas and P. Griffin, “Coordinated supply chainmanagement,” European Journal of Operational Research,vol. 94, no. 1, pp. 1–15, Oct. 1996. [Online]. Available: http://linkinghub.elsevier.com/retrieve/pii/0377221796000987[6] B. Bilgen and H.-O. Günther, “Integrated production anddistribution planning in the fast moving consumer goodsindustry: a block planning application,” OR Spectrum, vol. 32,no. 4, pp. 927–955, Jun. 2009. [Online]. Available: http://www.springerlink.com/in<strong>de</strong>x/10.1007/s00291-009-0177-4[7] P. Amorim, C. H. Antunes, and B. Alma<strong>da</strong>-Lobo, “Multiobjectivelot-sizing and scheduling <strong>de</strong>aling with perishabilityissues,” Industrial & Engineering Chemistry Research,vol. 50, no. 6, pp. 3371–3381, <strong>2011</strong>.ALIO-EURO <strong>2011</strong> – 187


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Approaching a robust bi-objective supply chain <strong>de</strong>sign problem by ametaheuristic procedureCardona-Valdés Yajaira ∗ Álvarez A<strong>da</strong> ∗ Pacheco Joaquín †∗ Universi<strong>da</strong>d Autónoma <strong>de</strong> Nuevo LeónNuevo León, Méxicoyajaira@yalma.fime.uanl.mx, a<strong>da</strong>lvarez@mail.uanl.mx† Universi<strong>da</strong>d <strong>de</strong> BurgosBurgos, Españajpacheco@ubu.esABSTRACTWe consi<strong>de</strong>r the <strong>de</strong>sign of a two-echelon production distributionnetwork with multiple manufacturing plants, customers and a setof candi<strong>da</strong>te distribution centers. On this study we incorporateuncertainty on the <strong>de</strong>mand of the customers which is representedthrough scenarios.As well, there are several transportation options available for eachpair of facilities between echelons. Each option represents a typeof service with associated cost and time parameters leading aninverse correspon<strong>de</strong>nce between them. This tra<strong>de</strong>off is handledthrough a bi–objective optimization mo<strong>de</strong>l, where the involvedobjectives should be minimized. One criterion minimizes the expectedcost of facility location, transportation, and the penalty forunmet <strong>de</strong>mand. The other criterion looks for the minimum timeto transport the product along any path from the plants to the customers.An estimated Pareto robust front is found using several tabu searches.Preliminary experiments show the computational effect.Keywords: Robust optimization, Multiobjective optimization, Supplychain, Metaheuristic, Tabu search1. INTRODUCTIONIn this study, we address the <strong>de</strong>sign of a supply chain of a two–echelon distribution system. The supply chain planning <strong>de</strong>cisionscan be classified as those concerning inventory, transportation andfacility location [1]. This work is <strong>de</strong>voted to facility location andthe selection of transportation mo<strong>de</strong>s, where both <strong>de</strong>fine the distributionnetwork in the supply chain.Network <strong>de</strong>sign <strong>de</strong>cisions <strong>de</strong>termine the supply chain configurationand have a significant impact in logistic costs and responsiveness[2]. For instance, facility location has a long term impact inthe supply chain because of the high cost to open a facility or tomove it. While cost related to opening a new facility and inventorypooling costs induce to reduce the number of facilities, responsivenesscauses a contrary effect. A high number of facilities may reducethe lead time to <strong>de</strong>liver a product to the final customer. In certainproducts lead time can be viewed as an ad<strong>de</strong>d value so that thefirm that makes them available first can obtain short and long termcompetitive advantages in the market. As it can be seen, facility location<strong>de</strong>cisions play a critical role in the strategic <strong>de</strong>sign of supplychain networks. For more <strong>de</strong>tails, a recent review on this topic canbe found at [3]. The authors highlight the facility location mo<strong>de</strong>lsincorporated insi<strong>de</strong> the supply chain management (SCM) framework,in particular, the integration of location <strong>de</strong>cisions with other<strong>de</strong>cisions relevant to the <strong>de</strong>sign of a supply chain network (typical<strong>de</strong>cisions as capacity, inventory, procurement, production, routing,and the choice of transportation mo<strong>de</strong>s). As well as strengthen themissing literature involving uncertainty on the SCM.In this study we consi<strong>de</strong>r a set of potential locations for new distributioncenters. Each candi<strong>da</strong>te site has a fixed cost for opening afacility with a limited capacity. In the supply network the numberand location of plants and customers are known.There are several transportation options available for each pairof facilities between echelons. The alternatives are generated bythe supplier from different companies, the availability of differenttypes of service at each company (e.g. express and regular), andthe use of different mo<strong>de</strong>s of transportation (e.g. truck, rail, airplane,ship or inter–mo<strong>da</strong>l). Commonly, these differences involvean inverse correspon<strong>de</strong>nce between time and cost, i.e. a faster servicewill be more expensive.In or<strong>de</strong>r to balance the economic concerns with prompt <strong>de</strong>mandsatisfaction, our approach is to minimize the total cost and themaximum time nee<strong>de</strong>d for shipping the product across the wholesupply chain, simultaneously.This bi–objective problem was first introduced by [4] as the “CapacitatedFixed Cost Facility Location Problem with TransportationChoices”. In their contribution, the authors consi<strong>de</strong>r that allthe <strong>de</strong>sign parameters are <strong>de</strong>terministic.However, in practice, supply chains are characterized by numeroussources of technical and commercial uncertainties. Criticalparameters as customer <strong>de</strong>mands, prices and future facility capacitiesare quite uncertain. The fact that meeting customer <strong>de</strong>mandis what mainly drives most supply chain initiatives motivated us tostudy the problem consi<strong>de</strong>ring that the <strong>de</strong>mand is a random variablewhose value is not known at the time of <strong>de</strong>signing the network.Literature reveals that there are several studies which <strong>de</strong>al withuncertainty in supply chain management at different levels. In tacticallevel supply chain planning we can mention, for example,some papers related to the distribution of raw materials and products[5, 6]. At the strategic level, there is a great <strong>de</strong>al of researchin the facility location component of supply chain network <strong>de</strong>signun<strong>de</strong>r uncertainty. A good review can be found in the studies of[7, 3].The optimization focus in traditional SCM problems on maximizingprofit or minimizing costs as a single objective [8, 9, 10]. Nevertheless,other criteria to meet customer <strong>de</strong>mand on time such ascustomer response time, or fill rate should be taken into accountbecause they are related to the most basic functions of the SCM:ALIO-EURO <strong>2011</strong> – 188


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>to meet customer requirements.In the last years customer response time related consi<strong>de</strong>rationshave been revisited in the distribution network <strong>de</strong>sign [11, 12].Controlling lead time is becoming a competitive advantage formany firms because of the transformation of the manufacturing–distribution chains throughout the world. This parameter has effectson costs and also can be affected by the supply chain configuration.Papers involving an integrated <strong>de</strong>sign of supply chain networks un<strong>de</strong>runcertainty and consi<strong>de</strong>ring several objectives are significantlysmaller in number [13, 14, 15].As commented in [3] in relation to the type of the objective functionmeasuring supply chain performance, 75% attempts a costminimization function, 16% attempts a profit maximization andonly 9% refers to mo<strong>de</strong>ls with multiple and conflicting objectives.In this study we assume that the response time is influenced bythe selection of the transportation channel between facilities. Theexistence of third party logistic companies allows that differenttransportation services are available in the market, so in this paperwe consi<strong>de</strong>r several alternatives to transport the product betweenfacilities, where each option represents a type of service with associatedcost and time parameters. The implicit assumption is that afaster transportation mo<strong>de</strong> is also a more expensive one, creating atra<strong>de</strong>off between cost and time that affects the distribution networkconfiguration.The selection of a transportation channel has commonly been limitedto the transportation mo<strong>de</strong>. In an international context, differenttransportation mo<strong>de</strong>s are usually a consequence of the naturaloptions of transportation around the world: by air, by sea or byland. On this matter, the literature related to supply chain managementallowing several transportation mo<strong>de</strong>s to be chosen is toofew, only four papers feature this characteristic on the review givenby [3]. In this study, the term “transportation channel” is moregeneric and inclu<strong>de</strong>s not only choices for the transportation mo<strong>de</strong>but also for different types of services from one or several transportationcompanies. Although the principles may be the same,this distinction is important to <strong>de</strong>scribe a more general case.Therefore, on one si<strong>de</strong>, the objective of this study is to select theappropriate sites to open distribution centers and <strong>de</strong>termine theflow between facilities to minimize the total expected cost involvingfacility location, transportation and a penalty for unmet <strong>de</strong>mand.The last term <strong>de</strong>scribes mo<strong>de</strong>l infeasibility and representsunmet <strong>de</strong>mand un<strong>de</strong>r some scenario. An application in agile manufacturingis shown in [16].On the other si<strong>de</strong>, it is <strong>de</strong>sired to minimize the transportation timefrom the plant to the customers. This part of the problem <strong>de</strong>termineswhich services will be selected in or<strong>de</strong>r to reduce the transportationtime in each echelon of the supply chain. Hence, thetra<strong>de</strong>off between cost and time creates a bi–objective problem.Even though there are studies that i<strong>de</strong>ntify the cost–time tra<strong>de</strong>offas an important element in supply chain <strong>de</strong>sign, they do not relatethis balance to the availability of transportation choices betweenfacilities [17, 18]. In [19], authors use an aggregated functionfor time and cost. Although different transportation mo<strong>de</strong>s areinclu<strong>de</strong>d in their mo<strong>de</strong>l (rail and truck), the problem is to selectbetween a direct and an inter–mo<strong>da</strong>l shipping strategy. They donot have transportation choices between each pair of locations.The cost–time tra<strong>de</strong> off, in conjunction with the uncertainty in <strong>de</strong>mands,means that we are handling a novel multi–objective optimizationproblem un<strong>de</strong>r uncertainty. The contribution of this studyis to propose a procedure to find a set of non-dominated robust solutionsthis problem2. PROBLEM DESCRIPTIONThe “Stochastic Capacitated Fixed Cost Facility Location Problemwith Transportation Choices” (SCFCLP–TC) is based on a two–echelon system for the distribution of one product in a single timeperiod. In the first echelon the manufacturing plants send productto distribution centers (DC). The second echelon corresponds tothe flow of product from the distribution centers to the customers.The number of customers is known. The number of plants, theirlocations and manufacturing capacities are also known. There isa set of potential locations to open distribution centers. The numberof open DC is not <strong>de</strong>fined a priori. Each candi<strong>da</strong>te site hasa fixed cost for installing a DC, where the DC will have a limitedoperational capacity. There are several transportation optionsavailable for each pair of facilities between echelons. Each optionrepresents a type of service with associated cost and time parameters.Each customer has an associated product <strong>de</strong>mand, whichmust be supplied from a single DC. The exact <strong>de</strong>mand realizationis not known in advance. Thus, the <strong>de</strong>mand will be consi<strong>de</strong>red asa random variable mo<strong>de</strong>led through scenarios.Figure 1: Supply Chain ConfigurationAn improved stochastic programming called robust programmingwas presented by [20]. An optimal solution to a robust optimizationmo<strong>de</strong>l is <strong>de</strong>fined as solution robust if it remains “close” tooptimal for all scenarios of the input <strong>da</strong>ta, and mo<strong>de</strong>l robust if itremains “almost” feasible for all <strong>da</strong>ta scenarios. In this way, minimizingthe expected combined cost of transportation and facilitylocation will lead to a solution robust, while minimizing the expectedcost for unmet <strong>de</strong>mand will contribute to a mo<strong>de</strong>l robust.That is, we are penalizing the unmet <strong>de</strong>mand, so in the final solutionthe amount of unmet <strong>de</strong>mand will be as small as possible.The robust optimization mo<strong>de</strong>l, by <strong>de</strong>sign, yield solutions that areless sensitive to the mo<strong>de</strong>l <strong>da</strong>ta, given that the mo<strong>de</strong>l measures thetra<strong>de</strong>off between solution and mo<strong>de</strong>l robustness.The <strong>de</strong>cision related to the transportation options has an impact onthe transportation time from the plant to the customer. The tra<strong>de</strong>offbetween cost and time must be consi<strong>de</strong>red in the formulation of amathematical mo<strong>de</strong>l that minimizes both criteria simultaneously.Hence, the problem should be addressed through a bi–objectiveoptimization mo<strong>de</strong>l. Following this approach, one criterion minimizesthe combined expected cost of transportation, facility location,and the penalty for unmet <strong>de</strong>mand. The other criterion looksfor the minimum time to transport the product along any path fromthe plants to the customers.3. METHODOLOGY PROPOSEDMetaheuristics have many <strong>de</strong>sirable features to be an excellentmethod to solve very complex SCM problems: in general they aresimple, easy to implement, robust and have been proven highlyeffective to solve hard problems [21].ALIO-EURO <strong>2011</strong> – 189


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>The procedure for <strong>de</strong>termining an estimated Pareto front is basedin MOAMP [22] which consists on the following three phases:1. Look for efficient solutions close to the ends of the Paretofrontier, that is, solutions that approximate to the best solutionsof the single-objective problems that result consi<strong>de</strong>ringeach objective separately2. Look for additional points insi<strong>de</strong> the efficiency curve, thatis, find the efficient points that represent a good compromisebetween the distinct objectives consi<strong>de</strong>red3. Intensify the search around the efficient points found in previousphasesThe first phase of MOAMP starts from an arbitrary initial solutionand optimizes, at first, the objective f 1 . Starting from the last pointvisited at the end of this search (usually a local optimum for f 1 ) thesearch is conducted again to find the best solution to the problemwith the single objective f 2 . In the case of two objectives, onemore search is carried out for the objective f 1 , starting from thebest point found in the last search.In our implementation, we first build a near optimal solution forobjective f 1 and this point is consi<strong>de</strong>red as the initial point forthe optimization process for f 2 . Then, we build a near-optimalsolution for f 2 and from this point we start the optimization for f 1 .In the second phase we launch several tabu searches using a globalcriterion method. In this step, the aim is to minimize a functionthat measures the distance to the i<strong>de</strong>al point following the notionof compromise programming, on the un<strong>de</strong>rstanding that is logicalfor the <strong>de</strong>cision maker to prefer a solution that is closer to thei<strong>de</strong>al point over the one that is farther away. The metric employedis the L ∞ because it has been shown to lead to balanced efficientsolutions, as showed in [22]. In general, a point that minimizes anL q (1 ≤ q ≤ ∞) distance to the i<strong>de</strong>al point is an efficient point. Theset of all points obtained in this way is called the compromise set.These have the characteristic of providing a good balance amongthe values of the p objective functions.A graphical representation of the two phases of MOAMP is shownin figure 2.Finally, the third phase consists on an intensification process of theinitial Pareto front obtained during the first two phases. Here, eachpoint on the efficient set is improved via a local search. After eachpoint is improved the set of efficient points will be the approximationto the Pareto front provi<strong>de</strong>d.x 1f 2xx 42f 1x 3f 1Figure 2: A general framework of the MOAMP procedure for abiobjective problem4. DISCUSSION AND EXPERIMENTAL PRELIMINARYRESULTSAs mentioned before, the first part of MOAMP starts from an “arbitrarypoint”. In our case, we first build a near optimal solutionfor objective f 1 and this point is consi<strong>de</strong>red as the initial pointfor the optimization process for f 2 . The solution for objectivef 1 is ma<strong>de</strong> via a GRASP procedure. For constructing a feasiblesolution, we proceed in a backward form. Starting from the secondlevel of the supply chain, we solve a generalized assignmentproblem and apply the procedure <strong>de</strong>veloped by [23]. Once thecustomer-distribution center assignment has been <strong>de</strong>termined, thefirst level of the supply chain gets the structure of a transportationproblem, so we proceed solving that problem to complete a feasiblesolution. The objective function that gui<strong>de</strong>s the constructionof this initial point is a robust function that minimizes the totalexpected cost for facility location, transportation and the penaltyfor <strong>de</strong>mand unmet. Then, this feasible solution is improved usinga local search procedure, exchanging customers–distributioncenters assignments until it is not possible to get a better solution.After that, an initial near-optimal solution for f 2 is constructedusing also a GRASP procedure. This procedure is similar to theone <strong>de</strong>signed for constructing a solution for objective f 1 , however,the greedy function that gui<strong>de</strong>s the search is properly re-<strong>de</strong>fined totake now into account the time required to transport the productsalong the supply chain.For the intensification phase the same local search used in theGRASP procedure is applied. All visited points during the searchesconducted on the three phases are checked for inclusion in the setof non-dominated solutions, which is the output of the algorithm.In or<strong>de</strong>r to vali<strong>da</strong>te the proposed algorithm, several computationalexperiments have been <strong>de</strong>signed. The first experiment intends tomeasure the performance of the method regarding to the size ofthe problem. For that purpose preliminary test were conducted oninstances with 3 and 5 plants, 3 and 5 distribution centers, 4 and8 customers, 2 or 3 transportation channels, and 2 or 3 scenarios.The second experiment attempts to measure the contribution ofeach phase of the proposed method toward the quality of the finalapproximation of the efficient set.5. CONCLUSIONSIn this paper, we have studied a supply chain <strong>de</strong>sign problem thatinvolves uncertainty on the customers’ <strong>de</strong>mands mo<strong>de</strong>led by scenarios.Two conflicting objectives are consi<strong>de</strong>red: as well as the totalcost, the maximum time nee<strong>de</strong>d for shipping the product acrossthe chain total transportation time, has to be minimized.We have formulated it by a biobjective mo<strong>de</strong>l that minimizes thecost for opening distribution centers, the expected value of thetransportation cost and the expected value for unmet <strong>de</strong>mand. Simultaneously,the mo<strong>de</strong>l minimizes the sum of the maximum leadtime for the plants to the customers through each distribution center.As the mo<strong>de</strong>l penalizes the unmet <strong>de</strong>mand in the final solution theamount of unmet <strong>de</strong>mand will be as small as possible.Taking into account the computational complexity of the addressedproblem we have <strong>de</strong>signed a solution approach based on metaheuristics.Preliminary results show the proposed method performswell, but the computational time grows as the numbers ofscenarios increase. Therefore, ongoing work is conducted in or<strong>de</strong>rto improve this.6. ACKNOWLEDGMENTThis work was partially supported by CONACYT (México) grant61903.ALIO-EURO <strong>2011</strong> – 190


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>7. REFERENCES[1] R. H. Ballou, Business Logistics Management. USA: UpperSaddle River, 1999.[2] S. Chopra and P. Meindl, Supply Chain Management: Strategy,Planning and Operation. USA: Prentice Hall, UpperSaddle River, 2004.[3] M. T. Melo, S. Nickel, and F. Sal<strong>da</strong>nha-<strong>da</strong> Gama, “Facilitylocation and supply chain management– a review,” EuropeanJournal of Operational Research, vol. 196, no. 1, pp. 401–412, 2009.[4] E. Olivares, J. González-Velar<strong>de</strong>, and R. Ríos-Mercado, “Ametaheuristic algorithm for a bi-objective supply chain <strong>de</strong>signproblem,” in RED-M, 2007, pp. 5–8.[5] H. Van Lan<strong>de</strong>ghem and H. Vanmaele, “Robust planning: Anew paradigm for <strong>de</strong>mand chain planning,” Journal of OperationManagement, vol. 20, no. 6, pp. 769–783, 2002.[6] C.-S. Yu and H.-L. Li, “A robust optimization mo<strong>de</strong>l forstochastic logistic problems,” International Journal of ProductionEconomics, vol. 64, no. 1-3, pp. 385–397, 2000.[7] L. V. Sny<strong>de</strong>r, “Facility location un<strong>de</strong>r uncertainty: a review,”IIE Transactions, vol. 38, no. 7, pp. 537–554, 2006.[8] P. Tsiakis, N. Shah, and C. Panteli<strong>de</strong>s, “Design of multi–echelon supply chain networks un<strong>de</strong>r <strong>de</strong>mand uncertainty,”Industrial and engineering chemistry research, vol. 40,no. 16, pp. 3585–3604, 2001.[9] T. Santoso, S. Ahmed, M. Goetschalckx, and A. Shapiro, “Astochastic programming approach for supply chain network<strong>de</strong>sign un<strong>de</strong>r uncertainty,” European Journal of OperationalResearch, vol. 167, no. 1, pp. 96–115, 2005.[10] S. Elhedhli and F. Gzara, “Integrated <strong>de</strong>sign of supply chainnetworks with three echelons, multiple commodities andtechnology selection,” IIE Transactions, vol. 40, no. 1, pp.31–44, 2008.[11] I. Erol and W. G. Ferrell Jr., “A methodology to support <strong>de</strong>cisionmaking across the supply chain of an industrial distributor,”International Journal of Production Economics, vol. 89,no. 1, pp. 119–129, 2004.[12] A. De Toni and S. Tonchia, “Performance measurement systems:Mo<strong>de</strong>ls, characteristics and measures,” InternationalJournal of Operations & Production Management, vol. 21,no. 1-2, pp. 46–70, 2001.[13] E. Sabri and B. Beamon, “A multi–objective approach tosimultaneous strategic and operational planning in supplychain <strong>de</strong>sign,” The International Journal of Management Science,vol. 28, no. 5, pp. 581–598, 2000.[14] C. Chen, B. Wang, and W. Lee, “Multi–objective optimizationfor a multi-enterprise supply chain network,” Industrialand Engineering Chemistry Research, vol. 42, no. 6–7, pp.1879–1889, 2008.[15] G. Guillén, F. D. Mele, M. J. Bagajewicz, A. Espuña, andL. Puigjaner, “Multiobjective supply chain <strong>de</strong>sign un<strong>de</strong>r uncertainty,”Chemical Engineering Science, vol. 60, no. 6, pp.1535–1553, 2005.[16] F. Pan and R. Nagi, “Robust supply chain <strong>de</strong>sign un<strong>de</strong>r uncertain<strong>de</strong>mand in agile manufacturing,” Computers and OperationResearch, vol. 37, pp. 668–683, 2010.[17] G. Zhou, H. Min, and M. Gen, “A genetic algorithm approachto the bi-criteria allocation of customers to warehouses,”International Journal of Production Economics,vol. 86, no. 1, pp. 35–45, 2003.[18] T. Truong and F. Azadivar, “Optimal <strong>de</strong>sign methodologiesfor configuration of supply chains,” International Journal ofProduction Research, vol. 43, no. 11, pp. 2217–2236, 2005.[19] E. Eskigun, R. Uzsoy, P. Preckel, G. Beaujon, S. Krishnan,and J. Tew, “Outbound supply chain network <strong>de</strong>sign withmo<strong>de</strong> selection, lead times and capacitated vehicle distributioncenters,” European Journal of Operational Research,vol. 165, no. 1, pp. 182–206, 2005.[20] J. M. Mulvey, R. J. Van<strong>de</strong>rbei, and S. A. Zenios, “Robustoptimization of large–scale systems,” Operations Research,vol. 43, no. 2, pp. 264–281, 1995.[21] H. Ramalhinho-Lourenço, “Supply chain management: Anopportunity for metaheuristics,” Department of Economicsand Business, Universitat Pompeu Fabra, Economics WorkingPapers 538, 2001.[22] E. Caballero, J. Molina, and R. V., MOAMP: ProgramaciónMultiobjetivo mediante un procedimiento <strong>de</strong> búsque<strong>da</strong> tabú.Universi<strong>da</strong>d <strong>de</strong> Oviedo, España: Actas <strong>de</strong>l II Congreso Español<strong>de</strong> Metaheurísticas y Algoritmos Evolutivos y Bioinspirados:MAEB, 2003, pp. 153–159.[23] S. Martello and P. Toth, Knapsack problems: algorithms andcomputer implementations. John Wiley & Sons, 1990.ALIO-EURO <strong>2011</strong> – 191


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>A Tabu Search Approach for the Hybrid Flow ShopNicolau Santos ∗João Pedro Pedroso ∗∗ INESC Porto and <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> Ciências, Universi<strong>da</strong><strong>de</strong> do PortoRua do Campo Alegre, 4169-007 Porto, Portugalnsantos@inescporto.pt, jpp@fc.up.ptABSTRACTIn this work we present a metaheuristic based on tabu search, <strong>de</strong>signedwith the objective of minimizing makespan in a hybridflow shop problem. In or<strong>de</strong>r to assess the performance of the proposedmethod we performed tests using both well known benchmarksand randomly generated instances; preliminary results indicatethat the approach is valid.Keywords: Scheduling, Metaheuristics, Flow Shop, CombinatorialOptimization1. INTRODUCTIONA Hybrid Flow Shop (HFS) consists of series of production stages,each of which has one or more machines operating in parallel; atleast one stage has multiple machines, and at least one job hasmore than one stage. HFS problems appear as a natural extensionof the traditional Flow Shop Problem. With the increasing complexityof mo<strong>de</strong>rn production systems, the introduction of parallelmachines, as well as additional constraints, are nowa<strong>da</strong>ys common.The HFS problem was initially stated in [1]; surveys ofproblems arising in this area and methods for solving them wasprovi<strong>de</strong>d in [2], [3], and more recently in [4].In the HFS, each job is processed by one machine in each stage,and it must go through one or more stages. It can be <strong>de</strong>fined asfollows: there is a set of n jobs to be processed in m stages; alljobs have the same production direction, from stage 1 to stage m,and the production times t ik of job i at stage k, are known. In thispaper, we make the following further assumptions:• setup and transportation times between stages are negligible;• there are buffers with infinite capacity between stages;• each machine can only process a job at a given time;• a job can only be processed by a single machine at eachstage;• job preemption is not allowed.While many objective functions are consi<strong>de</strong>red in the literature, wewill focus on makespan minimization.Figure 1 shows an example of an HFS with two machines at thefirst and second stages, and three machines at the third stage.The numerous practical applications of HFS have attracted manyresearchers, and many approaches have been <strong>de</strong>veloped, from simpledispatching heuristics to exact methods. Regarding metaheuristics,there are two common exploration strategies: the first is tofind the best job/machine association at each stage, as in [5], thesecond—the one we will use in this study—is to consi<strong>de</strong>r permutationschedules as in [6]. The main i<strong>de</strong>a is to generate a permutationthat <strong>de</strong>fines the job or<strong>de</strong>r at the first stage; in the subsequentFigure 1: Hybrid Flow Shop example.stages, jobs join a queue and are loa<strong>de</strong>d to the machines in FIFOor<strong>de</strong>r, being assigned to the first available machine. Although thisapproach may fail to find the optimal association of jobs to stagemachines, it is one of the most wi<strong>de</strong>ly used in practice [7], as itprevents stock accumulation between stages, and naturally keepsthe work in process at low levels, two important requirements inmo<strong>de</strong>rn production systems.Figure 2 presents an example of a seven job Gantt chart for theHFS presented in Figure 1. Notice that there is an exchange inthe production or<strong>de</strong>r of jobs six and seven from the first to thesecond stage; this is due to different or<strong>de</strong>rs of arrival to the queuesbetween those stages.Figure 2: Gantt chart example.2. TABU SEARCHTabu Search was proposed by Glover [8] as a method to gui<strong>de</strong>heuristics through the solution space. Its main characteristic is theavoi<strong>da</strong>nce of being trapped in local optima, through the use of atabu list: a recent memory record that prevents the repetition ofmoves as long as they are kept in the list. This avoids cyclingin many cases (<strong>de</strong>pending on the length of that list), leading thealgorithm to explore promising regions.Our implementation is based on the insertion neighborhood, andis relatively problem in<strong>de</strong>pen<strong>de</strong>nt; it can easily be a<strong>da</strong>pted to otherobjectives, if required.ALIO-EURO <strong>2011</strong> – 192


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>procedure tabuSearch(π init )π ∗ = π init # initialize best found solutionπ = π init # initialize incumbent solutionforall π k in π: tabu[π k ] = 0 # initialize tabu listiter = 0 # iteration counterwhile termination criteria not satisfied doiter = iter + 1select L non-tabu jobs of πlet r be a random integer, r min ≤ r ≤ r maxlet R be a set of r randomly chosen jobs of Levaluate N (R,π)let π ′ be the best neighbor found and π b ′ the job selected for insertionup<strong>da</strong>te solution and tabu listif ob j (π ′ ) < ob j (π ∗ ) thenπ ∗ = π ′forall π k in π ′ : tabu[π k ] = 0tabu[π b ] = iter + 1elselet t be a random integer, 1 ≤ t ≤ t maxtabu[π b ] = iter +tend ifend whilereturn π ∗end procedureFigure 3: Tabu Search pseudoco<strong>de</strong>2.1. Moves and NeighborhoodTabu search exploration is based on moving iteratively to a solutionin the neighborhood. In our algorithm we use insertion moves:given a permutation π and a pair of positions (i, j), i ≠ j, the permutationπ ′ obtained by removing job at position i and inserting itat position j is:π ′ = π 1 ,...,π i−1 ,π i+1 ,...,π j ,π i ,π j+1 ,...,π n if i < j;π ′ = π 1 ,...,π j−1 ,π i ,π j ,...,π i−1 ,π i+1 ,...,π n if j < i.Having a set of U jobs we <strong>de</strong>fine N (U,π) as the neighborhoodthat contains all the possible insertion moves of the jobs in U.2.2. Tabu list and search strategyIn our implementation the tabu list consists of an array: for eachjob i we assign a value tabu[i], at iteration iter we say that jobi is tabu if tabu[i] > iter. The job π b chosen to perform themove becomes tabu for t iterations, where 1 ≤ t ≤ t max . Hence,we up<strong>da</strong>te tabu[π b ] = iter + t, except when the best known solutionis improved; in that case, we set tabu[k ] = 0,∀k ≠ π b andtabu[π b ] = iter+1, in or<strong>de</strong>r to prevent immediate reversion. Evaluatingthe neighborhood generated by trying insertion among anyconsecutive pair of jobs is a <strong>de</strong>manding computational task, so wepropose a neighborhood restriction: instead of evaluating the completeneighborhood we evaluate a set of r randomly chosen jobs,with r min ≤ r ≤ r max .To illustrate the behaviour of the algorithm, let us consi<strong>de</strong>r theexample from Figure 2. Suppose that at a given iteration we haveas incumbent solution (1,2,3,4,5,6,7). The operations to performduring a tabu search iteration are the following:1. find the list of legal (non tabu moves) L; suppose we obtainL = (1,2,3,4);2. draw r to find the number of jobs to evaluate; suppose weobtain r = 2;3. randomly choose r jobs from L; suppose we choose jobs 1and 4 so R = (1,4);4. evaluate N (R,π), i.e., the permutations(2,1,3,4,5,6,7)(2,3,1,4,5,6,7)(2,3,4,1,5,6,7)(2,3,4,5,1,6,7)(2,3,4,5,6,1,7)(2,3,4,5,6,7,1)(4,1,2,3,5,6,7)(1,4,2,3,5,6,7)(1,2,4,3,5,6,7)(1,2,3,5,4,6,7)(1,2,3,5,6,4,7)(1,2,3,5,6,7,4).Then, we choose the permutation that yields the best objective asthe incumbent solution. Suppose the first permutation is chosen;in this case, set π b = 1. The final step is to up<strong>da</strong>te the tabu list, asstated previously.An extensive computational experiment is currenlty being conducted.The results of our method are being compared with thelower bound of [9] and known heuristics; though the results arepreliminary, quality of the proposed method is very promising,with respect to other results found in the literature. We also referthat establishing a direct comparison with results from otherauthors is very hard, as each reports results based on its own setof randomly generated instances. In table 1 we present some resultswith instances from [10], initialy proposed for the flow shopproblem and available in the Internet. Each instance as n jobs tobe processed on m stages, p is the number of parallel machinesintroduced at each stage. LB is the value of the lower bound of[9], B is the best makespan found by our method and AD is theaverage of the relative percentage error D, were D is calculated byEquation 1.D = heu sol − LB.100 (1)LBThe results are <strong>de</strong>rived from five runs on each instance with a runningtime of n.m.45 ms of CPU time in a computer AMD Athlon64 X2 Dual Core 3800+ with 2Gb of RAM running OS Mandriva2010 Free. For the problems with 5 and 10 stages we can observesmall values of AD, though they are slightly larger for instanceswith 20 stages. To our knowledge, this is the first time results forthis problem with 20 stages are presented.p=2 p=4inst n m LB B AD LB B ADta001 20 5 688 721 5.41 428 459 7.57ta011 10 885 1009 14.44 657 737 12.48ta021 20 1332 1578 19.11 1237 1261 2.10ta031 50 5 1395 1405 0.77 756 756 0.44ta041 10 1572 1713 9.66 931 1052 13.64ta051 20 2077 2430 17.59 1430 1658 16.58ta061 100 5 2766 2803 1.47 1443 1468 1.93ta071 10 2961 3058 3.71 1606 1737 8.61ta081 20 3202 3720 16.79 1948 2270 17.26ta091 200 10 5533 5603 1.61 2898 2966 3.22ta101 20 5756 6290 10.05 3167 3569 13.81ta111 500 20 13199 14246 8.25 6848 7666 12.30Table 1: Average results for 5 in<strong>de</strong>pen<strong>de</strong>nt runs on Taillard’sbenchmark3. ACKNOWLEDGEMENTSThe presented research was <strong>de</strong>veloped at INESC Porto un<strong>de</strong>r theEuropean Commission, Framework Programme 7 project FIT4U:Framework of Integrated Technologies for User Centred Products.4. REFERENCES[1] T. Arthanari and K. Ramamurthy, “An extension of two machinessequencing problem,” Opsearch, vol. 8, pp. 10–22,1971.ALIO-EURO <strong>2011</strong> – 193


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>[2] R. Linn and W. Zhang, “Hybrid flow shop scheduling: Asurvey,” Computers & Industrial Engineering, vol. 37, no. 1-2, pp. 57 – 61, 1999, proceedings of the 24th internationalconference on computers and industrial engineering.[3] H. Wang, “Flexible flow shop scheduling: optimum, heuristicsand artificial intelligence solutions,” Expert Systems,vol. 22, no. 2, pp. 78–85, 2005.[4] R. Ruiz and J. Vázquez-Rodríguez, “The hybrid flow shopscheduling problem,” European Journal of Operational Research,vol. 205, no. 1, pp. 1–18, 2010.[5] E. Nowicki and C. Smutnicki, “The flow shop with parallelmachines: a tabu search approach,” European Journal ofOperational Research, vol. 106, no. 2-3, pp. 226–253, 1998.[6] D. Santos, J. Hunsucker, and D. Deal, “FLOWMULT: Permutationsequences for flow shops with multiple processors,”Journal of Information and Optimization Sciences, vol. 16,pp. 351–366, 1995.[7] M. Pinedo, Scheduling: theory, algorithms, and systems.Springer Verlag, 2008.[8] F. Glover, “Future paths for integer programming and linksto artificial intelligence,” Computers & Operations Research,vol. 13, no. 5, pp. 533–549, 1986.[9] D. Santos, J. Hunsucker, and D. Deal, “Global lower boundsfor flow shops with multiple processors,” European Journalof Operational Research, vol. 80, no. 1, pp. 112–120, 1995.[10] E. Taillard, “Benchmarks for basic scheduling problems,”European Journal of Operational Research, vol. 64,no. 2, pp. 278 – 285, 1993, project Managementanf Scheduling. [Online]. Available: http://www.sciencedirect.com/science/article/B6VCT-48MYGV0-4W/2/9dd8e0f50213f3302f7ebf1a80dca3b7ALIO-EURO <strong>2011</strong> – 194


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Sequencing approaches in Synchronous ManufacturingJan Riezebos ∗∗ University of Groningen, The NetherlandsP.O.box 800, 9700AV Groningen, The Netherlandsj.riezebos@rug.nlABSTRACTWe consi<strong>de</strong>r a sequencing problem in a synchronized manufacturingenvironment. Or<strong>de</strong>r release is an essential part of this system.As or<strong>de</strong>rs may differ in the amount and distribution of their capacityrequirements over subsequent production stages, total capacityload may vary over time. We encountered this problem in a laborintensivecellular environment. In practice, heuristics are used tosolve this problem, but their effectiveness is questioned. This paperexamines heuristics that are based on insights from assemblysystem <strong>de</strong>sign and work load control. The heuristics are evaluatedin a rolling schedule environment.Keywords: Synchronous manufacturing, Bottleneck, Employeescheduling1. INTRODUCTIONThe basic i<strong>de</strong>a of synchronous manufacturing is to create a flow ofwork through the manufacturing system, either continuous or intermittent,in or<strong>de</strong>r to achieve short and constant throughput timesand a predictable loading of the resources in the system. Recently,fixed cycle-time synchronization approaches have been <strong>de</strong>velopedthat no longer implicitly assume an inflexible capacity. They consi<strong>de</strong>rtotal capacity to be limited, but capacity to be flexible betweenthe stages of the production system. The reason for relaxingthis assumption is that workers are nowa<strong>da</strong>ys increasingly multiskille<strong>da</strong>nd cross-trained. This flexibility makes it possible to handlecapacity fluctuations between stages in a production system.However, total capacity in terms of the number of workers availabledoes not increase through such measures. Therefore, thesesynchronization approaches aim at an or<strong>de</strong>r release <strong>de</strong>cision thateffectively uses the available capacity of the multi-skilled workerswhile still realizing a high output for the whole production system.Examples of papers in this area are [1], [2], [3], [4], and [5]. Thecomplexity of the resulting synchronization problems has been analyzedby [6]. They showed that most leveling problems in suchsystems are NP-complete, even if they only consist of two stages.Therefore, in practice heuristic solutions are being used to solvethe or<strong>de</strong>r release <strong>de</strong>cision, as the number of stages often is muchlarger than two.This paper discusses various single pass heuristics for the or<strong>de</strong>rrelease <strong>de</strong>cision in such a synchronization approach with a fixedcycle time in a multi-product multi-stage situation. Single passheuristics <strong>de</strong>termine a sequence without back tracking or pair wiseinterchanging parts of a solution. Such heuristics are often usedin practice. The question is whether these heuristics can be improvedby incorporating insights from related fields, such as workloadcontrol and assembly line balancing. This paper will examinethe performance of several heuristics in a rolling schedule environment.Testing in a rolling schedule environment provi<strong>de</strong>s betterinsights in the performance of these heuristics in the long term, asit prohibits the negative impact of postponing problems to the endof the cycle.2. PROBLEM DEFINITIONIn a synchronous manufacturing mo<strong>de</strong>, stages represent a subset ofoperations that are to be performed in a cell within a fixed periodof time. At the end of each period, all jobs that are in progressare transferred to their next stage. At such a transfer moment,employees may have to switch to other tasks within the cell.The cell starts each period with a new or<strong>de</strong>r that is selected from alist of or<strong>de</strong>rs that should be completed during the cycle (i.e., at theend of the week). The batch size, process plans, and work contentper stage may differ per or<strong>de</strong>r. Therefore, capacity requirementsmay vary strongly, both per or<strong>de</strong>r and per stage.An important problem faced in such synchronous manufacturingsystems concerns the capacity balance over time ([7]). A cellularsystem makes it less appropriate to vary the total number of employeesover time. Employees should feel themselves responsiblefor the whole task of the cell. A relatively constant number ofemployees over various periods is therefore preferred.Figure 1 presents a realistic example with 10 or<strong>de</strong>rs that have tobe released during one week. We have 10 or<strong>de</strong>rs (A,. . . ,J) and fivestages j=1,. . . ,5. An or<strong>de</strong>r that starts in period t=1 in stage j=1arrives in period t = 2 in stage j=2, and leaves the system at theend of period t=5, if stage j=5 has been finished. Or<strong>de</strong>rs are representedusing different sha<strong>de</strong>s. The amount of capacity required ina stage differs per or<strong>de</strong>r and is presented in the cells of the table. Arow shows the fluctuating capacity requirements of that stage. Or<strong>de</strong>rsmay also require a different total number of employees (i.e.the sum of the cells with i<strong>de</strong>ntical sha<strong>de</strong>). However, the sum ofthe cells in the same column is more important for the capacitymanagement of the firm. This shows the total number of employeesnee<strong>de</strong>d in a single period. If this number exceeds the availablecapacity, the firm has to hire additional employees or change thesequence of or<strong>de</strong>rs. The problem is to <strong>de</strong>termine one or more sequencesfor the or<strong>de</strong>rs A,. . . ,J such that the available capacity ineach period t is not excee<strong>de</strong>d as long as possible. For the firstfour periods, earlier <strong>de</strong>cisions on the sequence in a previous cycleaffect the loading of the available capacity. The cells at the bottomleftsi<strong>de</strong> express the capacity requirements of these already startedor<strong>de</strong>rs that still have to complete one or more stages. The sameeffect appears at the end of the cycle, as the last four or<strong>de</strong>rs in thesequence affect not only capacity in this cycle, but also capacityrequirements in the next cycle. Here the end-of-horizon effect ortruncated-horizon effect appears (see e.g. [8]). The loading shouldresult in a sequence for which the available capacity per periodis not excee<strong>de</strong>d. The sequence presented in Figure 1 results infour periods that encounter a capacity shortage if available capacityequals 20 per period: t=2,6,9,10. Can a better or<strong>de</strong>r sequencebe found?ALIO-EURO <strong>2011</strong> – 195


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Figure 1: Sequencing 10 products, 5 stages, period length 4 hours, cycle length one week3. MATHEMATICAL PROBLEM FORMULATIONThe problem can be formulated as:Given:• a set of i=1,. . . ,n or<strong>de</strong>rs that have to start during periodst=1,. . . ,n;• their capacity requirements C i j during j=1,. . . ,m stages ofcompletion;• the capacity requirements CR t j for or<strong>de</strong>rs that have alreadystarted but are not yet completed;• average required capacity in stage j (j=1,. . . ,m-1)ARC j = ∑ j−1t=1 CR t j + ∑ n i=1 C i j,n + j − 1• capacity requirements to complete stage j in period t Y t j(t=1,. . . ,n+m-1; j=1,. . . ,m);• available capacity per period AC t ,• capacity shortage (expected) in period tCS t = max(0,∑ m j=1 Y t j − AC t ) (t=1,. . . ,n+m-1)• weight factor for capacity shortage in period t w t(w t = 1 if t = 1,...,n; w t ≤ 1 if t > n (t=1,. . . ,n+m-1))<strong>de</strong>termine the sequence of the n or<strong>de</strong>rs X it (i = 1,...,n;t = 1,...,n)such that ∑ n+m−1t=1 w t CS t is minimized, whereX it = 1 if or<strong>de</strong>r i starts in period t, else 0 (i=1,. . . ,n; t=1,. . . ,n)The problem can mathematically be formulated as:such thatn+m−1Minimize ∑ w t ·CS t (1)t=1n∑ X it = 1∀t = 1,...,n (2)i=1n∑ X it = 1∀i = 1,...,n (3)i=1Y t j = CR t j ∀ j = 2,...,m∀t = 1,..., j − 1 (4)Y t j = ARC j ∀ j = 1,...,t − n∀t = n + 1,...,n + m − 1 (5)nY t+ j−1, j = ∑ X it ·C i j ∀t = 1,...,n∀ j = 1,...,m (6)i=1m∑ Y t j −CS t


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Figure 2: Optimal solution compared with heuristicsThe results show that in 73% of the cases, FillCapBottleneck outperformsFillCap, while in 18% of the cases FillCap was better.However, the gap with the optimal solution remains very high,making it attractive to extend the search or apply the optimal solutionmethod if time allows. Calculation time of the optimal solutionrapidly increases with the number of products (0.4 sec for 10-products, 1.6 hours for 40-products). See Figure 3 for an overview.[2] G. L. Vairaktarakis, X. Cai, and C.-Y. Lee, “Workforce planningin synchronous production systems,” European Journalof Operational Research, vol. 136, pp. 551–572, 2002.[3] M. Gronalt and R. Hartl, “Workforce planning and allocationfor mid-volume truck manufacturing: a case study,” InternationalJournal of Production Research, vol. 41, pp. 449–463,2003.[4] J. Bukchin and M. Masin, “Multi-objective <strong>de</strong>sign of team oriente<strong>da</strong>ssembly systems,” European Journal of OperationalResearch, vol. 156, pp. 326–352, 2004.[5] E. Cevikcan, M. B. Durmusoglu, and M. E. Unal, “A teamoriented<strong>de</strong>sign methodology for mixed mo<strong>de</strong>l assembly systems,”Comput. Ind. Eng., vol. 56, pp. 576–599, 2009.[6] G. L. Vairaktarakis and X. Cai, “Complexity of workforcescheduling in transfer lines,” J. of Global Optimization,vol. 27, pp. 273–291, 2003.[7] V. I. Cesaní and H. J. Steu<strong>de</strong>l, “A study of labor assignmentflexibility in cellular manufacturing systems,” Comput. Ind.Eng., vol. 48, pp. 571–591, 2005.[8] H. Stadtler, “Improved rolling schedules for the dynamicsingle-level lot-sizing problem,” Manage. Sci., vol. 46, pp.318–326, 2000.[9] J. Riezebos, “Or<strong>de</strong>r sequencing and capacity balancing in synchronousmanufacturing,” International Journal of ProductionResearch, vol. 49, pp. 531–552, <strong>2011</strong>.8. APPENDIXFigure 3: Execution time optimal solution and heuristics6. CONCLUSIONSThe concept of synchronous manufacturing aims at achieving shortand reliable throughput times through the introduction of fixedtransfer moments between several stages of production. This causesa loading of the resources that can be predicted in advance, whichmakes it easier for planners to do their job.We <strong>de</strong>veloped optimal and heuristic solution approaches for thismulti-product multi-stage problem. The FillCap heuristic focuseson maximum utilization of the first stage. It selects an or<strong>de</strong>r thatconsumes as much capacity in the first stage as possible, whileFillCapBottleneck focuses on loading the bottleneck stage.Future research should verify if other heuristics, such as geneticalgorithms, can be applied as well. The current study has applied<strong>de</strong>terministic optimization in a rolling schedule horizon. We thinkthat the use of a rolling schedule has important benefits when testingthe performance of different solution approaches.7. REFERENCES[1] C.-Y. Lee and G. L. Vairaktarakis, “Workforce planning inmixed mo<strong>de</strong>l transfer lines,” Operations Research, vol. 45, pp.553–567, 1997.Figure 4: FillCap and FillCapBottleneck heuristicsALIO-EURO <strong>2011</strong> – 197


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Affine recourse for the robust network <strong>de</strong>sign problem: between static anddynamic routingMichael Poss ∗ Christian Raack ∗ †∗ Department of Computer Science, Faculté <strong>de</strong>s SciencesUniversité Libre <strong>de</strong> Bruxelles, Brussels, Belgiummposs@ulb.ac.be† Zuse Institute Berlin (ZIB)Takustr. 7, D-14195 Berlin, Germanyraack@zib.<strong>de</strong>ABSTRACTAffinely-Adjustable Robust Counterparts are used to provi<strong>de</strong>tractable alternatives to (two-stage) robust programs with arbitraryrecourse. We apply them to robust network <strong>de</strong>sign with polyhedral<strong>de</strong>mand uncertainty, introducing the affine routing principle. Wecompare the affine routing to the well-studied static and dynamicrouting schemes for robust network <strong>de</strong>sign. It is shown that affinerouting can be seen as a generalization of the wi<strong>de</strong>ly used staticrouting still being tractable and providing cheaper solutions. Weinvestigate properties on the <strong>de</strong>mand polytope un<strong>de</strong>r which affineroutings reduce to static routings and also <strong>de</strong>velop conditions onthe uncertainty set leading to dynamic routings being affine. Weshow however that affine routings suffer from the drawback that(even strongly) dominated <strong>de</strong>mand vectors are not necessarily supportedby affine solutions. The proofs and computational resultsare not presented due to the space restriction.Keywords: Robust optimization, Network <strong>de</strong>sign, Recourse, AffineAdjustable Robust Counterparts, Demand polytope1. INTRODUCTIONIn the classical <strong>de</strong>terministic network <strong>de</strong>sign problem, a set ofpoint-to point commodities with known <strong>de</strong>mand values is given,and capacities have to be installed on the network links at minimumcost such that the resulting capacitated network is able toaccommo<strong>da</strong>te all <strong>de</strong>mands simultaneously by a multi-commodityflow. In practice however, exact <strong>de</strong>mand values are usually notknown at the time the <strong>de</strong>sign <strong>de</strong>cisions must be ma<strong>de</strong>. Robust optimizationovercomes this problem by explicitly taking into accountthe uncertainty of the <strong>da</strong>ta introducing so-called uncertainty sets.A solution is said to be feasible if it is feasible for all realizationsof the <strong>da</strong>ta in a pre<strong>de</strong>termined uncertainty set D [1]. Introducingeven more flexibility, two-stage robust optimization allows toadjust a subset of the problem variables only after observing theactual realization of the <strong>da</strong>ta [2]. In fact, it is natural to apply thistwo-stage approach to network <strong>de</strong>sign since very often first stagecapacity <strong>de</strong>sign <strong>de</strong>cisions are ma<strong>de</strong> in the long term while the actualrouting is adjusted based on observed user <strong>de</strong>mands. Thissecond stage adjusting procedure is called recourse which in thecontext of network <strong>de</strong>sign relates to what is known as traffic engineering.Unrestricted second stage recourse in robust network<strong>de</strong>sign is called dynamic routing, see [3]. Given a fixed <strong>de</strong>sign, thecommodity routing can be changed arbitrarily for every realizationof the <strong>de</strong>mands. In [3] it is shown that allowing for dynamic routingmakes robust network <strong>de</strong>sign intractable. Already <strong>de</strong>cidingwhether or not a fixed capacity <strong>de</strong>sign allows for a dynamic routingof <strong>de</strong>mands in a given polytope is N P-complete (on directedgraphs).This paper is motivated by the scarcity of works using affine routing.Following [2], we introduce affine routing as a generalizationof static routing allowing for more routing flexibility but stillyielding polynomially solvable robust counterparts, (in oppositionto the schemes from [4] and [5]). In this context affine routing provi<strong>de</strong>sa tractable alternative in between static and dynamic routing.Affine routing has been used implicitly already in [6] for a robustnetwork <strong>de</strong>sign problem with a particular uncertainty set. The contributionsof this paper consist of a theoretical and empirical studyof network <strong>de</strong>sign un<strong>de</strong>r the affine routing principle for generalpolyhedral <strong>de</strong>mand uncertainty sets D. Section 2 introduces themathematical mo<strong>de</strong>ls and <strong>de</strong>fines formally static, affine and dynamicroutings. In Section 3 we present our main results. Proofsare ommited due to space restrictions. We also conducted numericalcomparisons of static, affine and dynamic routings, which arenot presented due to space restrictions.2. ROBUST NETWORK DESIGN WITH RECOURSEWe are given a directed graph G = (V,A) and a set of commoditiesK. A commodity k ∈ K has source s(k) ∈ V , <strong>de</strong>stination t(k) ∈V , and <strong>de</strong>mand value d k ≥ 0. A flow for k is a vector f k ∈ R A +satisfying:∑a∈δ + (v)∑fa k − fa k = d k ψ vk for all v ∈ V, (1)a∈δ − (v)where δ + (v) and δ − (v) <strong>de</strong>note the set of outgoing arcs and incomingarcs at no<strong>de</strong> v, respectively. For no<strong>de</strong> v ∈ V and commodityk ∈ K we set ψ vk := 1 if v = s(k), ψ vk := −1 if v = t(k), andψ vk := 0 else. Flows are non-negative. A multi-commodity flow isa collection of flows, one for each commodity in K. A circulation(or cycle-flow) is a vector g ∈ R A satisfying∑ g a − ∑ g a = 0 for all v ∈ V. (2)a∈δ + (v) a∈δ − (v)A circulation is not necessarily non-negative. A value g a < 0 canbe seen as a flow from the head of arc a to its tail. We call a circulationg non-negative if g ≥ 0 and positive if additionally g ≠ 0.Notice that any two flows ˆf k , f k for k only differ by a circulation,that is, there always exists a circulation g such that ˆf k = f k + g.In many practical situations, the <strong>de</strong>mand vector d ∈ R K + is uncertain.In the sequel we assume that d ∈ D ⊂ R K with D beinga polytope. Any d ∈ D is said to be a realization of the <strong>de</strong>mand.A routing is a function f : D → R A×K+ that assigns a multicommodityflow to every realization of the <strong>de</strong>mand. We say thatALIO-EURO <strong>2011</strong> – 198


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>f serves D and call f a dynamic routing if there is no further restrictionon the routing. A capacity allocation x ∈ R A + is said tosupport the set D if there exists a routing f serving D such thatfor every d ∈ D the corresponding multi-commodity flow f (d) isnot exceeding the arc-capacities given by x. Robust network <strong>de</strong>signnow aims at providing the cost minimal capacity allocationsupporting D. In this respect, robust network <strong>de</strong>sign is a two-stagerobust program with recourse, following the more general framework<strong>de</strong>scribed by [2]. The capacity <strong>de</strong>sign has to be fixed in thefirst stage and observing a <strong>de</strong>mand realization d ∈ D, we are allowedto adjust the routing f (d) in the second stage. The problemcan be written as the following (semi-infinite) linear program:(RND)min ∑ κ a x aa∈A∑a∈δ + (v)fa k (d) − ∑a∈δ − (v)fa k (d) = d k ψ vk ,∑ fa k (d) ≤ x a ,k∈Kf k a (d) ≥ 0,x a ≥ 0, a ∈ A,v ∈ V,k ∈ K,d ∈ Da ∈ A,d ∈ D(3)(4)a ∈ A,k ∈ K,d ∈ D(5)where κ a ∈ R + is the cost for installing one unit of capacity on arca ∈ A. As already mentioned, <strong>de</strong>ciding whether or not a given capacityvector x supports D is N P-complete for general polytopesD [3]. It follows that (unless P = N P) it is impossible to <strong>de</strong>rivea compact formulation for (RND) with dynamic routing. Using abranch-and-cut approach based on Ben<strong>de</strong>rs <strong>de</strong>composition, Mattia[7] shows how the solve the N P-hard separation problem for robustmetric inequalities using bilevel and mixed integer programs.Most authors ([8, 9, 10], among others) use a simpler version of(RND) introducing a restriction on the second stage recourse knownas static routing (also called oblivious routing). Each componentf k : D → R A + is forced to be a linear function of d k :f k a (d) := y k ad k a ∈ A,k ∈ K,d ∈ D. (6)Notice that by (6) the flow for k is not changing if we perturb the<strong>de</strong>mand for h ≠ k. By combining (6) and (3) it follows that themultipliers y ∈ R+ A×K <strong>de</strong>fine a multi-commodity (percentage) flow.For every k ∈ K, the vector y k ∈ R A + satisfies (1) setting d k = 1.The flow y is called a routing template since it <strong>de</strong>ci<strong>de</strong>s, for everycommodity, which paths are used to route the <strong>de</strong>mand and what isthe percental splitting among these paths. The routing template hasto be used by all <strong>de</strong>mand scenarios d ∈ D un<strong>de</strong>r the static routingscheme.Ben-Tal et al. [2] introduce Affine Adjustable Robust Counterpartsrestricting the recourse to be an affine function of the uncertainties.Applying this framework to (RND) means restricting f k to be anaffine function of all components of d givingfa k (d) := fa0k + ∑ y kha d h ≥ 0, a ∈ A,k ∈ K,d ∈ D, (7)h∈Kwhere fa 0k ,y kh a ∈ R for all a ∈ A,k,h ∈ K, also see [6]. In whatfollows, a routing f serving D and satisfying (7) for some vectorsf 0 and y is called affine. We see immediately that static routingcan be obtained from (7) by imposing fa0k = 0 and y kh a = 0 foreach a ∈ A and all k,h ∈ K with k ≠ h. In this context affine routinggeneralizes static routing allowing for more flexibility in reactingto <strong>de</strong>mand fluctuations, but it is not as flexible as dynamic routing.Formally it holdsopt dyn ≤ opt a f f ≤ opt stat ,where opt dyn , opt a f f , and opt stat <strong>de</strong>note the cost values of the optimalsolution to (RND) where f is allowed to be dynamic, affine,or static, respectively. Note that there is a proven (tight) worstcaseoptimality gap of O(log|V |) between the dynamic and staticrouting principle, see [11]. In this paper we do not establish optimalitygaps between the three routing principles. We rather focuson studying properties of the <strong>de</strong>mand scenarios D that either yieldopt stat = opt a f f or opt a f f = opt dyn .Given a <strong>de</strong>mand polytope D, a static routing f is completely <strong>de</strong>scribedby the vector y ∈ R A×K+ . Similarly, an affine routing iscompletely <strong>de</strong>scribed by fixing the vectors f 0 ∈ R A×K and y ∈R A×K×K . Extending the previous <strong>de</strong>finitions, any routing templatey ∈ R A×K+ is said to serve D if it yields a (static) routing f servingD. Similarly, any pair of vectors f 0 ∈ R A×K and y ∈ R A×K×K thatsatisfies (3) and (7) are said to serve D. Given a capacity allocationx ∈ R A +, the pair (x,y) with y serving D, or the triplet (x, f 0 ,y)with ( f 0 ,y) serving D are said to support D if the correspondingroutings satisfy (4).The mo<strong>de</strong>l (RND) contains an infinite number of inequalities. However,when D is convex, we can replace D by the set of its extremepoints, which is finite whenever D is a polytope.Lemma 1. Let D ⊂ R K be a boun<strong>de</strong>d set and x be a capacityallocation x ∈ R A .(a) x supports D if and only if x supports conv(D).(b) (x,y) supports D if and only if (x,y) supports conv(D).(c) (x, f 0 ,y) supports D if and only if (x, f 0 ,y) supports conv(D).Hence (RND) can be discretized by restricting the mo<strong>de</strong>l to theextreme <strong>de</strong>mand scenarios that correspond to vertices of D (for allthree routing schemes).3. AFFINE ROUTINGSIn this section, we study properties and consequences of the affinerouting principle. Using (7) and substituting the flow variables inthe balance constraints (3) it can be seen that affine routing has anice interpretation as paths and cycles:Lemma 2. Let D be a <strong>de</strong>mand polytope and let ( f 0 ,y) ∈ R A×K ×R A×K×K be an affine routing serving D. If D is full-dimensional,then y kk ∈ R A is a routing template for k ∈ K and f 0k ∈ R A ,y kh ∈R A are circulations for every k,h ∈ K with k ≠ h.Just like in the static case, the flow for commodity k changes linearlywith d k on the paths <strong>de</strong>scribed by the template y kk a . However,the flow for commodity k may change also if the <strong>de</strong>mand for h ≠ kchanges which is <strong>de</strong>scribed by circulations y kh . In addition thereis a constant circulation shift <strong>de</strong>scribed by variables f 0k .As already mentioned, a dynamic routing for commodity k coul<strong>da</strong>lso be <strong>de</strong>scribed by one (representative) routing and circulations<strong>de</strong>pending on the <strong>de</strong>mand fluctuations. In the dynamic case however,the circulations can be chosen arbitrarily while in the affinecase the actual flow changes according to (7). We illustrate thisconcept in Example 1 which shows that affine routing can be asgood as dynamic routing in terms of the cost for capacity allocationand that f 0 and y kh may not <strong>de</strong>scribe circulations when D isnot full-dimensionalExample 1. Consi<strong>de</strong>r the network <strong>de</strong>sign problem for the graph<strong>de</strong>picted in Figure 1(a) with two commodities k 1 : a → b and k 2 :a → c. The uncertainty set D is <strong>de</strong>fined by the extreme pointsd 1 = (2,1),d 2 = (1,2) and d 3 = (1,1), and the capacity unitarycosts are the edge labels of Figure 1(a). Edge labels from Figure1(b) and 1(c) represent optimal capacity allocations with staticALIO-EURO <strong>2011</strong> – 199


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>aa322bc(a) edge costs1/32/3(d) y k 1k 12/3bcaa22(b) static1/31/3(e) y k 1k 21/3bcbcaa121(c) dynamic1(f) y k 2k 2Figure 1: Static, dynamic, and affine recourse.and dynamic routing, respectively. They have costs of 10 and 9,respectively. Then, Figure 1(d)-1(f) <strong>de</strong>scribes coefficients y kh foran affine routing feasible for the capacity allocation 1(c). If weremove d 3 = (1,1) from the set of extreme points, the dimensionof the uncertainty set reduces to 1. The affine routing prescribedby y k 2k 2ac = 1, f 0k 1ab = 3 and yk 1k 2ab = −1 serves all <strong>de</strong>mands in theconvex hull of d 1 = (2,1) and d 2 = (1,2) but f 0k 1and y k 1k 2do not<strong>de</strong>scribe a circulation.Compact reformulations In the following, we assume that D isfull-dimensional. If the number of vertices of D is polynomial inthe number of no<strong>de</strong>s, arcs, and commodities then mo<strong>de</strong>l (RND)can be written in a compact way for all three routing schemes, thatis, it can be written with a polynomial number of variables andconstraints. However, even if the number of vertices is exponentialthere are compact reformulations for (RND) with static or affinerouting as long as D is compact. Reformulating by dualizing constraintsis a stan<strong>da</strong>rd technique in robust optimization resulting inso-called robust counterparts, see for instance [12]. Applying thistechnique to (RND) with affine routing yields the following result.Proposition 3. Consi<strong>de</strong>r (RND) with affine routing for a fulldimensionaluncertainty polytope D. If D has polynomial manyvertices or can be <strong>de</strong>scribed by a polynomial number of inequalitiesthen (RND) can be solved in polynomial time.bcbcRelation to static routing Notice that if a flow f k for k containsa positive circulation, that is, there exists a positive circulationg such that f k − g is a flow for k then f k can be reduced tof k − g without changing the flow balance at s(k) and t(k). Moreover,the percental splitting among the used paths is unchanged.In this spirit we call any routing f cycle-free if for all d ∈ D an<strong>da</strong>ll commodities k ∈ K the commodity flows do not contain positivecirculations. Of course every optimal capacity allocation hasa cycle-free (static, affine, or dynamic) routing.Notice that if a flow f k for k contains a positive circulation, that is,there exists a positive circulation g such that f k − g is a flow for kthen f k can be reduced to f k −g without changing the flow balanceat s(k) and t(k). Moreover, the percental splitting among the usedpaths is unchanged. In this spirit we call any routing f cycle-freeif for all d ∈ D and all commodities k ∈ K the commodity flows donot contain positive circulations. Of course every optimal capacityallocation has a cycle-free (static, affine, or dynamic) routing.Let e k be the k-th unit vector in R K + and D0 k be the set obtained fromD by removing d ∈ D with d k > 0, that is, D0 k := {d ∈ D : dk = 0}.We can prove the following:Proposition 5. Let D be a <strong>de</strong>mand polytope. If 0 ∈ D and foreach k ∈ K there is ε k > 0 such that ε k e k ∈ D, then all cycle-freeaffine routings serving D are static.Proposition 6. Let D be a <strong>de</strong>mand polytope and let G be acyclic.If dim(D0 k ) = |K| − 1 for all k ∈ K, then all cycle-free affine routingsserving D are static.Theorem 7. Let D be a <strong>de</strong>mand polytope. If all cycle-free affineroutings serving D are static then dim(D0 k ) = |K|−1 for all k ∈ K.Combining Proposition 6 with Theorem 7, we have complectly <strong>de</strong>scribedpolytopes for which cycle-free affine routings and staticroutings are equivalent, assuming that G is acyclic. However,Proposition 6 is wrong for general graphs because f k (d) for d ∈D0 k is not necessarily equal to 0, it can also be a positive circulation.Then, one can check that, when G has the required structure,a positive circulation can be <strong>de</strong>composed into circulations that arenot positive, thus yielding a cycle-free affine routing and a counterexampleto Proposition 6.Proposition 3 implies that given a capacity allocation x, the existenceof an affine routing can be answered in polynomial time aslong as D can be <strong>de</strong>scribed in a compact way, which is also true inthe static case but is in contrast to the N P-complete results fordynamic routing [3].Domination of <strong>de</strong>mands For static and dynamic routings, notall extreme points of D have to be consi<strong>de</strong>red in a discretizationof D. For instance, if 0 ∈ D, it is an extreme point of D that anycapacity allocation using static (resp. dynamic) routing supports.This intuitive i<strong>de</strong>a has been formalized by Oriolo [13] introducingthe concept of domination. Given two <strong>de</strong>mands vectors d 1 andd 2 , we say that d 1 dominates d 2 if any capacity allocation x ∈ R A +supporting d 1 also supports d 2 (dynamic routing). Moreover, d 1totally dominates d 2 if any pair (x,y) supporting d 1 also supportsd 2 (static routing). Thus, removing dominated (extreme) pointsfrom D is not changing the problem in the static and the in dynamiccase.For general affine routings, however, there is no notion of dominationof <strong>de</strong>mands:Proposition 4. Let d 1 ,d 2 ∈ R K +, d 1 ≠ d 2 . There exists (x, f 0 ,y)that supports d 1 but does not support d 2 .Relation to dynamic routing Theorem 5 i<strong>de</strong>ntifies <strong>de</strong>mand polytopesfor which affine routing is no better than static routing. However,we saw in Example 1 that affine routing may also perform aswell as dynamic routing does, yielding strictly cheaper capacityallocations. For general robust optimization problems, [14] showthat affine recourse is equivalent to dynamic recourse when D isa simplex. Here we show that in the context of robust network<strong>de</strong>sign this condition is also necessary.Theorem 8. Given a <strong>de</strong>mand polytope D, all dynamic routingsserving D are affine routings if and only if D is a simplex.Example 2 shows that when D is not a simplex and does not containthe origin, capacity allocation costs required by static, affine,and dynamic routings can be strictly different.Example 2. Consi<strong>de</strong>r the network <strong>de</strong>sign problem from Example1 with the uncertainty set D <strong>de</strong>fined by the extreme pointsd 1 = (3,0), d 2 = (0,3), d 3 = (2,2) and d 4 = (0.5,0.5). The optimalcapacity allocation costs with static, affine, and dynamic routingsare, respectively, 13 + 2 1 ,13 + 1 3 , and 13. Notice that movingd 4 along the segment (0,0) − (1,1) leaves static and dynamic optimalcapacity allocations unchanged while the affine solution costmoves between 13 and 13 + 1 2 . In particular, if d 4 is set to (0,0),the affine and static costs are the same, which we knew alreadyfrom Theorem 5. If d 4 is in conv{d 1 ,d 2 ,d 3 ,(1,1)}, the affine anddynamic cots are the same.ALIO-EURO <strong>2011</strong> – 200


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>4. REFERENCES[1] A. Ben-Tal and A. Nemirovski, “Robust solutions of linearprogramming problems contaminated with uncertain <strong>da</strong>ta,”Mathematical Programming, vol. 88, pp. 411–424, 2000.[2] A. Ben-Tal, A. Goryashko, E. Guslitzer, and A. Nemirovski,“Adjustable robust solutions of uncertain linear programs,”Mathematical Programming, vol. 99, no. 2, pp. 351–376,2004.[3] C. Chekuri, F. B. Shepherd, G. Oriolo, and M. G. Scutellà,“Hardness of robust network <strong>de</strong>sign,” Networks, vol. 50,no. 1, pp. 50–54, 2007.[4] W. Ben-Ameur, “Between fully dynamic routing and robuststable routing,” in 6th International Workshop on Designand Reliable Communication Networks, 2007. DRCN 2007,2007.[5] M. G. Scutellà, “On improving optimal oblivious routing,”Operations Research Letters, vol. 37, no. 3, pp. 197–200,2009.[6] A. Ouorou and J.-P. Vial, “A mo<strong>de</strong>l for robust capacity planningfor telecommunications networks un<strong>de</strong>r <strong>de</strong>mand uncertainty,”in 6th International Workshop on Design and ReliableCommunication Networks, 2007. DRCN 2007, 2007, pp.1–4.[7] S. Mattia, “The robust network loading problem withdynamic routing,” La Sapienza, University of Rome,Tech. Rep. Vol 2, n. 3, 2010. [Online]. Available:http://ojs.uniroma1.it/in<strong>de</strong>x.php/DIS_TechnicalReports[8] W. Ben-Ameur and H. Kerivin, “Routing of uncertain <strong>de</strong>mands,”Optimization and Engineering, vol. 3, pp. 283–313,2005.[9] A. Altin, E. Amaldi, P. Belotti, and M. Ç. Pinar, “Provisioningvirtual private networks un<strong>de</strong>r traffic uncertainty,” Networks,vol. 49, no. 1, pp. 100–115, 2007.[10] A. M. C. A. Koster, M. Kutschka, and C. Raack, “Towardsrobust network <strong>de</strong>sign using integer linear programmingtechniques,” in <strong>Proceedings</strong> of the NGI 2010. Paris,France: Next Generation Internet, Jun 2010.[11] N. Goyal, N. Olver, and F. B. Shepherd, “Dynamic vs. obliviousrouting in network <strong>de</strong>sign,” in <strong>Proceedings</strong> of the ESA2009, 2009, pp. 277–288.[12] D. Bertsimas and M. Sim, “The Price of Robustness,” OperationsResearch, vol. 52, no. 1, pp. 35–53, Jan 2004.[13] G. Oriolo, “Domination between traffic matrices,” Mathematicsof Operations Research, vol. 33, no. 1, pp. 91–96,2008.[14] D. Bertsimas and V. Goyal, “On the power and limitationsof affine policies in two-stage a<strong>da</strong>ptive optimization,”Columbia University, USA, Tech. Rep., 2009. [Online].Available: http://www.columbia.edu/~vg2277/ALIO-EURO <strong>2011</strong> – 201


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Solving a Hub Location Problem by the Hyperbolic Smoothing ApproachAdilson Elias Xavier ∗ Claudio Martagão Gesteira ∗ Henrique Pacca Loureiro Luna †∗ Fe<strong>de</strong>ral University of Rio <strong>de</strong> Janeiro - BrazilRio <strong>de</strong> Janeiro, Brazil{adilson, gesteira}@cos.ufrj.br† Fe<strong>de</strong>ral University of Alagoas - BrazilMaceio, Brazilpacca@ic.ufal.brABSTRACTHub-and-spoke (HS) network <strong>de</strong>signs arise in transportation andtelecommunications systems, where one must flow commoditiesamong spatially separate points and where scale economies can beattained through the shared use of high capacity links. As an alternativefor the discrete approach of selecting as hubs a subset of theexisting no<strong>de</strong>s, this paper explores the possibility of a continuouslocation for the hubs. Therefore, the problem is to find the leastexpensive HS network, continuously locating hubs and assigningtraffic to them, given the <strong>de</strong>mands between each origin-<strong>de</strong>stinationpair and the respective transportation costs. The problem leads to amin − sum − min formulation that is strongly non-differentiable.The proposed method overcomes this difficulty with a smoothingstrategy that uses a special differentiable function. The approachis a particular application of the hyperbolic smoothing technique,which has been proven to be able to solve quite efficiently largeinstances of clustering problems. The final solution is obtainedby solving a sequence of differentiable unconstrained optimizationsubproblems which gradually approach the original problem. Themost important feature of the methodology is the low dimensionof the subproblems, <strong>de</strong>pen<strong>de</strong>nt only on the number of hubs. Theefficiency of the method is shown through a set of computationalexperiments with large continuous hub-and-spoke problems.Keywords: Hub Location, Min-Sum-Min Problems, Global Optimization,Non-differentiable Programming, Hyperbolic Smoothing1. INTRODUCTIONThe hierarchical organization of telecommunication and transportationsystems can be found in several real world applications, suchas the location of switching centers or postal offices, and plays amajor role in operations research and management science mo<strong>de</strong>ls.Cost minimization is the objective of most of these mo<strong>de</strong>ls and optimizedlevels of customer concentrations enables the economiesof scale of aggregating the flows in the related networks. The maindifferences among the mo<strong>de</strong>ls concern the hierarchical level of network<strong>de</strong>sign, typically backbone versus local access network, andhow the relevant aspects of connectivity, capacity, reliability, <strong>de</strong>mandpatterns, routing, pricing, performance and quality of serviceare consi<strong>de</strong>red for such networks[1, 2, 3]. Depending on thecontext or application, hub no<strong>de</strong>s are called switches, warehouses,facilities, concentrators or access points. Likewise, backbonesmay be referred to as hub-level networks and local access networksmay be called tributary networks or many other names. Normally,backbone links carry larger volumes of traffic than tributary links.Traffic originating at a specific customer location can pass througha local access network to get to one or more hub no<strong>de</strong>s, <strong>de</strong>pendingon whether single or multiple assignmenst are consi<strong>de</strong>red to linkthe backbone to the remote locations. After passing through thebackbone network, the traffic again uses a local access network totravel from a hub to its final <strong>de</strong>stination at another location.2. THE CONTINUOUS HUB-AND-SPOKE PROBLEMSPECIFICATIONThe continuous hub-and-spoke problem consists of locating a setof q centers or hubs in or<strong>de</strong>r to minimize a particular trans- portationcost function. To formulate this problem, we proceed asfollows. Let S = {s 1 ,...,s m } <strong>de</strong>note a given set of m cities orpoints in a planar region. Let d jl be the flow between two pointsj and l. Let x i ,i = 1,...,q where each x i ∈ R 2 , be the setof variables of the problem: the hubs or centers location. The setof these hubs are represented by X ∈ R 2q , and the assumption isthat each pair of hubs is directly connected by the shortest distanceroute between them.Concerning the hub-and-spoke problem un<strong>de</strong>r consi<strong>de</strong>ration, theconnections between each pair of points j and l, have alwaysthree parts: from the origin point j to a first hub center i 1 , fromi 1 to a second hub center i 2 and from i 2 to <strong>de</strong>stination point l.There are no network structure constraining connections, the onlyconstraint being that connections between cities must be done throughhubs. However, the first and the second hubs can be coinci<strong>de</strong>nt(i.e., i 1 = i 2 ), meaning that a unique hub is used to connectthe origin point j and the <strong>de</strong>stination point l. Multiple allocation ispermitted, meaning that any given point can be served by one ormore hubs.The unitary flow cost associated to a general connection ( j,i 1 ,i 2 ,l)is equal to a weighted distance obtained by the sum of three Euclidiandistances, with a reduction factor for the second part betweenhubs:z ji1 i 2 l = ‖s j − x i1 ‖ 2 + α ‖x i1 − x i2 ‖ 2 + ‖x i2 − s l ‖ 2 , (1)where α is the reduction factor: 0 ≤ α < 1.The unitary flow cost from the origin point j to the <strong>de</strong>stinationpoint l is taken as the minimum value of all connections:orz jl = min z jii 1 ,i 2 =1,...,q 1 i 2 l, (2)z jl = min ‖s j − x i1 ‖ 2 + α ‖x i1 − x i2 ‖ 2 + ‖x i2 − s l ‖ 2 . (3)i 1 ,i 2 =1,...,qALIO-EURO <strong>2011</strong> – 202


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>3. SMOOTHING THE CONTINUOUS HUB-AND-SPOKEPROBLEMThe continuous hub-and-spoke problem consists of minimizing thetotal flow cost between all pairs of cities taking the unitary costvalue, given by (2), for all connections:m mmin ∑ ∑ d jl z jl (4)j=1 l=1subject to z jl = min jii 1 ,i 2 =1,...,q 1 i 2 l, j,l = 1,...,m.So, this hub-and-spoke problem has a structure named min −sum − min, with nondifferentiable and nonconvex characteristics,having a myriad of local minimizers. A series of transformationswill be performed in or<strong>de</strong>r to obtain a continuous formulation.First, consi<strong>de</strong>ring its <strong>de</strong>finition, each z jl must necessarilysatisfy the following set of inequalities:z jl − z ji1 i 2 l ≤ 0, i 1 ,i 2 = 1,...,q. (5)Substituting these inequalities for the equality constraints of problem(4), the relaxed problem becomesm mmin ∑ ∑ d jl z jl (6)j=1 l=1subject to z jl − z ji1 i 2 l ≤ 0,i 1 ,i 2 = 1,...,q;j,l = 1,...,m.Since the variables z jl are not bound from below, in or<strong>de</strong>r to obtainthe <strong>de</strong>sired equivalence, we must modify problem (6). We doso by first letting ϕ(y) <strong>de</strong>note max{0,y} and then observingthat, from the set of inequalities in (6), it follows thatq q∑ ∑ ϕ(z jl − z ji1 i 2 l ) = 0, j,l = 1,...,m. (7)i 1 =1 i 2 =1Using (7) in place of the set of inequality constraints in (6), wewould obtain an equivalent problem maintaining the un<strong>de</strong>sirableproperty that z jl , j, l = 1,...,m still has no lower bound. Consi<strong>de</strong>ring,however, that the objective function of problem (6) willforce each z jl , j ,l = 1,...,m, downward, we can think of boundingthe latter variables from below by including an ε perturbationin (7). So, the following modified problem is obtained:m mmin ∑ ∑ d jl z jl (8)j=1 l=1subject toq q∑ ∑ ϕ(z jl − z ji1 i 2 l ) ≥ ε , j,l = 1,...,m,i 1 =1 i 2 =1for ε > 0. Since the feasible set of problem (4) is the limit of thatof (8) when ε → 0 + , we can then consi<strong>de</strong>r solving (4) by solving asequence of problems like (8) for a sequence of <strong>de</strong>creasing valuesfor ε that approaches 0.Analyzing problem (8), the <strong>de</strong>finition of function ϕ endows itwith an extremely rigid nondifferentiable structure, which makesits computational solution very hard. In view of this, the numericalmethod we adopt for solving problem (1), takes a smoothingapproach. From this perspective, let us <strong>de</strong>fine the function:for y ∈ R and τ > 0.()φ(y,τ) = y +√y 2 + τ 2 /2 (9)Function φ has the following properties:(a) φ(y,τ) > ϕ(y), ∀τ > 0;(b)limτ→0φ(y,τ) = ϕ(y);(c) φ(.,τ) is an increasing convex C ∞ function in variable y.Therefore, function φ constitutes an approximation of functionϕ. By using function φ in the place of function ϕ, in (8), theproblemm mmin ∑ ∑ d jl z jl (10)j=1 l=1subject toq q∑ ∑ φ(z jl − z ji1 i 2 l,τ) ≥ ε, j,l = 1,...,m,i 1 =1 i 2 =1is produced.To obtain a differentiable problem, it is necessary further to smooththe balanced distances z ji1 i 2 l. For this purpose, let us <strong>de</strong>fine thefunctionθ(v, w, γ ) =where v,w ∈ R 2 and γ > 0.Function θ has the following properties:(a)limγ→0√(w 1 − v 1 ) 2 + (w 2 − v 2 ) 2 + γ 2 (11)θ(v, w, γ ) = ‖w − v‖ 2 ;(b) θ is a C ∞ function.By using function θ in place of the Euclidian distances, the completelydifferentiable problemm mmin ∑ ∑ d jl z jl (12)j=1 l=1q qsubject to ∑ ∑ φ(z jl − (θ(s j ,x i1 ,γ) +i 1 =1 i 2 =1α θ(x i1 ,x i2 ,γ) + θ(x i2 ,s l ,γ)),τ) ≥ ε, j,l = 1,...,m,is now obtained.So, the properties of functions φ and θ allow us to seek a solutionto problem (8) by solving a sequence of subproblems like problem(12), produced by <strong>de</strong>creasing the parameters γ → 0 , τ → 0, andε → 0.Since z jl ≥ 0, j, l = 1,...,m, the objective function minimizationprocess will work towards reducing these values to the utmost. Onthe other hand, given any set of hubs x i , i = 1,...,q, due to property(c) of the hyperbolic smoothing function φ, the constraintsof problem (12) are a monotonically crescent function in z jl . So,these constraints will certainly be active and problem (12) will ultimatelybe equivalent to the problem:ALIO-EURO <strong>2011</strong> – 203


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>m mmin ∑ ∑ d jl z jl (13)j=1 l=1q qsubject to h jl (z jl ,x) = ∑ ∑ φ(z jl − (θ(s j ,x i1 ,γ) +i 1 =1 i 2 =1α θ(x i1 ,x i2 ,γ) + θ(x i2 ,s l ,γ)),τ) − ε = 0, j,l = 1,...,m.The dimension of the variable domain space of problem (13) is(2q + m 2 ).Since, in general, the value of the parameter m, the cardinality ofthe set S of the consumer points s j , is large, problem (13) has alarge number of variables. However, it has a separable structure,because each variable z jl appears only in one equality constraint.Therefore, as the partial <strong>de</strong>rivative of h jl (z jl ,x) with respect toz jl , j, l = 1,...,m is not equal to zero, it is possible to use theImplicit Function Theorem to calculate each component z jl , j, l =1,...,m as a function of the hub location variables x i , i = 1,...,q.In this way, the unconstrained problemm mmin f (x) = ∑ ∑ d jl z jl (x) (14)j=1 l=1is obtained, where each z jl (x) results from the calculation of azero of each equationq qh jl (z jl ,x) = ∑ ∑ φ(z jl − (θ(s j ,x i1 ,γ) +i 1 =1 i 2 =1α θ(x i1 ,x i2 ,γ) + θ(x i2 ,s l ,γ)),τ) − ε = 0, j,l = 1,...,m. (15)Due to property (c) of the hyperbolic smoothing function, eachterm φ above is strictly increasing with variable z jl and thereforethe equation has a single zero. Again, due to the ImplicitFunction Theorem, the functions z jl (x) have all <strong>de</strong>rivatives withrespect to the variables x i , i = 1,...,q, , and therefore it is possibleto calculate exactly the gradient of the objective function ofproblem (14),In this way, it is easy to solve problem (14) by making use of anymethod based on first or<strong>de</strong>r <strong>de</strong>rivative information. Finally, it mustbe emphasized that problem (14) is <strong>de</strong>fined on a (2q)−dimensionalspace, so it is a small problem, since the number of hubs, q, is, ingeneral, small for real world applications.The solution of the original hub-and-spoke problems can thus beobtained by using the Hyperbolic Smoothing Hub-and-Spoke Algorithm,<strong>de</strong>scribed below in a simplified form.Simplified HSHS AlgorithmInitialization Step: Choose initial values: x 0 , γ 1 , τ 1 , ε 1 .k = 1.Main Step:Choose values 0 < ρ 1 < 1, 0 < ρ 2 < 1, 0 < ρ 3 < 1; letRepeat until a stopping rule is attainedSolve problem (14) with γ = γ k , τ = τ k and ε = ε k ,starting at the initial point x k−1 and let x k be the solution obtained.Let γ k+1 = ρ 1 γ k , τ k+1 = ρ 2 τ k , ε k+1 = ρ 3 ε k ,k := k + 1.Just as in other smoothing methods, the solution to the hub-andspokeproblem is obtained, in theory, by solving an infinite sequenceof optimization problems. In the HSHS algorithm, eachproblem that is minimized is unconstrained and of low dimension.Notice that the algorithm causes τ and γ to approach 0, sothe constraints of the subproblems it solves, given as in (12), tendto those of (8). In addition, the algorithm causes ε to approach0, so, in a simultaneous movement, the problem (8) gradually approachesproblem (4).4. COMPUTATIONAL RESULTSThe computational results presented below were obtained froma preliminary implementation. The numerical experiments havebeen carried out on a PC Intel Celeron with a 2.7GHz CPU and512MB RAM. The programs were co<strong>de</strong>d with Compac VisualFORTRAN, Version 6.1. The unconstrained minimization taskswere carried out by means of a Quasi-Newton algorithm, employingthe BFGS up<strong>da</strong>ting formula from the Harwell Library. Theinitial starting hubs xi 0 , i = 1,··· ,q are taken around the center ofgravity of the set of points, by making random perturbations proportionalto the stan<strong>da</strong>rd <strong>de</strong>viation of this set. The value of τ 1 wastaken as 1/100 of this stan<strong>da</strong>rd <strong>de</strong>viation. The following choiceswere ma<strong>de</strong> for the other parameters: ε 1 = 4τ 1 , γ 1 = τ 1 /100,ρ 1 = 1/4, ρ 2 = 1/4 and ρ 3 = 1/4.In or<strong>de</strong>r to show the performance of the proposed algorithm, resultsobtained by using the German Towns instane, which uses thetwo Cartesian coordinates of 59 towns, originally presented by [4].The instance reported here presents a symmetric <strong>de</strong>mand matrix,with a required flow of one unit between all pairs of origin and<strong>de</strong>stination cities: d jl = d l j = 1, j, l = 1,··· ,m. So, problem (14)assumed the following formulation, further simplified:minimize f (x) =m−1 m∑ ∑ z jl (x). (16)j=1 l= j+1Table 1 presents the obtained computational results. Ten differentrandomly chosen starting points were used for each instance. Thediscount parameter has been fixed in α = 0,5 The first columnpresents the specified number of hubs (q). The second columnpresents the best objective function value ( f HSHS ) produced bythe HSHS algorithm. The next three columns present the numberof occurrences of the best solution (Occ.), the average percentageerror of the 10 solutions (E Mean ) in relation to the best solutionobtained ( f HSHS ) and the CPU mean time given in seconds(T Mean ). By <strong>de</strong>fining f r as the value of the objective functionobtained at the starting point r, the percentage error is calculatedby the expression:10E Mean = 10 ∑ ( f r − f HSHS )/ f HSHS . (17)r=1q f HSHS Occur. E Mean T Mean2 0.171285E6 10 0.00 0.533 0.154629E6 6 0.01 2.484 0.139158E6 9 0.75 8.475 0.131453E6 4 2.03 18.236 0.126496E6 1 0.68 39.307 0.122636E6 2 0.73 76.288 0.119239E6 1 0.81 149.529 0.116583E6 1 1.25 246.9910 0.113962E6 1 1.39 383.56Table 1: Results for the German Towns Instance ( α = 0.5)ALIO-EURO <strong>2011</strong> – 204


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>5. CONCLUSIONSThis paper shows how a preliminary implementation of the propose<strong>da</strong>lgorithm is able to efficiently produce reliable and <strong>de</strong>eplocal minima. The motivation to solve large scale versions ofcontinuous hub-and-spoke problems stems from, among other realworld applications, aerial transportation and oil and gas off-shoreexploration. We believe that this article’s methodology is a<strong>de</strong>quateenough for the requirements of such relevant applications.6. REFERENCES[1] M.E. O’Kelly and H.L. Miller, “The hub network <strong>de</strong>sign problem:A review and synthesis,” Journal of Transport Geography,vol. 2, pp. 31–40., 1994.[2] J. Klincewicz, “Hub location in backbone/tributary network<strong>de</strong>sign: A review,” Location Science, vol. 6, pp. 337–335,2001.[3] H. Luna, Network Planning Problems in Telecommunications.M.G.C. Rezen<strong>de</strong> and P.M. Par<strong>da</strong>los (editors), Springer, NewYork, 2006, ch. Handbook of Optimization in Telecommunications,pp. 213–240.[4] H. Späth, Cluster Analysis Algorithms for Data Reduction andClassification. Ellis Horwood, Upper Saddle River, NJ.,1980.ALIO-EURO <strong>2011</strong> – 205


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>A hybrid method to solve a multi-product, multi-<strong>de</strong>pot vehicle routing problemarising in a recyclable waste collection systemTania Rodrigues Pereira Ramos ∗ ‡ Maria Isabel Gomes † Ana Paula Barbosa-Povoa ‡∗ Instituto Universitario <strong>de</strong> Lisboa (ISCTE-IUL)Aveni<strong>da</strong> <strong>da</strong>s Forcas Arma<strong>da</strong>s, Edificio ISCTE, 1649-026 Lisboa, Portugaltania.ramos@iscte.pt† CMA - FCT, Universi<strong>da</strong><strong>de</strong> Nova LisboaCampus <strong>da</strong> Caparica, 2829-516 Caparica, Portugalmirg@fct.unl.pt‡ CEG-IST, Universi<strong>da</strong><strong>de</strong> Tecnica <strong>de</strong> LisboaAveni<strong>da</strong> Rovisco Pais, 1049-001 Lisboa, Portugalapovoa@ist.utl.ptABSTRACTThe present work aims to support tactical and operational <strong>de</strong>cisionsin recyclable waste collection systems, focusing on the <strong>de</strong>limitationof service areas in systems with more than one <strong>de</strong>pot,and on vehicle routes <strong>de</strong>finition. The problem is mo<strong>de</strong>lled as amulti-product, multi-<strong>de</strong>pot vehicle routing problem. Due to problemsolution complexity, a hybrid method based on two mathematicalformulations and one heuristic procedure is <strong>de</strong>veloped as asolution method. The method proposed is applied to a large scaleproblem based on a real case study of a recyclable waste collectionsystem, where three types of recyclable materials have to becollected.Keywords: Multi-<strong>de</strong>pot, Vehicle routing, Hybrid method, Recyclablewaste collection system1. INTRODUCTIONThe present work aims to support tactical and operational <strong>de</strong>cisionsin recyclable waste collection systems with more than one<strong>de</strong>pot, helping the <strong>de</strong>cision making on the system <strong>de</strong>limitation ofservice areas and on the vehicle routes <strong>de</strong>finition. When characterizingthe recyclable waste collection system in study it can besaid that this is responsible to collect, within a certain geographicarea and in a regular basis, three types of recyclable materials usedin packaging (paper, glass and plastic/metal) dropped by the finalconsumer into special containers. When these systems have morethan one <strong>de</strong>pot, in addition to the <strong>de</strong>finition of the vehicle routes,it is also necessary to <strong>de</strong>ci<strong>de</strong> from which <strong>de</strong>pot the collection isto be performed. This problem is mo<strong>de</strong>lled as a multi-product,multi-<strong>de</strong>pot vehicle routing problem. A hybrid method is <strong>de</strong>velopedwhere a MIP solver is embed<strong>de</strong>d in a heuristic framework.The hybrid method is applied to a large scale problem based on areal recyclable waste collection system.2. LITERATURE REVIEWMDVRP consists on <strong>de</strong>fining a set of vehicle routes in such a waythat: (1) each route starts and ends at the same <strong>de</strong>pot, (2) each customeris visited exactly once by a vehicle, (3) the total <strong>de</strong>mand ofeach route does not exceed the vehicle capacity, (4) the total durationof each route (including travel and service times) does notexceed a preset limit and (5) the total routing cost is minimized.For the MDVRP, there are several mo<strong>de</strong>ls <strong>de</strong>veloped (exact and approximateapproaches). Due to its NP-hard combinatorial factor,the mo<strong>de</strong>ls presented in the literature are mostly heuristics-base<strong>da</strong>nd few exact algorithms cab be found in the literature. Laporteet al. [1], as well as Laporte et al. [2], <strong>de</strong>veloped exact branchand bound algorithms for solving the symmetric and asymmetricversion of the MDVRP, respectively. Recently, Bal<strong>da</strong>cci andMingozzi [3] <strong>de</strong>veloped an exact method for solving the HeterogeneousVehicle Routing Problem (HVRP) that is also capable tosolve, among other problems, the MDVRP. This algorithm is basedon the set partitioning formulation, where a procedure is applied togenerate routes. Three bounding procedures are used to reduce thenumber of formulation variables. As for the approximate methodsthere are several heuristic algorithms <strong>de</strong>veloped for the MD-VRP (Tillman and Cain [4], Gol<strong>de</strong>n, Magnanti and Nguyen [5],Renaud et al.[6], Salhi and Sari [7], Lim and Wang [8], Crevieret al. [9], among others). Based on the literature review, we canconclu<strong>de</strong> that few exact mo<strong>de</strong>ls for the multi-<strong>de</strong>pot problems exist,while several heuristic procedures have been <strong>de</strong>veloped for thesame problem. The combination of these two methods is also notwell explored. Therefore, this work studies this opportunity andproposes a hybrid method which combines an exact formulationwith heuristic procedures to solve the multi-product, multi-<strong>de</strong>potvehicle routing problem.3. HYBRID METHODIn Figure 1 a schematic diagram of the hybrid method proposed isshown. This involves three main steps.The first step involves the relaxation of the Multi-Product, Multi-Depot VRP (with more than one product and vehicle routes restrictedto start and finish at the same <strong>de</strong>pot) into the Single-Product,Multi-Depot VRP with Multi-Depot Routes (with just one productand multi-<strong>de</strong>pot routes allowed). By solving this mo<strong>de</strong>l, we obtainsome collection sites that belongs to feasible routes for theSingle-Product, Multi-Depot VRP, meaning that they belong to aroute that starts and finishes at the same <strong>de</strong>pot; and some othercollection sites whose route starts and finishes at different <strong>de</strong>pots.For the "feasible" collection sites, we fix their assignment to theALIO-EURO <strong>2011</strong> – 206


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>INPUTSingle-Product, MDVRPwith Multi-Depot Routes allowed• Distance Matrix(Without Duration Constraints, Collection Frequencies, Planning Horizon)• Weight to collect at eachcollection site (of onerecyclable material )• Vehicle CapacityOUTPUT• Collection Sites assigned to<strong>de</strong>pots (and its correspondingvehicle routes)• Collection Sites not assigned(that belonging to a multi-<strong>de</strong>potroutes)INPUT• Collection Sites assigned• Collection Sites not assigned• Distance MatrixHeuristic Procedureto complete service areasFor each <strong>de</strong>pot an<strong>de</strong>ach recyclablematerialOUTPUT• Service Areas Complete for each<strong>de</strong>potINPUT• Collection Sites assigned• Distance Matrix• Wheight to collect• Number of containers• Time required to collect each collection site• Road velocity• Collection frequency• Vehicle capacity• Maximum time allowed for a route• Vehicle unload duration• Number of hours available in the planning horizonVehicle Routing Problem(With Duration Constraints, Collection Frequencies, Planning Horizon)OUTPUT• Vehicle RoutesFigure 1: Structure of the proposed hybrid method.<strong>de</strong>pot (not to a particular route or vehicle, the assignment is donejust to the <strong>de</strong>pot) and then the a heuristic procedure at step two theprocedure is run. This assigns the remaining collection sites andtherefore completes the service areas by <strong>de</strong>pot.After the service areas being <strong>de</strong>fined, an exact formulation is run tosolve the Vehicle Routing Problem for each <strong>de</strong>pot and for each recyclablematerial (third step). The relaxed constraints ,on the firststep, are here consi<strong>de</strong>red: namely, the duration, the recyclable materialscollection frequencies and the planning horizon constraints.In the hybrid method, the service areas are established by the resultsobtained for one single recyclable material, at the first module.This module is also run for the other two recyclable materialsto assess which one produces the best solution regarding the minimumtotal distance travelled. In or<strong>de</strong>r to provi<strong>de</strong> further insightsinto the above steps, these will next be <strong>de</strong>scribed with greater <strong>de</strong>taile<strong>da</strong>nd with supporting references.1. Single-Product, MDVRP with Multi-Depot RoutesIn Multi-Depot VRP, vehicles are restricted to start and finishat the same <strong>de</strong>pot. This can be relaxed and multi-<strong>de</strong>potroutes are allowed, while minimizing the total distance travelled.The Multi-Depot Vehicle Routing Problem with Multi-Depot Routes has not received much attention fromresearchers. A similar problem is presented by Crevier etal. [9], entitled Multi-Depot Vehicle Routing Problem withInter-Depot Routes, where inter-<strong>de</strong>pot routes, that connecttwo different <strong>de</strong>pots, are allowed. In this case, <strong>de</strong>pots canact as intermediate replenishment facilities along the routeof a vehicle, but the rotation of a vehicle always starts an<strong>de</strong>nds at the same <strong>de</strong>pot (the authors called "rotation" to theset of all routes assigned to a vehicle). In the Multi-DepotVehicle Routing Problem with Multi-Depot Routes, the rotationconcept doesnŽt exist since the vehicles donŽt haveto return to their starting <strong>de</strong>pot. The vehicle routes can thenbe Hamiltonian cycles or just paths between two <strong>de</strong>pots.Our proposed formulation for the Multi-Depot VRP withMulti-Depot Routes is based on the two-commodity flowformulation for the CVRP, introduced by Bal<strong>da</strong>cci et al.[10]. This formulation consi<strong>de</strong>rs one real <strong>de</strong>pot and onecopy <strong>de</strong>pot, and all vehicle fleet has to be used. In the proposedformulation instead of one real and one copy <strong>de</strong>pot,we have a set of real <strong>de</strong>pots and a set of copy <strong>de</strong>pots, andwe do not impose that all vehicles are to be used.Since this formulation intends to be a simplification of themaster problem, multi-<strong>de</strong>pot routes are allowed and timeconstraints are not taken into account. We, therefore, relaxthe maximum time allowed for a route and the maximumtime available over the timeframe. As input <strong>da</strong>ta, thismodule requires the distance between each no<strong>de</strong> (collectionsites and <strong>de</strong>pots), the weight to be collected at each collectionsite (consi<strong>de</strong>ring only one recyclable material) and thevehicle fleet capacity. The output will be a set of collectionroutes, where some routes start and end at the same<strong>de</strong>pot (feasible routes) and some start and end at different<strong>de</strong>pots (infeasible routes). This module is run for each recyclablematerial, in or<strong>de</strong>r to find the best solution regardingthe minimum total distance travelled.2. Heuristic Procedure to Complete Service AreasAs mentioned above, the first module generates uncompletedservice areas by <strong>de</strong>pot. The aim of this second moduleis to complete those service areas by assigning to <strong>de</strong>potsthe collection sites that, in the previous module, were associatedto infeasible routes. The assignment is done througha greedy heuristic rule where the collection site is assignedto the nearest service area.3. Vehicle Routing ProblemAfter phase two, service areas by <strong>de</strong>pot are already <strong>de</strong>fined.It is now necessary to solve a vehicle routing problem foreach <strong>de</strong>pot to accomplish the multi-<strong>de</strong>pot vehicle routingproblem. The mathematical formulation used to solve theVRP is based on the two-commodity flow formulation [10],taking into account the collection frequencies of the recyclablematerials, the route duration limit and the number ofhours available in the planning horizon.ALIO-EURO <strong>2011</strong> – 207


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>4. APPLICATION TO A LARGE SCALE PROBLEMThe procedure <strong>de</strong>veloped is applied to a large scale problem, extractedfrom a real case study. This instance problem has 100 collectionsites and 3 <strong>de</strong>pots. We consi<strong>de</strong>red that each <strong>de</strong>pot has onevehicle that could make several trips over the timeframe. The timeframehas four weeks and the collection frequencies consi<strong>de</strong>red foreach recyclable material are once a month for glass, twice a monthfor plastic and once a week for paper. We do not set a limit tothe number of trips that a vehicle can do over the timeframe, butthe number of hours that a vehicle can work over the timeframe islimited to 160 hours (4 weeks × 5 <strong>da</strong>ys per week × 8 hours per<strong>da</strong>y).The first and third modules of the hybrid method are solved usingthe branch-and-bound method implemented in the solver ofthe CPLEX Optimizer 12.1.0. The branch-and-bound computationtime is arbitrarily limited to 2 hours. The second module, with theheuristic procedure, was written in MATLAB. An Intel(R) Core(TM) i7 CPU 930 @ 2.80 GHz is used.The mo<strong>de</strong>l will produce three solutions regarding service areas by<strong>de</strong>pot: the first one consi<strong>de</strong>rs glass results from the first and secondmodule; the second one consi<strong>de</strong>rs paper results and the third oneconsi<strong>de</strong>rs plastic/metal results. The solution that minimizes thetotal distance travelled is the one <strong>de</strong>veloped for the plastic/metalmaterial (Figure 2).5. CONCLUSIONSThis work studies the multi-product, multi-<strong>de</strong>pot vehicle routingproblem and proposes a hybrid method based on two mathematicalformulations and one heuristic procedure. This is justifiedby the high complexity associated to this kind of problems. Themethod proposed is applied to a large scale problem based on a realcase study where a recyclable waste collection system that collectsthree types of recyclable materials is consi<strong>de</strong>red. The existence ofmultiple <strong>de</strong>pots in such problems requires a service area <strong>de</strong>finitionby <strong>de</strong>pot. Each <strong>de</strong>pot is responsible to collect a set of collectionsites (which have the three recyclable materials to be collected) andto <strong>de</strong>fine the collection routes by recyclable material. To accomplishthe service areas by <strong>de</strong>pot, the hybrid method produces threesolutions consi<strong>de</strong>ring in<strong>de</strong>pen<strong>de</strong>ntly paper, glass and plastic/metalresults. Service areas based on plastic/metal results revealed toproduce the minimum total distance to be travelled.As future work, the heuristic procedure will be improved and thehybrid method will be applied to the real case study with 212 collectionsites and 5 <strong>de</strong>pots.6. REFERENCES[1] G. Laporte, Y. Nobert, and D. Arpin, “Optimal solutions tocapacitated multi-<strong>de</strong>pot vehicle routing problems,” CongressusNumerantium, vol. 44, pp. 283–292, 1984.[2] G. Laporte, Y. Nobert, and S. Taillefer, “Solving a family ofmulti-<strong>de</strong>pot vehicle-routing and location-routing problems,”Transportation Science, vol. 22, no. 3, pp. 161–172, 1988.[3] R. Bal<strong>da</strong>cci and A. Mingozzi, “A unified exact method forsolving different classes of vehicle routing problems,” MathematicalProgramming, vol. 120, no. 2, pp. 347–380, 2009.[4] F. Tillman and T. Cain, “Upperbound algorithem for singleand multiple terminal <strong>de</strong>livery problem,” Management ScienceSeries a-Theory, vol. 18, no. 11, pp. 664–682, 1972.[5] B. Gol<strong>de</strong>n, T. Magnanti, and H. Nguyen, “Implementing vehiclerouting algorithms,” Networks, vol. 7, no. 2, pp. 113–148, 1977.[6] J. Renaud, G. Laporte, and F. Boctor, “A tabu search heuristicfor the multi-<strong>de</strong>pot vehicle routing problem,” Computers andOperations Research, vol. 23, no. 3, pp. 229–235, 1996.[7] S. Salhi and M. Sari, “A multi-level composite heuristic forthe multi-<strong>de</strong>pot vehicle fleet mix problem,” European JournalOperational Research, vol. 103, no. 1, pp. 397–402,1997.[8] A. Lim and F. Wang, “Multi-<strong>de</strong>pot vehicle routing problem:A one-stage approach,” IEEE Trans Autom Sci Eng., vol. 2,no. 4, pp. 397–402, 2005.[9] B. Crevier, J. Cor<strong>de</strong>au, and G. Laporte, “The multi-<strong>de</strong>potvehicle routing problem with inter-<strong>de</strong>pot routes,” EuropeanJournal Operational Research, vol. 176, no. 2, pp. 756–773,2007.[10] R. Bal<strong>da</strong>cci, E. Hadjiconstantinou, and A. Mingozzi, “Anexact algorithm for the capacitated vehicle routing problembased on a two-commodity network flow formulation,” OperationsResearch, vol. 52, no. 5, pp. 723–738, 2004.ALIO-EURO <strong>2011</strong> – 208


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Figure 2: Results obtained for each module of the hybrid method, consi<strong>de</strong>ring plastic/metal results to <strong>de</strong>fine service areas.ALIO-EURO <strong>2011</strong> – 209


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Design and Planning of Supply Chains with Integrated Forward and ReverseDecisionsSónia R. Cardoso ∗ Ana Paula F. D. Barbosa-Póvoa ∗ Susana Relvas ∗∗ CEG-IST, UTLAv. Rovisco Pais, 1049-001 Lisboa{sonia.cardoso, apovoa, susanaicr}@ist.utl.ptABSTRACTMarkets increasing competition, coupled with a growing concernwith the environment has created a need to increase supply chains’sustainability. To achieve this, the supply chain should integratereverse logistics activities. In this paper, a mixed integer linearprogramming formulation is <strong>de</strong>veloped for the <strong>de</strong>sign and planningof supply chains while consi<strong>de</strong>ring simultaneously productionand reverse logistics activities with the goal of maximizingthe net present value. The mo<strong>de</strong>l is applied to a case study whereforward and reverse activities are consi<strong>de</strong>red. A sensitivity analysisis performed in or<strong>de</strong>r to assess the resulting changes on theoptimal solution.Keywords: Reverse Logistics, Optimisation, Design, Planning1. INTRODUCTIONMarkets increasing competition, coupled with a growing concernwith the environment, has created a new way of thinking when<strong>de</strong>signing and planning supply chains. A need to increase supplychains’ sustainability is emerging. To achive this, companiesmust invest in the <strong>de</strong>sign and operation of these systems in or<strong>de</strong>rto reduce their ecological footprint [1]. Therefore, the supplychain should be now seen as a close loop system [2] where reverselogistics activities are inclu<strong>de</strong>d encompassing the transportationand reprocessing of collected products. Investing in reverse logisticsallows the achievement of cost savings in the procurement,disposal and transportation [3]. However, the establishment of areverse network that is in<strong>de</strong>pen<strong>de</strong>nt of the forward can increaseinfrastructure costs and can reduce the potential profit associatedwith remanufacturing [4]. So, for a better network <strong>de</strong>sign and planningit is necessary to consi<strong>de</strong>r simultaneously the forward and reverseflows. Several studies were published in this area, such as[5] who analyzed reverse logistics concluding that the research onthis subject focuses only on separate aspects and there is no holisticanalysis of the supply chain. As stated by [6] very few mo<strong>de</strong>lscombine, within a single formulation both forward and reverseflows and much less works consi<strong>de</strong>r the integration of the reversechain as i<strong>de</strong>ntified by [7] and [8]. Corroborating such conclusions,[9] mention that the number of published works where both forwar<strong>da</strong>nd reverse flows are taken into account simultaneously isless than the ones that treat them separately. These authors presenta generic mo<strong>de</strong>l for the <strong>de</strong>sign and planning of supply chains andstate that there are several research opportunities in this area. Thus,it is possible to conclu<strong>de</strong> about the importance of the <strong>de</strong>velopmentof a mo<strong>de</strong>l that <strong>de</strong>als with both flows at the same time in a realisticway.Figure 1: Network representation.2. METHODOLOGYIn this paper, it is proposed an optimization mo<strong>de</strong>l for the simultaneous<strong>de</strong>sign and planning of supply chains with forward andreverse flows, given a certain time horizon. The mo<strong>de</strong>l representationuses Mixed-Integer Linear Programming (MILP) formulationand was <strong>de</strong>veloped based on the work by [10]. These authors studiedthe <strong>de</strong>sign and planning of supply chains, taking into accounteconomical and environmental aspects. However, they only consi<strong>de</strong>ra three echelon supply chain and did not incorporate reverselogistics. Such mo<strong>de</strong>l is generalized in the current work.The problem addressed in this work has the objective to <strong>de</strong>terminethe configuration of the network along with planning <strong>de</strong>cisions thatmaximize the net present value. The forward supply chain in studyis formed by four echelons: plants with a set of available processesto be installed; warehouses where final products are assembled andstored; and retailers which are responsible to <strong>de</strong>liver the final productsto markets, final level of the forward supply chain. It is alsoconsi<strong>de</strong>red that factories can exchange all type of products amogthem, including raw materials or intermediate products. In the reverseflow, products are sent from clients to retailers and then aresorted. Products that are too <strong>da</strong>maged are sent to disposal whileproducts in their end of life are sent to factories to be disassembledto be reprocessed and non-conforming products are sent towarehouses to be repacked. In figure 1, it is possible to see a representationof the consi<strong>de</strong>red network.The mo<strong>de</strong>l <strong>de</strong>termines the number, location and capacity of theprocesses that have to be installed in each plant, warehouses andretailers in or<strong>de</strong>r to maximize the net present value of the supplychain. It also allows to <strong>de</strong>fine the best production rates at theplants, forward and reverse flows between all no<strong>de</strong>s of the network,establishment of transportation links between all entities, inventorylevels at warehouses and retailers. At the same time, it hasto respect some constraints, namely mass balances at each no<strong>de</strong>of the network, the material flows between each pair of entitieshave to be within the allowed boun<strong>da</strong>ries, the capacity of each infrastructurecannot be excee<strong>de</strong>d and the products’ bill of materialsmust be complied.The mo<strong>de</strong>l is applied to one example to show its applicability. Thisexample is run for two different cases. The first case inclu<strong>de</strong>s onlyALIO-EURO <strong>2011</strong> – 210


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>the forward flow while the second case analyses a supply chainwhere reverse logistics activities are inclu<strong>de</strong>d. For these examplesthe number of variables varies from 18119 for the first caseto 163463 for the second. Although, and as expected, it was observe<strong>da</strong>n increase in the problem complexity when introducingthe reverse flow. Nevertheless, this fact did not resulted into difficultiesat the solution level, the solution gap obtained was zero inboth cases and the computational time was less than two seconds.Since some assumptions on the example parameters were consi<strong>de</strong>red,a sensitivity analysis on the most critical parameters wasperformed. This allow us to assess the resulting changes on theoptimal solution, infrastructures capacities and on other planning<strong>de</strong>cisions.3. FINDINGSThe MILP is <strong>de</strong>veloped for the <strong>de</strong>sign and planning of supplychains while consi<strong>de</strong>ring simultaneously production and reverselogistics activities. The final results present <strong>de</strong>tails on the productionlevels, forward and reverse flow of products, inventory levels,establishment of transportation links and infrastructures capacities.The results obtained for the two situations analyzed are compare<strong>da</strong>nd it is possible to conclu<strong>de</strong> that the inclusion of the reverse flowswith the associated reverse logistics activities in the supply chainallows the achievement of a better net present value.Furthermore, in terms of the supply chain robustness the structure<strong>de</strong>signed appear quite robust since following a sensitivity analysisperformed on the two most critical parameters, the minimumpercentage of collection of end of life products and the percentageof <strong>de</strong>mand satisfaction, no significant changes in the networkstructure were observed.The MILP mo<strong>de</strong>l was implemented in GAMS language, version22.8, and solved using an IBM-ILOG’s CPLEX branch and boun<strong>da</strong>lgorithm, version 11.0, in an Intel(R) Core(TM) i7 CPU 2.80 GHzcomputer with 6 GB RAM.4. CONCLUSIONIn this work it is proposed an optimization mo<strong>de</strong>l for the <strong>de</strong>signand planning of a four echelon supply chain with forward and reverseflows allowing for the simultaneous incorporation of bothproduction and <strong>de</strong>livery of products as well as reverse logistics activities<strong>de</strong>aling with the recovery of non conforming products aswell as of products in its end of life time. The mo<strong>de</strong>l applicationshows that reverse logistics incorporation may result in an incrementof economical benefits associated with environmental concerns,fact that can be seen as a business opportunity. As futurework, it is inten<strong>de</strong>d to incorporate risk and environmental issuesin the mo<strong>de</strong>l with the goal of maximizing the NPV and simultaneouslyminimizing risk and environmental impacts. In addition, wealso aim to apply the <strong>de</strong>veloped mo<strong>de</strong>l to different types of supplychains so as to <strong>de</strong>monstrate its applicability to real case studies.5. REFERENCES[1] Barbosa-Póvoa, A., Salema, M. and Novais, A., Design andPlanning of Closed-Loop Supply Chains. In Supply ChainOptimization, Editores L. Papageorgiou and M. C. Georgiadis,Wiley-VCH, Germany, 7, 187-218, 2007.[2] Gui<strong>de</strong>, D. and Van Wassenhowe, L., “The Reverse SupplyChain”, Harvard Business Review, 80(2), 25-26, 2002.[3] Krikke, H., Bloemhof-Ruwaard, J. and Van Wassenhowe, L.,“Concurrent product and closed-loop supply chain <strong>de</strong>signwith an application to refrigerators”, International Journalof Production Research, 41, 3689-719, 2003.[4] Uster, H., Easwaran, G. and Çetinkaya, E., “Ben<strong>de</strong>rs <strong>de</strong>compositionwith alternative multiple cuts for a multi-productclosed-loop supply chain network <strong>de</strong>sign mo<strong>de</strong>l”, Naval ResearchLogistics, 54(8), 890-907, 2007.[5] Fleischmann, M., Bloemhof-Ruwaard, J., Dekker, R., Van<strong>de</strong>r Laan, Van Nunen, J. and Van Wassenhove, L., “Quantitativemo<strong>de</strong>ls for reverse logistics: A review”, EuropeanJournal of Operational Research, 103, 1-17, 1997.[6] Goetschalckx, M., Vi<strong>da</strong>l, C. and Dogan, K., “Mo<strong>de</strong>ling and<strong>de</strong>sign of global logistics systems: A review of integratedstrategic and tactical mo<strong>de</strong>ls and <strong>de</strong>sign algorithms”, EuropeanJournal of Operational Research, 143(1), 1-18, 2002b.[7] Papageorgiou, L., “Supply chain optimisation for the processindustries: Advances and opportunities”, Computers &Chemical Engineering, 33(12), 1931-1938, 2009.[8] Melo, M., Nickel, S. and Sal<strong>da</strong>nha <strong>da</strong> Gama, F., “FacilityLocation and Supply Chain Management - A review”, EuropeanJournal of Operation Research, 196, 401-412, 2009.[9] Salema, M., Barbosa-Póvoa, A. and Novais, A., “Simultaneous<strong>de</strong>sign and planning of supply chains with reverse flows:A generic mo<strong>de</strong>lling formulation”, European Journal of OperationalResearch, 203, 336-349, 2010.[10] Guillén-Gosálbez, G. and Grossmann, I., “A global optimizationstrategy for the environmentally conscious <strong>de</strong>sign ofchemical supply chains un<strong>de</strong>r uncertainty in the <strong>da</strong>mage assessmentmo<strong>de</strong>l”, Computers & Chemical Engineering, 34,42-58, 2010.ALIO-EURO <strong>2011</strong> – 211


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Reverse Logistics Network Design for Household Plastic WasteXiaoyun Bing ∗ Jacqueline Bloemhof Jack van <strong>de</strong>r Vorst∗ Wageningen University and Research Center, the NetherlandsLogistics, Decision and Information SciencesP.O. Box 8130, 6700 EW Wageningenxiaoyun.bing@wur.nlABSTRACTThis paper applies MILP methods to improve the network <strong>de</strong>signof reverse logistics for household plastic waste based on the case ofNetherlands. The purpose is to provi<strong>de</strong> <strong>de</strong>cision support for variousstakehol<strong>de</strong>rs in choosing the most suitable recycling collectionmethods with an optimized network <strong>de</strong>sign that both balancestheir interests and improves the recycling efficiency. Separationmethod <strong>de</strong>termines whether the quality and quantity of the plasticsmaterial is high enough to be economically efficient and environmentallyeffective. Currently, source separation (separation athouseholds) is dominating as suggested by legislation. However,since the overall collection rate is not satisfying, municipalities aretrying different ways to <strong>de</strong>al with plastic waste. There is a need toadopt the system according to the characteristics of the municipalities.This research follows the approach of scenario study. We startwith the simulation of the current situation followed by investigatingthe impacts of various changes in the collection system. Foreach scenario, we suggest improvement in the network by repositioningthe locations for separation, sorting and reprocessing sites.(post-separation), making a difference in infrastructure, collectionfrequency, vehicle types, etc. Decisions on the choice of the system<strong>de</strong>pend on issues like the type of municipality <strong>de</strong>scribed bypopulation <strong>de</strong>nsity, geographical location, househol<strong>de</strong>rs’ behavioras well as the availability of resources. The separation method,together with the corresponding collection system and frequency,<strong>de</strong>termine whether the quality and quantity of the plastics materialis high enough to be economically efficient and environmentallyeffective.Keywords: Reverse logistics, Network <strong>de</strong>sign, Mixed integer linearprogramming, Plastic recycling1. INTRODUCTION1.1. PurposeThis paper applies operations research methods to improve the network<strong>de</strong>sign of reverse logistics for household plastic waste basedon the case of the Netherlands. The purpose is to provi<strong>de</strong> <strong>de</strong>cisionsupport for various stakehol<strong>de</strong>rs in choosing the most suitable recyclingcollection methods with an optimized network <strong>de</strong>sign thatboth balances their interests and improves the recycling efficiency.1.2. Problem and Research QuestionDue to a higher volume to weight ratio in comparison to otherrecyclables, plastics have a larger number of kilometers traveledper tonne, meaning more emissions and less efficiency in transportation(Craighill and Powell, 1996)[1]. That is why only a fewrecycling and collection facilities exist compared to other typesof recyclable packaging waste such as glass and paper (Waste online,2010). However, the rising oil price and the cost reductionby using recycled plastics instead of virgin polymer-based plasticslead to a high <strong>de</strong>mand for plastic recycling. As a result, there isa need to build an efficient network that improves the recyclingsystem. Figure 1 illustrates the current flow of plastics recycling.Plastics recycling network in the Netherlands is characterized byvarious collection, separation and treatment systems. The first stepof the processing system, separating plastics from other waste, canoccur at households (source separation) or in separation centersFigure 1: Flow chart of reverse network for plastics wasteA special characteristic of the Dutch network is that although theland area is not large, there are 441 different municipalities varyinga lot in population <strong>de</strong>nsity and housing types (Central Bureau ofStatistics in Netherlands, 2009). For many municipalities, there isa mixture of different household types (apartments, houses, farms)within the municipality which results in not only population <strong>de</strong>nsitydifference but also diversity in plastic waste components. Notall the processing work is done insi<strong>de</strong> the country, for instance,sorting facilities in Germany and reprocessing companies from allover the Europe are involved in the network as well. Currently,source separation is dominating. More than 90% of the municipalitiesare doing source separation as suggested by legislation. Sincethe overall collection rate is not satisfying, municipalities are tryingdifferent ways to <strong>de</strong>al with plastic waste. Households in urbanmunicipalities have limited space at home for doing source separation.Therefore there is a need to adopt the system according tothe characteristics of the municipalities.The research question in this paper is• What is the best reverse network <strong>de</strong>sign for plastic recyclingin Netherlands that is both efficient and sustainable?This inclu<strong>de</strong>s the <strong>de</strong>cision support for making the choice of sourceseparation or post-separation in or<strong>de</strong>r to balance the interests an<strong>da</strong>chieve the lowest overall transport cost from the point of collec-ALIO-EURO <strong>2011</strong> – 212


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>tion to the final processing facility.On contrary to normal distribution networks in which products assembleat the source or sometimes during the flow, plastic wastedisassembles along distribution from the sources to the end processors.Many plastic fractions are collected together at the sourcemixed with dirt and moisture and even other municipal solid waste,<strong>de</strong>pending on the collection method. Along the flow, separationand sorting are going on. In the end, different plastic fractions willbe distributed to separated processors. The useless part out of eachstep of separation/sorting will be disposed through other channels,therefore quantity of plastics also reduces during distribution. Additionally,PET bottles are a special category of plastic waste. Ithas a special channel of recycling in the network other than thenormal plastic waste. The network <strong>de</strong>sign should fit these specialfeatures.1.3. Literature ReviewThe origin of research on reverse logistics can be <strong>da</strong>ted back to1995. Ever after that, there has been growing interest in researchof this filed. Compared with forward logistics, Fleischmann et.al(1997) [2] i<strong>de</strong>ntified a specialty of reverse distribution network,that is, it is not necessarily a symmetric picture of forward distribution.Most of them has a "many to few" network structure.In the case of plastic recycling network, the many municipalitiesas suppliers and the few sorting plants and reprocessors as customersform such a "many to few" structure. Rubio et.al(2006)[3]reviewed the characteristics of the research on reverse logisticsduring the period of 1995-2005 and pointed out that the majorityof research focuses on the study of tactical and operational aspectslike production planning and inventory management. Research onreverse logistics could be directed to the analysis on strategic aspects.Another trend in this research field is that environmentalissues are becoming an important parameter in logistics network<strong>de</strong>sign. Srivastava (2007)[4] reviewed green supply chain managementtaking a reverse logistics angle. The new concept of greensupply chain leads to a shift from minimizing cost to a balance betweencost and environmental impact. In line with these researchdirections, this paper <strong>de</strong>als with the interaction between availabletechnology(separation and sorting) and possible collection methods.Through network planning and scenario study, the purpose isto provi<strong>de</strong> <strong>de</strong>cision support for stakehol<strong>de</strong>rs in choosing the mostsuitable and sustainable recycling strategy.1.4. MethodologyMixed integer linear programming (MILP) mo<strong>de</strong>ls are used in thisnetwork <strong>de</strong>sign. To achieve the objective, the research follows theapproach of scenario study by forming a list of scenarios first,then comparing the network mo<strong>de</strong>ling result of these scenarios.The mo<strong>de</strong>ling is conducted by using a graphical optimization toolIBM Logic Net Plus. Unlike the usual forward supply chain networkmo<strong>de</strong>l, we have all the plastic fractions together with dirt andmoisture as various "products" in the mo<strong>de</strong>l. Municipalities are thesupplier of these "products". The distinctive "many to few" structureis built in and the special feature of product disassembling aswaste disposal during the flow is simulated. The objective of theMILP mo<strong>de</strong>l is to minimize the overall transportation cost of thefour levels of Figure 1. In each scenario, different network layout,assumptions on the choice of collection channels and the characteristicsof municipalities <strong>de</strong>fine the quantity of the products, theirflow and the availability of facilities in the network which are constraintsfor the mo<strong>de</strong>l. In this scenario study, we start with thesimulation of the current situation based on source separation withseparate PET bottle collection. Then we investigate the impacts of• adopting PET collection system from other countries;• choosing a collection method according to the population<strong>de</strong>nsity of the municipality.Mo<strong>de</strong>ling results are compared and discussed to answer the abovementioned research question. For each scenario, we suggest improvementsin the network by repositioning the locations for sorting,separation and reprocessing sites.1.5. Data and Data SourcesMain <strong>da</strong>ta used for building up the mo<strong>de</strong>ls and the <strong>da</strong>ta sources areas follows:• Municipalities (population, quantity of plastic waste, location)Statistics can be collected through the annual reports ofCBS (Central Bureau of Statistics in Netherlands)• Processing facilities (function, location, capacity)Nedvang (Dutch packaging waste recycling association),has relevant information on the processing facilities in Netherlands.Another project partner Aachen University has moreexpertise on the system of Germany facilities.• Plastic waste (components, quality, separation technology)Data is provi<strong>de</strong>d by one of the research anchors of the KenniscentrumNascheiding (KCN), an Expertise Center locate<strong>da</strong>t Wageningen University that investigates the technologicaland economical feasibility, as well as the environmentalimpact, of new technologies for the treatment of plastics(packaging waste) found in household waste.2. FINDINGSThe mo<strong>de</strong>ling results are expected to give the following findings• The logistics bottleneck of applying source separation/postseparationin a country• The impact of choosing separation systems that fit the characteristicsof municipalities• For a chosen combination of source separation and postseparation,the location allocation of the separation and processingfacilities• The influence on transportation efficiency of separately collectingthe PET bottles.Current preliminary results show that in general, having a separatechannel for PET bottles collection reduces the over all transportationcost. Among all the tested scenarios, doing source separationin rural ones and post-separation in urban areas while keeping theseparate PET bottle refund system performs best in saving transportationcost.3. RELEVANCE / CONTRIBUTIONThis paper focuses on a niche scope of reverse logistics by applyingoperations research mo<strong>de</strong>ling techniques in reverse logistics onplastic recycling problem in specific with multiple objectives, multiplelayers and multiple stakehol<strong>de</strong>rs. Traditional network <strong>de</strong>signmo<strong>de</strong>ls are modified and exten<strong>de</strong>d to fit in the specific networkfeatures and to solve the applied problem.• shifting to 100% post-separation;ALIO-EURO <strong>2011</strong> – 213


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>4. ACKNOWLEDGEMENTSThis research is supported by Nedvang (Dutch packaging waste recyclingassociation) in cooperation with KCN (an Expertise centerin Wageningen University of plastic recycling). We would like tothank them for all the support they give and also the great help incollecting <strong>da</strong>ta for the research.Many thanks to the reviewers!5. REFERENCES[1] A. Craighill and J. Powell, “Lifecycle assessment and economicevaluation of recycling: A case study,” RESOURCESCONSERVATION AND RECYCLING, vol. 17, no. 2, pp. 75–96, AUG 1996.[2] M. Fleischmann, J. Bloemhof-Ruwaard, R. Dekker,E. van <strong>de</strong>r Laan, J. van Nunen, and L. VanWassenhove,“Quantitative mo<strong>de</strong>ls for reverse logistics: A review,” EURO-PEAN JOURNAL OF OPERATIONAL RESEARCH, vol. 103,no. 1, pp. 1–17, NOV 16 1997.[3] S. Rubio, A. Chamorro, and F. J. Miran<strong>da</strong>, “Characteristicsof the research on reverse logistics (1995-2005),” INTERNA-TIONAL JOURNAL OF PRODUCTION RESEARCH, vol. 46,no. 4, pp. 1099–1120, 2008.[4] S. K. Srivastava, “Green supply-chain management: A stateof-the-artliterature review,” INTERNATIONAL JOURNAL OFMANAGEMENT REVIEWS, vol. 9, no. 1, pp. 53–80, MAR2007.ALIO-EURO <strong>2011</strong> – 214


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Reverse Cross DockingJuan Pablo Soto ∗ Rosa Colomé Perales † Marcus Thiell ∗∗ UniAan<strong>de</strong>s School of ManagementBogotá, Colombia{jps, mthiell}@adm.unian<strong>de</strong>s.edu.co† ESCI, Universitat Pompeu FabraPg. Puja<strong>de</strong>s 1, Barcelonaosa.colome@esci.esABSTRACTNowa<strong>da</strong>ys companies are facing an important challenge in theirdistribution, as frequent <strong>de</strong>liveries and small or<strong>de</strong>r sizes are thecommon rule to<strong>da</strong>y. For this type of distribution, cross-docking isa logistics activity that generates several advantages like reductionin lead times and manipulation costs. In addition, Reverse Logistics(RL) has achieved more importance in recent years within thebusiness world. In particular companies with fashion products areintroducing RL activities to recover and, in most cases, resale theproducts through the same or through different channels of distributionlike outlets, secon<strong>da</strong>ry markets, or internet, with the purposeto recapture value. Despite of the success of cross-dockingin distribution, the concept has not been applied for the reverseflow so far. In this paper we propose a linear programming mo<strong>de</strong>lthat allows the use of cross-docking in a Reverse Logistics context,where returned products can be redirected to the outlets chainwithout storage.Keywords: Reverse Logistics, Cross-docking1. INTRODUCTIONWith improvements in logistics operations, nowa<strong>da</strong>ys companiesare facing a change in the way they are doing their distribution.To<strong>da</strong>y it is very common to find frequent shipments with smallor<strong>de</strong>r sizes. In such situations, cross-docking is a logistical activitythat generates several advantages [1]in areas like inventorymanagement, or<strong>de</strong>r picking, and transportation [2]. Products arrivingto the cross dock are unloa<strong>de</strong>d from inbound trailers, possiblyreconsoli<strong>da</strong>ted with other products arriving from different <strong>de</strong>stinations,and loa<strong>de</strong>d into outbound trailers within less than 24 hours[3]. In practice, cross-docking is possible because the suppliersreceive high quality information and organize the or<strong>de</strong>rs by final<strong>de</strong>stination. At the moment goods arrive at the distribution center,it is then just necessary to translate them into a pre<strong>de</strong>fined positionfor the final client.Reverse Logistics (RL), the second theoretical concept we refer toin this paper, has received more importance in recent years aroundthe world. The implementation of environmental laws, an increasingcustomer awareness related to environmental issues, the reductionof the product life cycles and the creation of new businessmo<strong>de</strong>ls based on returned products, are some of the drivers forthe introduction of RL operations in supply chains [4]. As a consequence,many of the theories and practices <strong>de</strong>veloped in directlogistics have been adjusted properply to an environment that inclu<strong>de</strong>sthe returned products at the end of its life cycle [5]. Asreverse flows of products are characterzed by high uncertainty inquantity and quality, those flows are like a “black box” until theyarrive to the distribution center. Therefore the application of crossdocking in the reverse logistics context is not prevalent so far. Butin the special case of fashion retail companies, who have their ownnetwork of outlet stores, it seems to be possible to introduce crossdocking in the reverse flow since product assortments can be createdbased on the available products which were unsold during thesales period. Although an "i<strong>de</strong>al" product assortment is plannedfor each outlet store, this is not unchangeable and allows companiesto modify it with a certain <strong>de</strong>gree of flexibility.In this paper we propose a mo<strong>de</strong>l <strong>de</strong>signed for retail chains thathave their own outlet stores to commercialize the returned productsbeing called the “Reverse Cross-docking Mo<strong>de</strong>l”. The nextsection treats a literature review related to this topic, followed bythe explanation of the mo<strong>de</strong>l in section 3, the mathematical formulationin section 4, and a conclusion as well as a research outlookin section 5.2. LITERATURE REVIEWReverse Logistics (RL) has been <strong>de</strong>fined as the process of planning,implementing and monitoring the effective and efficient flowof raw materials, in-process inventory, finished goods and all relatedinformation from the point of consumption to the point oforigin with the purpose of recapturing value or arrange it properly[6]. In recent years, RL has gained more importance both inbusiness and research. This growth is particularly significant insectors with high rates of returns and high stan<strong>da</strong>rds of customerservice. Additionally, the growing concern about climate changeand environmental impact of business activities, joint with the legislationchanges towards a cleaner manufacturing environment, hasenhanced companies to introduce different practices of product recovery.1Cross-docking systems have been implemented since many yearsin the business world, being <strong>de</strong>fined as a warehousing strategy thatinvolves movement of material directly from the receiving dock tothe shipping dock with a minimum time in between [7]. In otherwords, it is a practice of moving goods through distribution centerswithout storing it, increasingly used by enterprises for comprehensivemanagement of its operations in the supply chain. Oneexample for its application is Wal-Mart, who introduces the crossdockingsystem early in the ’90s. A big part of the competitiveadvantage and growth of Wal-Mart in the U.S., was due to the useof this strategy in its operations [8].To the best of the authors knowledge no mo<strong>de</strong>l was presented to<strong>de</strong>al with the implementation of cross-docking in RL so far. Dueto the RL characteristics, mainly related to variety and uncertainty1 (see www.greenbiz.com)ALIO-EURO <strong>2011</strong> – 215


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>of returned products, it can be assumed that it is difficult to applycross-docking in the RL field. This paper contributes to the literatureof RL with the creation of the first mo<strong>de</strong>l <strong>de</strong>aling with theimplementation of cross-docking in a RL context.box-outlet pair. Afterwards, the optimization mo<strong>de</strong>l searches forthe Global Percentage of Acceptance (GPA) that a box needs toachieve in or<strong>de</strong>r to be sent to an outlet store. Figure (1) shows howthe system works.3. PROBLEM DESCRIPTIONDespite the general benefits <strong>de</strong>rived from cross-docking in directlogistics, this use in RL is difficult due to the characteristics of theReverse Supply Chain: outlet stores receive the products withouthaving or<strong>de</strong>red them explicitely as their product offering <strong>de</strong>pendson the return flow of the stores and the <strong>de</strong>stination of the goodsis not <strong>de</strong>fined in advance, because the quality and the quantity areunknown in most cases. Sometimes it is not even know when theproduct arrives; normally, there are no or<strong>de</strong>rs from the "clients"of the chain, i.e. secon<strong>da</strong>ry markets where the commodity is sold.Even in the case that a company has control on the secon<strong>da</strong>ry markets,it is difficult to achieve a fit between these or<strong>de</strong>rs and theproducts that are returned from the main channel of sale.In a situation where companies control both, the direct store chainand the outlet store chain, there is a way of benefiting from crossdockingpractices in the returns flow. Fashion companies fit wellwith the <strong>de</strong>scribed profile, since their product is affected by seasonal<strong>de</strong>mands (autumn-winter and spring-summer), and its merchandisemust rotate during each season. At the end of the seasonall merchandise that has not been sold is taken out of the shops andsent back to the distribution center. From this moment the productsare no longer managed by the main distribution channel as aresult of the inten<strong>de</strong>d variation of product assortments from seasonto season.Traditionally, a returned box from a store follows several processesin or<strong>de</strong>r to be prepared for the distribution to the outlet stores.Main processes are: opening the boxes, sorting the products, organizingthem by reference, size, color and other pre<strong>de</strong>fined features,storage them in the warehouse, and finally use them to fulfillthe or<strong>de</strong>rs generated by the outlets. While this system works relativelywell, it is quite costly, mainly due to the storage costs andtime required to have a product ready to be sent to the outlet stores.Usually, fashion companies create a <strong>de</strong>sirable product assortmentlist for the outlet stores combining a product offer that is supposedto satisfy the final customer. To create this product assortmentlist, companies consi<strong>de</strong>r past sales and customers profiles for eachoutlet.Cross docking can help to minimize operational costs, mainly ofstorage, retrieval and picking. In or<strong>de</strong>r to operate with cross-dockingin the reverse channel it is necessary to have the critical informationavailable (i.e. <strong>de</strong>sired product assortment from each outlet andproduct sent per box from the direct stores). With this informationat hand, it is possible to create a matching lists in which the companycan see if there is a box that coinci<strong>de</strong>s with the <strong>de</strong>sired productsof an outlet. As it is difficult to find a box that fits 100% withthe or<strong>de</strong>r of an outlet store, we assume that the company has someflexibility to change the <strong>de</strong>sirable products of a given outlet store.But this flexibility is limited as the outlet stores do not intend tohave an excess of unwanted products. In other words, this leads tothe question: How many unwanted products is the company ableto send to the outlet stores?If the company sends many un<strong>de</strong>sired products to the outlets, thoseproducts will be returned with a certain probability after passing aperiod of time in the outlet stores, causing corresponding costs. Ifthe company does not have a certain <strong>de</strong>gree of flexibility, just afew boxes can be sent through the reverse cross docking operationmaking it finally inefficient. To <strong>de</strong>al with this situation, we create<strong>da</strong>n optimization mo<strong>de</strong>l where an optimal percentage of matchingis established. This Matching Percentage is computed for everyFigure 1: Sistem OperationIf a given box has a Matching Percentage which is above the GPAthen the box is marked as candi<strong>da</strong>te to be assigned to the correspondingoutlet. The optimization mo<strong>de</strong>l also consi<strong>de</strong>rs the maximizationof the sum of all the matching percentages, assuring thatboxes will be sent to the outlets which fit better with its content. Ifthe GPA is too low, then the estimated costs of taking back productsfrom the outlets at the end of the season increase. If the GPAis too high, then only a few boxes are sent through reverse crossdocking,and traditional storage and picking must be performedwhich consequently increases the costs.The objective function of the mo<strong>de</strong>l minimizes the total costs andmaximizes the sum of all matching percentages (1). Constraintsof the mo<strong>de</strong>l are: computation of products from a box (C ai ) whichare in excess in comparison to the outlet or<strong>de</strong>r (O ai )or numberof products of a given reference which are not enough to fulfillthe outlet requirements (2); computation of the Matching Percentage(MP i j ) (3); Comparison between the Matching Percentage ofa given pair box-outlet and the Global Percentage of Acceptance(GPA) (4);a box i is potentially assigned to an outlet j (PA i j ) ifits matching percentage is greater than GPA (5); a box can be assignedto the traditional method (BT i ) only if it is not assignedfor cross docking to any of the outlets (6) ; as a percentage, GPAand MP i j must be less than or equal to 1 (7) and (8);computationof the cost of returned products at the end of the season. Thistakes into account an historical probability of returning a producta from the outlet j based on historical <strong>da</strong>ta (PR a j ). This is a parameterof the mo<strong>de</strong>l. The estimation of products returned is thefirst integer number greater than the value obtained ( RP ˆ a j ) (9); binary,non-negativity and integer constraints (10),(11) and(12). Themo<strong>de</strong>l was solved using the Lingo Software and results show thatthe costs of the total system can be reduced when the reverse crossdocking practices are implemented.AJI JMinz ∑ C ai ·BT i ·UPC a + ∑ RP ˆ a j ·RLC a +(M−∑∑ MP i j ·PA i j )a=1j=1i=1 j=1(1)A∑ (C ai − O ai ) ·Y ai = SP i j − LP i j ∀i, j (2)a=1( )SPi jMP i j = 1 −∑ A a=1 C ∀i, j (3)aiMP i j − GPA = CD i j −CT i j ∀i, j (4)ALIO-EURO <strong>2011</strong> – 216


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Where:a = {1..A} Set of articlesi = {1..I} Set of boxesPA i j ≥ MP i j − GPA∀i, j (5)JBT i ≤ ∑ PA i j ∀i (6)j=1GPA ≤ 1 (7)MP i j ≤ 1 (8)I∑(PA i j · LP i j ·C ai ) · PR a j ≤ RPˆa j ∀a, j (9)i=1BT i ,PA i j ∈ {0,1} (10)SP i j ,LP i j ,MP i j ,CD i j ,CT i j ,GPA ≥ 0 (11)RP ˆ i j ∈ {integer} (12)j = {1..J} Set of outlet storesM =a big number greater than the maximum possible value of thesum of all matching percentages joint togetherUPC a =Processing cost per unit a in the distribution center.MP i j = Matching Percentage: Is the percentage in which box i,fulfill the or<strong>de</strong>r from outlet j.SP i j = Number of products in box i which are sent in excess to theoutlet j.LP i j = Number of products in box i which are lacking to fulfill theor<strong>de</strong>r of outlet j.CD i j =Positive difference between the percentage of matching andthe GPA.CT i j =Negative difference between the percentage of matchingand the GPA.4. CONCLUSIONS AND FUTURE RESEARCHCross-docking is a common strategy that have shown its benefitsin reducing operational costs and time in business. In this paperwe presented a mo<strong>de</strong>l in which this practice can be used in a ReverseLogistics context. This mo<strong>de</strong>l was applied to an environmentwhere companies own their direct and outlet store channels. Prerequisitfor the mo<strong>de</strong>l application are two characteristics: availabilityof information about the <strong>de</strong>sirable assortment of productsfrom the outlet stores, and information about the products returnedfrom the direct stores at box level. The mo<strong>de</strong>l proposed establishedthe optimum global percentage of acceptance in or<strong>de</strong>r tomaximize the benefits for the company. The results obtained furthermoreshow that companies can obtain important reductions inoperational costs from the application of this reverse cross dockingmo<strong>de</strong>l.For future research, the consi<strong>de</strong>ration of a dynamic allocation ofinventory could provi<strong>de</strong> additional insights and probably lead toan improvement of the initial solution presented in this paper.5. REFERENCES[1] H. Yan and S.-L. Tang, “Pre-distribution and pos-distributioncross-docking operations,” Transportation Research Part E,vol. 45, pp. 843–859, May 2009.[2] R. Larbi, G. Alpan, P. Baptiste, and B. Penz, “Schedulingcross docking operations un<strong>de</strong>r full, partial and no informationon inbound arrivals,” Computers and Operations Research,vol. 38, pp. 889–900, <strong>2011</strong>.[3] G. Alpan, R. Larbi, and B. Penz, “A boun<strong>de</strong>d dynamic programmingapproach to schedule operations in a cross dockingplatform,” Computers and Industrial Engineering, vol. 60, pp.385–396, April <strong>2011</strong>.[4] A. Diaz, M. J. Alvarez, and P. Gonzalez, Logística Inversa yMedio Ambiente. Madrid, Spain: Mc Graw-Hill Interamericana<strong>de</strong> Espana S.A., 2004.[5] M. Fleischmann, J. M. Bloemhof-Ruwaard, R. Dekker,E. V. D. Laan, J. A. van Nunen, and L. N. V. Wassenhove,“Quantitative mo<strong>de</strong>ls for reverse logistics,” European Journalof operational research, vol. 103, pp. 1–17, June 1997.[6] D. Rogers and R. Tibben-Lembke, Going Backwards: ReverseLogistics Trends and Practices. Reverse Logistics ExecutiveCouncil, Eds., 1998.[7] U. M. Apte and S. Viswanathan, “Effective cross docking forimproving distribution efficiencies,” International Journal ofLogistics:Research and Applications, vol. 3, no. 3, pp. 291–302, 2000.[8] G. S. Jr., P. Evans, and L. E. Shulman, Delivering Results:A new man<strong>da</strong>te for human resource professionals. Boston,USA: HBSP, 1992, ch. Competing on Capabilities: The NewRules of Corporate Strategy, pp. 1–355.ALIO-EURO <strong>2011</strong> – 217


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Comparing Roster Patterns Within a Single Depot Vehicle-Crew-Roser ProblemMarta Mesquita ∗ Margari<strong>da</strong> Moz † Ana Paias ‡ Margari<strong>da</strong> Pato §∗ ISA-UTL, CIOTapa<strong>da</strong> <strong>da</strong> Aju<strong>da</strong>, 1349-017 Lisboa, Portugalmarta@math.isa.utl.pt† § ISEG-UTL, CIORua do Quelhas 6, 1200-781 Lisboa, Portugal† mmoz@iseg.utl.pt § mpato@iseg.utl.pt‡ DEIO-FCUL, CIOBloco C6, Piso 4, Ci<strong>da</strong><strong>de</strong> Universitaria, 1749-016, Lisboa, Portugalampaias@fc.ul.ptABSTRACTThe integrated vehicle-crew-roster problem aims to simultaneously<strong>de</strong>termine minimum cost vehicle and <strong>da</strong>ily crew schedules thatcover all timetabled trips and a minimum cost roster covering all<strong>da</strong>ily crew duties according to a pre-<strong>de</strong>fined <strong>da</strong>ys-off pattern. Thisproblem is solved by a heuristic approach based on Ben<strong>de</strong>rs <strong>de</strong>compositionthat iterates between the solution of an integrated vehicle-crewscheduling problem and the solution of a rostering problem.Computational experience with <strong>da</strong>ta from two bus companiesin Portugal is used to compare two rostering patterns withinvehicle-crew-roster solutions.Keywords: Rostering, vehicle-scheduling, crew-scheduling, Ben<strong>de</strong>rs<strong>de</strong>composition1. INTRODUCTIONThe integrated vehicle-crew-roster problem aims to simultaneouslyassign drivers of a company to vehicles and vehicles to a set of pre<strong>de</strong>finedtimetabled trips that cover passenger transport <strong>de</strong>mand in aspecific area, during a planning horizon. Due to the complexity ofthe corresponding combinatorial optimization problem, it is usuallytackled on a sequential basis beginning with vehicle scheduling,followed by crew scheduling and, lastly, driver rostering. Vehiclescheduling produces <strong>da</strong>ily schedules for the vehicles that performall trips. The crew scheduling <strong>de</strong>fines <strong>da</strong>ily crew duties thatcover the respective vehicle schedules. Finally, for the planninghorizon, crew duties are assigned to the drivers of the companyleading to a roster that must comply with labor and institutionalnorms. However, there is a high <strong>de</strong>pen<strong>de</strong>ncy among these threeproblems and <strong>de</strong>spite its computational bur<strong>de</strong>n, some work hasbeen reported on the integration of all or some of these problems,expecting to outperform the corresponding sequential approaches.Among other authors, [1], [2] and [3] have <strong>de</strong>veloped efficient algorithmsfor the integrated vehicle-crew problem. Crew-rosteringintegration has been <strong>de</strong>vised by [4], [5] and by [6] albeit withinother transport contexts (railway, airline and airport staff).Following the i<strong>de</strong>a that staff costs constitute more than 50% of operatingcosts, one wants to compare two different roster patternsin what concerns the resulting integrated vehicle-crew-roster solutions.A heuristic approach for an integrated vehicle-crew-rosterproblem with <strong>da</strong>ys-off pattern (VCRPat) is presented. The approachcombines column generation and branch-and-bound techniqueswithin a Ben<strong>de</strong>rs <strong>de</strong>composition and iterates between thesolution of an integrated vehicle-crew scheduling problem and thesolution of a rostering problem. A preliminary insight on this <strong>de</strong>compositionapproach was already presented by the authors in [7].Ben<strong>de</strong>rs <strong>de</strong>composition methods have been proposed by [8], [9],[10] and [11], although for airline operations problems. This paperis organized as follows: Section 2 introduces the VCRPat alongwith the two <strong>da</strong>ys-off patterns; Section 3 <strong>de</strong>scribes the mathematicalmo<strong>de</strong>l; Section 4 presents the <strong>de</strong>composition algorithm andSection 5 gives some conclusions from preliminary tests.2. PROBLEM DEFINITIONDuring a planning horizon H, partitioned into 49 <strong>da</strong>ys, a set M ofdrivers must be assigned to a fleet of vehicles housed at a <strong>de</strong>pot din or<strong>de</strong>r to perform a set of timetabled trips (trips for short). Thelocation and the number of vehicles available at the <strong>de</strong>pot as wellas the set of trips to be performed on each <strong>da</strong>y h are known. Foreach trip the starting and ending times and locations are given.Trips i and j are compatible if the same vehicle can perform bothtrips in sequence. Between compatible trips i and j a <strong>de</strong>adheadtrip may occur where the vehicle runs without passengers. Thereare three types of <strong>de</strong>adhead trips: those between the end locationof a trip and the start location of a compatible trip, those from the<strong>de</strong>pot to the start location of a trip (pull-out) and those from an endlocation of a trip to the <strong>de</strong>pot (pull-in). The set of timetabled tripsand <strong>de</strong>adhead trips performed by a vehicle on <strong>da</strong>y h ∈ H <strong>de</strong>finesa vehicle schedule. Each vehicle schedule starts and ends at the<strong>de</strong>pot and is subdivi<strong>de</strong>d into points, called relief points, where achange of driver may occur. Two consecutive relief points <strong>de</strong>finea task, that is, the smallest amount of work to be assigned to thesame vehicle and crew.A crew duty is a <strong>da</strong>ily combination of tasks that respects labor law,union contracts and internal rules of the company. These rules <strong>de</strong>pendon the particular situation un<strong>de</strong>r study and usually constrainthe maximum and minimum spread (time elapsed between the beginningand end of a crew duty), the maximum working time withouta break, the break duration, etc. Crew duties are assigned tothe drivers to form their work schedules - the roster. This is usuallydone on a cyclic basis so as all workers have the same type of workand rest periods. In this paper, we <strong>de</strong>al with a group of drivers withmore flexibility on the rosters. These drivers work according to apre-<strong>de</strong>fined cyclic <strong>da</strong>ys-off pattern where all drivers share the sametype of rest periods, but not necessarily the same crew duties. Fora time horizon of 7 weeks (49 <strong>da</strong>ys), this <strong>da</strong>ys-off pattern, PatI,inclu<strong>de</strong>s 4 rest periods of 2 consecutive <strong>da</strong>ys-off and 2 rest periodsALIO-EURO <strong>2011</strong> – 218


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>of 3 consecutive <strong>da</strong>ys-off. These 2 rest periods contain a Satur<strong>da</strong>yand occur in sequence with 5 work<strong>da</strong>ys in between. The remainingwork periods have 6 consecutive work<strong>da</strong>ys.Table 1 displays PatI through a 0 − 1 matrix, where 0 stands for<strong>da</strong>y-off and 1 for work<strong>da</strong>y. Each row of the matrix corresponds toa week<strong>da</strong>y. Seven consecutive columns correspond to the 7 weeksof the time horizon, being the last <strong>da</strong>y in column i (i = 1,...,6)followed by the first <strong>da</strong>y in column i+1 and the last <strong>da</strong>y in column7 followed by the first <strong>da</strong>y of column 1. Since a 49 <strong>da</strong>ys schedulemay begin in row 1 of any column, this <strong>da</strong>ys-off pattern leads toa set S of 7 schedules s i ,i = 1,...,7. That is, for example, a driverassigned to schedule s 4 ∈ S works according to columns 4, 5, 6, 7,1, 2, and 3, during weeks, 1, 2, 3, 4, 5, 6, and 7, respectively.1 2 3 4 5 6 7Mon 0 1 1 1 1 1 0Tue 0 0 1 1 1 1 1Wed 1 0 0 1 1 1 1Thu 1 1 0 0 1 1 1Fri 1 1 1 0 0 1 1Sat 1 1 1 1 0 0 1Sun 1 1 1 1 0 0 1Table 1: Cyclic <strong>da</strong>ys-off pattern (PatI).Usually, in public transit companies the workforce <strong>de</strong>mand is constantfrom Mon<strong>da</strong>y to Fri<strong>da</strong>y but it <strong>de</strong>creases during the weekend.But, for the cyclic <strong>da</strong>ys-off pattern displayed in Table 1, in eachweek<strong>da</strong>y, there are always 2 schedules covering drivers <strong>da</strong>ys-offand 5 schedules covering drivers work<strong>da</strong>ys. In or<strong>de</strong>r to minimizethe number of drivers assigned to work we propose an additionalschedule, s 8 , with 7 rest periods of 2 consecutive <strong>da</strong>ys-off that alwaysoccur on Satur<strong>da</strong>y and Sun<strong>da</strong>y. During the planning horizonH, drivers assigned to s 8 work Mon<strong>da</strong>y through Fri<strong>da</strong>y and restSatur<strong>da</strong>y and Sun<strong>da</strong>y. The 7 schedules in PatI plus schedule s 8<strong>de</strong>fine the set S of schedules in pattern PatII, displayed in Table 2.1 2 3 4 5 6 7 8Mon 0 1 1 1 1 1 0 1Tue 0 0 1 1 1 1 1 1Wed 1 0 0 1 1 1 1 1Thu 1 1 0 0 1 1 1 1Fri 1 1 1 0 0 1 1 1Sat 1 1 1 1 0 0 1 0Sun 1 1 1 1 0 0 1 0Table 2: Cyclic/non cyclic <strong>da</strong>ys-off pattern - PatII.PatII tends to counterbalance the lower <strong>de</strong>mand during weekend.From Mon<strong>da</strong>y to Fri<strong>da</strong>y, the covering of drivers <strong>da</strong>ys-off and work<strong>da</strong>ysis equal in PatI and PatII. However, on Satur<strong>da</strong>y and Sun<strong>da</strong>yPatII has 3 schedules covering drivers <strong>da</strong>ys-off.A roster is an assignment of drivers to schedules in S covering allthe crew duties <strong>de</strong>fined for the planning horizon while satisfyinglabor and internal rules of the company. Any number of driversmay be assigned to each schedule.Note that, both PatI and PatII satisfies a minimum number of <strong>da</strong>ysoffper week (1 <strong>da</strong>y), the minimum number of consecutive <strong>da</strong>ysoff(2 <strong>da</strong>ys), a minimum number of Sun<strong>da</strong>ys/weekends off in theplanning period (2 <strong>da</strong>ys) and a maximum of consecutive work<strong>da</strong>ys(6 <strong>da</strong>ys). Besi<strong>de</strong>s these constraints, drivers must rest a minimumnumber of hours between two consecutive crew duties and, consequently,a crew duty starting in the morning cannot be assignedto a driver that had worked on a crew duty starting in the eveningof the previous <strong>da</strong>y. To avoid infeasible sequences of crew duties,for each <strong>da</strong>y h, the set of crew duties, L h , is partitioned into earlycrew duties, L h E , starting before 3:30 p.m. and late crew duties, Lh A ,starting after 3:30 p.m.Moreover, for a roster to be well accepted in the company it shoul<strong>da</strong>ttain balanced workload. To balance workload, the set L h is alsopartitioned into short duties, LT h , which have a maximum spread of5 hours (without a break), normal duties, LN h , with spread ∈ [5,9]hours, and long duties, LO h , with spread ∈ [9,10.75] hours (withovertime).The VCRPat aims to simultaneously <strong>de</strong>termine a minimum cost setof vehicle schedules that <strong>da</strong>ily covers all timetabled trips, a minimumcost set of crew duties that <strong>da</strong>ily covers all vehicle schedulesand a minimum cost balanced roster for the time horizon.3. MATHEMATICAL MODELFor each <strong>da</strong>y h, we <strong>de</strong>fine the vehicle scheduling network G h =(V h ,A h ). The no<strong>de</strong> set V h = N h ∪{d s ,d t } inclu<strong>de</strong>s N h correspondingto the timetabled trips to be performed on <strong>da</strong>y h and {d s ,d t }corresponding to the <strong>de</strong>pot d. The arc set A h = I h ∪ (d s × N h ) ∪(N h ×d t ) contains I h , the set of arcs representing the pairs of compatibletimetabled trips, and the sets of arcs related with pull-outand pull-in trips. Each path on graph G h , starting in d s and endingin d t , <strong>de</strong>fines a vehicle schedule for a specific vehicle on <strong>da</strong>y h. Aset of paths from d s to d t disjoint on N h , covering all no<strong>de</strong>s fromN h , <strong>de</strong>fines a vehicle scheduling for <strong>da</strong>y h. Decision variables z h i jindicate whether a vehicle performs trips i and j in sequence on<strong>da</strong>y h, or not. In particular, z h d s j and zh id trepresent, respectively,a pull-out from the <strong>de</strong>pot to trip j and a pull-in from i to the <strong>de</strong>pot.Vehicle costs c i j , related with fuel consumption and/or vehiclemaintenance, are associated to the corresponding arcs in G h .Daily vehicle schedules must be covered with <strong>da</strong>ily crew duties.Since the <strong>de</strong>pot as well as the end location of timetabled trips arepotential relief points, we assume that task (i, j) corresponds eitherto the <strong>de</strong>adhead from trip i to trip j followed by trip j or to the<strong>de</strong>adhead from trip i to trip j followed by trip j and a pull-in. LetL h i j ⊆ Lh be the set of crew duties covering task (i, j), on <strong>da</strong>y h. Letvariables w h l indicate whether crew duty l is selected on <strong>da</strong>y h, ornot. A cost e l is assigned to each crew duty l. Cost e l usually inclu<strong>de</strong>sa fixed cost (for example, a driver’s salary) and operationalcosts related to overtime, evening periods, etc.Each anonymous crew duty in the solution has to be assigned toa specific driver which works according to one of the pre-<strong>de</strong>fined<strong>da</strong>ys-off schedules. Let xs m = 1 if driver m is assigned to schedules, or 0 otherwise. Let y mhl = 1 if driver m performs crew duty l on<strong>da</strong>y h, or 0 otherwise.The objective function inclu<strong>de</strong>s different measures. Managementoften wishes to know the minimum workforce required to operatethe fleet of vehicles, so as to transfer drivers to other <strong>de</strong>partmentsof the company or to replace those absent. Such policy results inminimizing crew duty costs e l associated to variables w h l = 1 andcosts r m associated to x m s = 1. To balance workload, penalties λ Tand λ O are associated with variables η T and η O that represent,respectively, the maximum number of short and long crew dutiesassigned to a driver during H. The integer linear programmingformulation follows:min ∑ ( ∑ c i j z h i j + ∑ e l w h l )+ ∑h∈H (i, j)∈A h l∈L h m∈M∑r m xs m +λ T η T +λ O η Os∈S(1)∑ z h i j = 1, j ∈ Nh ,h ∈ H (2)i:(i, j)∈A h∑j:(i, j)∈A h z h i j = 1, i ∈ Nh ,h ∈ H (3)ALIO-EURO <strong>2011</strong> – 219


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>∑ y mhl∈LAh∑ y mhl∈LEh∑ z h d s i ≤ ν,i∈N h h ∈ H (4)∑ w h l − zh i j ≥ 0, (i, j) ∈ Ah ,h ∈ H (5)l∈Li h j∑ y mhl − w h l = 0, l ∈ Lh ,h ∈ H (6)m∈M∑ xs m ≤ 1, m ∈ M (7)s∈S∑ y mhl − ∑ a h s xs m ≤ 0,l∈L h s∈Sm ∈ M,h ∈ H (8)l + ∑l∈L (h−1)El + ∑l∈L (h−1)A∑∑h∈H l∈Lthy m(h−1)l ≤ 1, m ∈ M,h ∈ H − {1}, (9)y m(h−1)l ≤ 1, m ∈ M,h ∈ H − {1}, (10)y mhl − η t ≤ 0, m ∈ M,t ∈ {T,O} (11)z h i j ∈ {0,1}, (i, j) ∈ Ah ,h ∈ H (12)w h l ∈ {0,1}, l ∈ Lh ,h ∈ H (13)y mhl ∈ {0,1}, l ∈ L h ,m ∈ M,h ∈ H (14)x m s ∈ {0,1}, s ∈ S,m ∈ M (15)η T ,η O ≥ 0 and integer. (16)Constraints (2), (3) and (4) <strong>de</strong>scribe the scheduling of vehicles ensuringthat each timetabled trip is performed exactly once by a vehicle.These constraints ensure that, for each <strong>da</strong>y h ∈ H, graph G his partitioned into a set of disjoint paths, vehicle schedules, coveringall vertex in N h . Constraints (4), where ν is the number ofvehicles available at the <strong>de</strong>pot, are related with the <strong>de</strong>pot capacity.Constraints (5) link vehicle and crew duty variables ensuringthat each task in a vehicle schedule is covered by one <strong>da</strong>ily crewduty. Equalities (6) impose that each crew duty, in a solution, mustbe assigned to one driver. Constraints (7) state that each driver isassigned to one of the seven <strong>da</strong>ys-off schedules or is available forother service in the company. Constraints (8), where parametera h s = 1 if h is a work<strong>da</strong>y on schedule s, impose coherence betweenthe assignment of a crew duty to a driver and the schedule assignedto this driver. Inequalities (9) forbid the sequence late/early dutiesto ensure that drivers rest a given minimum number of hoursbetween consecutive duties. Furthermore, (9) and (10) impose a<strong>da</strong>y-off period between changes of duty types. Inequalities (11)<strong>de</strong>termine the maximum number of short/long duties per driver.Note that, these constraints along with the two last terms of theobjective function ensure the integrality of variables η T and η O .To <strong>de</strong>al with the VCRPat, a <strong>de</strong>composition approach is suggestedby three combinatorial structures inclu<strong>de</strong>d in this mathematicalformulation. A network flow problem is related with the <strong>da</strong>ilyscheduling of vehicles. A set covering structure <strong>de</strong>fines the setof crew duties that <strong>da</strong>ily cover the vehicle schedules and a mixedcovering-assignment problem with additional constraints <strong>de</strong>finesthe roster, for the planning horizon H, that covers all <strong>da</strong>ily crewduties.approach based on Ben<strong>de</strong>rs <strong>de</strong>composition. The <strong>de</strong>compositionheuristic alternates between the solution of a master problem involvingthe z, w variables, a vehicle-crew scheduling problem foreach <strong>da</strong>y of H, and the solution of the corresponding subprobleminvolving the y,x,η variables, a rostering problem for H.Concerning the master problem, a non-exact approach is proposedwhere, in each iteration, the Ben<strong>de</strong>rs cuts are relaxed into the masterobjective function, associated with non-negative multipliers.An a<strong>de</strong>quate choice for these multiplier values leads to a relaxedmaster problem that can be partitioned into |H| in<strong>de</strong>pen<strong>de</strong>nt integratedvehicle-crew scheduling problems, one for each <strong>da</strong>y ofthe planning horizon H. In each iteration, crew duty costs are up<strong>da</strong>tedwith information given by the rostering suproblem variablesand the resulting integrated vehicle-crew scheduling problems aresolved by an algorithm which combines a heuristic column generationprocedure with a branch-and-bound scheme.As for the subproblem, fixing the values of the z and w variablesin VCRPat at values z and w, given by the optimal solution ofthe master problem, one obtains a rostering problem. Exact stan<strong>da</strong>r<strong>da</strong>lgorithms are used to solve the linear relaxation of the rosteringsubproblem. Whenever the resulting solution is not integer,branch-and-bound techniques are applied to obtain a feasibleroster. Integer solutions for the roster subproblem involve alarge number of binary variables and a great amount of resourcesis nee<strong>de</strong>d to obtain these solutions. To overcome such drawback,different strategies have been incorporated into the branching process,yielding in most cases, to a "good" feasible roster in shortcomputing time.A computational experiment was performed using two real-world<strong>da</strong>ta set instances. In each iteration, the master problem vehiclecrewsolution and the subproblem rostering solution, in case ofbeing integer, are both inclu<strong>de</strong>d in a pool of feasible solutions forVCRPat for further analyses from different points of view.Preliminary computational results concerning pattern PatI showthat the <strong>de</strong>composition algorithm adjust crew duties in the masterproblem, thus inducing better subproblem solutions in what concernsthe number of drivers and/or the number of short and longcrew duties assigned to a driver. Such improvement on the firstiteration solution quality may be seen through the replacement oflong and short crew duties by normal crew duties originating afairer distribution of work among the drivers. Note that, the firstiteration solution corresponds to the sequential solution.5. CONCLUSIONSThis paper proposes a Ben<strong>de</strong>rs <strong>de</strong>composition based algorithm thatgenerates a pool of feasible solutions for a single <strong>de</strong>pot vehiclecrew-rosterproblem. The approach outperforms the traditional sequentialscheme. The feedback given by Ben<strong>de</strong>rs cuts gui<strong>de</strong>d thebuilding of the <strong>da</strong>ily vehicle-crew schedules thus leading to balancedworkload rosters with fewer drivers.Due to the weight of driver costs in the VCRPat overall cost, themethodology is now being used to analyze the influence of differentroster patterns into the VCRPat final solutions.4. DECOMPOSITION ALGORITHMDifferent temporal scheduling problems may be i<strong>de</strong>ntified in theabove mathematical formulation: |H| <strong>da</strong>ily integrated vehicle-crewscheduling problems, with variables z and w for the vehicle schedulesand for the crew duties, respectively; and a rostering problemfor the whole planning horizon, with variables y, x and η. Thesetemporal scheduling problems share a set of variables in constraintset (6). To handle these linking constraints, we propose a heuristic6. ACKNOWLEDGEMENTSThis research was fun<strong>de</strong>d by POCTI/ISFL/152.7. REFERENCES[1] D. Huisman, R. Freling, and A. Wagelmans, “Multiple-<strong>de</strong>potintegrated vehicle and crew scheduling,” Transportation Sci-ALIO-EURO <strong>2011</strong> – 220


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>ence, vol. 39, no. 4, pp. 491–502, 2005.[2] M. Mesquita and A. Paias, “Set partitioning/covering-base<strong>da</strong>pproaches for the integrated vehicle and crew schedulingproblem,” Computers & Operations Research, vol. 35, no. 5,pp. 1562–1575, 2008.[3] I. Steinzen, V. Gintner, L. Suhl, and N. Kliewer, “A timespacenetwork approach for the integrated vehicle-and crewschedulingproblem with multiple <strong>de</strong>pots,” TransportationScience, vol. 44, no. 3, pp. 367–382, 2010.[4] A. Caprara, M. Monacci, and P. Toth, “A global method forcrew planning in railway applications,” in Computer-Ai<strong>de</strong>dScheduling of Public Transport, Lecture Notes in Economicsand Mathematical Systems, Springer, 2001, pp. 17–36.[5] R. Freling, R. Lentink, and A. Wagelmans, “A <strong>de</strong>cision supportsystem for crew planning in passenger transportation usinga flexible branch-and-price algorithm,” Annals of OperationsResearch, vol. 127, no. 1-4, pp. 203–222, 2004.[6] S. Chu, “Generating, scheduling and rostering of shift crewduties:applications at the hong kong international airport,”European Journal of Operational Research, vol. 177, no. 3,pp. 1764–1778, 2007.[7] M. Mesquita, M. Moz, A. Paias, and M. Pato, “An integratedvehicle-crew-roster problem with <strong>da</strong>ys-off pattern,”CIO-Working Paper 7, Tech. Rep., 2010.[8] J.-F. Cor<strong>de</strong>au, G. Stojković, F. Soumis, and J. Desrosiers,“Ben<strong>de</strong>rs <strong>de</strong>composition for simultaneous aircraft routingand crew scheduling,” Transportation Science, vol. 35, no. 4,pp. 375–388, 2001.[9] A. Mercier, J.-F. Cor<strong>de</strong>au, and F. Soumis, “A computationalstudy of ben<strong>de</strong>rs <strong>de</strong>composition for the integrated aircraftrouting and crew scheduling problem,” Computers & OperationsResearch, vol. 32, no. 6, pp. 1451–1476, 2005.[10] A. Mercier and F. Soumis, “An integrated aircraft routing,crew scheduling and flight retiming mo<strong>de</strong>l,” Computers &Operations Research, vol. 34, no. 8, pp. 2251–2256, 2007.[11] N. Papa<strong>da</strong>kos, “Integrated airline scheduling,” Computers &Operations Research, vol. 36, no. 1, pp. 176–195, 2009.ALIO-EURO <strong>2011</strong> – 221


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Insights on the exact resolution of the rostering problemMarta Rocha ∗ José Fernando Oliveira ∗ Maria Antónia Carravilla ∗∗ DEIG, <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> <strong>Engenharia</strong> <strong>da</strong> Universi<strong>da</strong><strong>de</strong> do PortoRua Dr. Roberto Frias, Porto, Portugal{marta, jfo, mac}@fe.up.ptABSTRACTThe purpose of this paper is to present some findings on the rosteringproblem resolution through the analysis of a real case study.The problem is initially formulated as a mixed integer problem(MIP) and solved with CPLEX, using the ILOG OPL Studio environment.The achieved findings and results are the basis for the<strong>de</strong>velopment of a constructive heuristic that consistently reachesa feasible solution, which is the optimal solution in this particularcase, in a shorter period of time than the MIP mo<strong>de</strong>l.Keywords: Rostering, Staff scheduling1. INTRODUCTIONHuman resource management is one of the most concerning issuesfor any organization due not only to its significant impact inthe total expenditure but also because it is highly constrained byhuman behavior aspects seeing that it <strong>de</strong>als with human beings.Having the right people doing the right task, at the right time, inthe right place, at the minimum cost is typically the aim of staffscheduling or rostering problems. These problems are restricted byseveral constraints such as <strong>de</strong>mand requirements, task skills specifications,legal or contractual obligations, employees preferences,among others. A <strong>de</strong>tailed <strong>de</strong>scription can be found in [1] and [2].2.1. Problem <strong>de</strong>scription2. CASE STUDYThe present work addresses the rostering problem of an organization,lea<strong>de</strong>r in its market segment, that works continuously, aroundthe clock, 365 <strong>da</strong>ys per year. The workforce is divi<strong>de</strong>d in teamswhich must be assigned to three eight-hour shifts: morning, afternoonand night. The workload shall be uniformly distribute<strong>da</strong>mong teams, no distinction is ma<strong>de</strong> concerning skills of the employeesor shift types. The problem consists in <strong>de</strong>termining whichteam will work on each shift in each of the planning horizon <strong>da</strong>ysand how the rest or break <strong>da</strong>ys shall be interposed between work<strong>da</strong>ys.2.2. Developed MIP mo<strong>de</strong>lWe <strong>de</strong>veloped a mixed-integer formulation for this problem, wherethe objective function is to minimize the maximum number of the<strong>da</strong>ys that a team works in each shift. The <strong>de</strong>cision variables assumebinary values indicating when a team is assigned to a shifton a specific <strong>da</strong>y. This objective function levels the working <strong>da</strong>ysof each team, leading to a solution in which each team works thesame number of <strong>da</strong>ys in each shift.The constraints to this problem guarantee that:1. each <strong>da</strong>y, every team has exactly one shift assigned, eithera work shift or a break shift.2. each <strong>da</strong>y, every working shift has exactly one team assigned.3. no team works more than a maximum number of consecutive<strong>da</strong>ys.4. each team works at least a minimum number of consecutiveworking <strong>da</strong>ys.5. the required shift sequence is followed.6. all teams have the same schedule, but with an offset betweenthem.2.3. FindingsWe ran this mo<strong>de</strong>l in the ILOG OPL Studio v6.3 and, althoughit led to some feasible solutions, it revealed the difficulty in findingthe right combination of input parameters: given a number ofteams and a number of shifts, which is the appropriate planninghorizon? And what shall the offset between teams be?In fact, the parameters of this problem are connected in such away that make it very tight and inflexible to allow large parametersvariations. Based on the tests results, we realized, for instance,that in or<strong>de</strong>r to get a feasible solution, the number of <strong>da</strong>ys in theplanning horizon must be a multiple of the number of teams andthat an offset between teams equal to the ratio between the numberof <strong>da</strong>ys and the number of teams usually leads to a good result.We <strong>de</strong>tected that a key issue, to which the mo<strong>de</strong>l proved to bevery sensitive, is the number of available break <strong>da</strong>ys. This valuecan be exactly <strong>de</strong>termined for a given number of teams, numberof shifts and number of <strong>da</strong>ys in the planning horizon. Consi<strong>de</strong>ringconstraints (1) and (2), we know that the number of breaks in each<strong>da</strong>y is given by the difference between the number of teams andthe number of shifts. The total number of breaks can be obtainedby multiplying this figure by the number of <strong>da</strong>ys in the planninghorizon. The number of available break <strong>da</strong>ys for each team is thenachieved by dividing the total number of breaks by the number ofteams.We verified the existence of two important conditions:1. consi<strong>de</strong>ring the scenario of having working-<strong>da</strong>y blocks withthe minimum possible length (equal to the minimum numberof consecutive working <strong>da</strong>ys), if the minimum numberof required break-<strong>da</strong>ys is equal or less than the number ofavailable breaks, the problem has a feasible solution. Otherwise,it may have or not a feasible solution. This conditionis thus sufficient but not necessary.2. consi<strong>de</strong>ring the scenario of having working-<strong>da</strong>y blocks withthe maximum possible length, if the minimum number ofrequired break-<strong>da</strong>ys is greater than the number of availablebreaks, the problem has no feasible solution. This is a necessarycondition but not sufficient.ALIO-EURO <strong>2011</strong> – 222


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>2.4. Developed constructive heuristicWhen the first condition is met we propose a constructive heuristicto build a feasible solution, consisting of only blocks of minimumconsecutive <strong>da</strong>ys length or a combination of these with minimumlength + 1 <strong>da</strong>y blocks. The first step is thus to check whether itis possible to use only minimum length blocks or if there is theneed to use a combination of minimum length blocks and minimumlength plus 1 <strong>da</strong>y blocks. Then, we assign the first blocksof the first shift to all the teams, assuming an offset equal to minimumconsecutive <strong>da</strong>ys or equal to the ratio of number of <strong>da</strong>ys andthe number of teams. If we use minimum length blocks, we assignone break after each block. If we use minimum length and minimumlength + 1 <strong>da</strong>y blocks we assign two breaks after each of theformer blocks and one break after the latter. After assigning theother shifts working blocks, we insert the required breaks at theend of the last shift block to wait until the first shift is availableagain. This closes the first sub-cycle. The procedure is conclu<strong>de</strong>dwith the replication of the first sub-cycle as many times as nee<strong>de</strong>din or<strong>de</strong>r to fulfill the planning horizon for all the teams.This heuristic was manually tested with several input parameterscombinations, which fulfilled the initial assumptions: number of<strong>da</strong>ys multiple of the number of teams and an offset equal to thenumber of minimum consecutive <strong>da</strong>ys or to the ratio between thenumber of <strong>da</strong>ys and the number of teams, and met the condition(1), leading always to a feasible solution. The <strong>de</strong>velopment of thisconstructive procedure aimed to <strong>de</strong>fine a consistent and reliableprocess for reaching a feasible solution. The results achieved so farshow, yet, that when a feasible solution is found, it is the optimalsolution.3. FUTURE WORKFuture work involves the software co<strong>de</strong> <strong>de</strong>velopment in or<strong>de</strong>r tomassively test the heuristic, making it possible to consoli<strong>da</strong>te andgeneralize the achieved results. We are confi<strong>de</strong>nt that this workwill provi<strong>de</strong> an important contribution to the research in staff schedulingand rostering problems.4. REFERENCES[1] A. T. Ernst, H. Jiang, M. Krishnamoorthy, and D. Sier, “Staffscheduling and rostering: a review of applications, methodsand mo<strong>de</strong>ls,” Eur. J. Oper. Res., vol. 153, no. 1, pp. 3–27,2004.[2] P. De Causmaecker and G. Van<strong>de</strong>n Berghe, “Towards areference mo<strong>de</strong>l for timetabling and rostering,” Annals of OperationsResearch, pp. 1–10–10, 03 2010. [Online]. Available:http://www.springerlink.com/content/8024n8h207807504/ALIO-EURO <strong>2011</strong> – 223


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Comparing Hybrid Constructive Heuristics for University Course TimetablingDario Lan<strong>da</strong>-Silva ∗ Joe Henry Obit †∗ ASAP Research Group, School of Computer ScienceUniversity of Nottingham, United Kingdom<strong>da</strong>rio.lan<strong>da</strong>silva@nottingham.ac.uk† Labuan School of Informatics ScienceUniversity Malaysia Sabah, MalaysiaResearch carried out while J. Obit was a PhD stu<strong>de</strong>nt in Nottingham.joehenryobit@yahoo.comABSTRACTThis exten<strong>de</strong>d abstract outlines four hybrid heuristics to generateinitial solutions to the University course timetabling problem.These hybrid approaches combine graph colouring heuristics andlocal search in different ways. Results of experiments using twobenchmark <strong>da</strong>tasets from the literature are presented. All the fourhybrid initialisation heuristics <strong>de</strong>scribed here are capable of generatingfeasible initial timetables for all the test problems consi<strong>de</strong>redin these experiments.Keywords: Course timetabling, Hybrid heuristics, Event scheduling,Constructive heuristics1. INTRODUCTIONWe refer to the University course timetabling problem as <strong>de</strong>scribedby Socha et al. [1] with: n events E = {e 1 ,e 2 ,...,e n }, k timeslotsT = {t 1 ,t 2 ,...,t k }, m rooms R = {r 1 ,r 2 ,...,r m } and a set S ofstu<strong>de</strong>nts. Each room has a limited capacity and some additionalfeatures. Each event requires a room with certain features. Eachstu<strong>de</strong>nt attends a number of events which is a subset of E. Theproblem is to assign the n events to the k timeslots and m rooms insuch a way that all hard constraints are satisfied and the violationof soft constraints is minimised.The hard constraints that must be satisfied for a timetable to befeasible are as follows. HC1: a stu<strong>de</strong>nt cannot attend two events simultaneously,i.e. events with stu<strong>de</strong>nts in common must betimetabled in different timeslots. HC2: only one event may be assignedper timeslot in each room. HC3: the room capacity must beequal to or greater than the number of stu<strong>de</strong>nts attending the eventin each timeslot. HC4: the room assigned to an event must satisfythe features required by the event. The soft constraints that are <strong>de</strong>sirableto satisfy in or<strong>de</strong>r to assess the quality of a timetable are asfollows. SC1: stu<strong>de</strong>nts should not have only one event timetabledon a <strong>da</strong>y. SC2: stu<strong>de</strong>nts should not attend more that two consecutiveevents on a <strong>da</strong>y. SC3: stu<strong>de</strong>nts should not attend an event inthe last timeslot of a <strong>da</strong>y.It has been shown in the literature that a sequential heuristic methodcan be very efficient for generating initial solutions [2, 3]. Asequential heuristic assigns events one by one, starting from theevent which is consi<strong>de</strong>red the most difficult to timetable in somesense. The ‘difficulty’ of scheduling an event can be measured bydifferent criteria (i.e. the number of other conflicting events or thenumber of stu<strong>de</strong>nts attending the event). However, a sequentialheuristic alone does not guarantee that feasible solutions will befound even with the combination of more than one heuristic. Forexample, Abdullah et al. [4] proposed a method, based on a sequentialheuristic, to construct initial timetables. However, theirmethod failed to generate a feasible solution for the large instanceof the Socha et al. problem instances [1].We propose hybrid heuristics to create initial feasible timetablesfor the University course timetabling problem <strong>de</strong>scribed above.We combine traditional graph colouring heuristics with various localsearch methods including a simple tabu search. In the experimentsof this work we use the 11 benchmark <strong>da</strong>ta sets proposedby Socha et al. [1] and also the set of problem instances from theInternational Timetabling Competition (ITC) 2002 [5]. The proposedheuristics generate feasible timetables for all the instancesin our experiments. However, these methods do not tackle the satisfactionof soft constraints. Then, we obtain feasible solutionsthat might still have relatively high number of soft constraint violations.The rationale for this is to allow flexibility for anotheralgorithm, that seeks to improve the satisfaction of constraints, tostart the search from the feasible timetables. This has proven to bebeneficial in our related work helping the improving algorithm toachieve extremely good results [6, 7]. It is difficult to compare theresults in this paper with the literature because most other works(e.g. [3]) incorporate the construction of initial timetables withinthe overall method to solve the problem, i.e. constructing initial solutionsand improving them are combined into a single approach.The next section <strong>de</strong>scribes the proposed hybrid heuristics.2. GENERATING INITIAL TIMETABLESIn or<strong>de</strong>r to <strong>de</strong>velop effective algorithms for tackling hard constraintsin the subject problem, we combine techniques such asgraph colouring, local search and tabu search. We found that thesearch components incorporated in the hybrid methods are inter<strong>de</strong>pen<strong>de</strong>nton their ability to produce a feasible timetable. In otherwords, when one of these components is disabled or removed, theremaining components are not able to produce feasible solutionsin particular for medium and large instances. Therefore, the hybrids<strong>de</strong>scribed next are effective tailored mechanisms to generatefeasible timetables for the subject problem.2.1. Largest Degree, Local Search and Tabu Search (IH1)We adopted the heuristic proposed by Chiarandini et al. [8] an<strong>da</strong>d<strong>de</strong>d the Largest Degree heuristic to Step one as <strong>de</strong>scribed next.Largest Degree refers to the event with the largest number of conflictingevents (events that have at least one stu<strong>de</strong>nt in common).Step one - Largest Degree Heuristic. In each iteration, the un-ALIO-EURO <strong>2011</strong> – 224


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>scheduled event with the Largest Degree is assigned to a timeslotselected at random without respecting conflicts between events.Once all events have been assigned into a timeslot, the maximummatching algorithm for bipartite graph is used to assign each eventto a room. At the end of this step, there is no guarantee for thetimetable to be feasible. Then, steps one and two below are executediteratively until a feasible solution is constructed.Step two - Local Search. We employ two neighbourhood movesin this step. Move one (M1) selects one event at random and assignsit to a feasible pair timeslot-room also chosen at random.Move two (M2) selects two events at random and swaps theirtimeslots and rooms while ensuring feasibility is maintained. Thatis, neighbourhood moves M1 and M2 seek to improve the timetablegenerated in Step one. A move is only accepted if it improves thesatisfaction of hard constraints (because the moves seek feasibility).This step terminates if no move produces a better (closer tofeasibility) solution for 10 iterations.Step three - Tabu Search. We apply a simple tabu search using aslight variation of move M1 above. Here, M1 only selects an eventthat violates hard constraints. The motivation is that the algorithmshould now only target events that violate hard constraints insteadof randomly rescheduling other events like in Step two. The tabulist contains events that were assigned less than tl iterations beforecalculated as tl = rand(10) + δ × n c , where 0 ≤ rand(10) ≤ 10,n c is the number of events involved in hard constraint violations inthe current timetable, and δ = 0.6. The usual aspiration criterionis applied to overri<strong>de</strong> tabu status, i.e. accept the move when a bestknown solution is found. This step terminates if no move producesa better (closer to feasibility) solution for ts iterations.2.2. Saturation Degree, Local Search and Tabu Search (IH2)This heuristic uses Saturation Degree, which refers to the numberof resources (timeslots and rooms) still available to timetable agiven event without conflicts in the current partial solution. Inthe previous heuristic IH1 the assignment of events in Step one isdone without checking conflicts. The difference in heuristic IH2is that we first check conflicts between the unassigned event andthose events already assigned to the selected timeslot. If there aretimeslots with no-conflicting events already assigned (saturation<strong>de</strong>gree of the event to assign is greater than zero), the event isassigned to a feasible timeslot selected at random. If there are nosuch timeslots (saturation <strong>de</strong>gree of the event to assign is zero),the events already assigned to the timeslot are ejected and put ina list of events to re-schedule. The heuristic then attempts to reassignthese ejected events into conflict free timeslots if possible.Otherwise, these ejected events are put into random timeslot-room,even if conflicts arise, then later the local and tabu search of Steptwo and Step three as <strong>de</strong>scribed above, will <strong>de</strong>al with these ejecte<strong>de</strong>vents and the remaining conflicting assignments. In essence, inaddition to using Saturation Degree instead of Largest Degree, thissecond heuristic IH2 tries to fix some conflicts in the timetablebefore starting Steps two and three.2.3. Largest Degree, Saturation Degree, Local Search and TabuSearch (IH3)This heuristic incorporates both Largest Degree and Saturation Degree.The difference with heuristic IH2 is that in Step one, eventsare first sorted based on Largest Degree. After that, we choose theunassigned event with the Largest Degree and calculate its SaturationDegree. Then, Step one of this heuristic IH3 proceeds as inheuristic IH2, but when attempting to re-assign the ejected events,only those ejected events with Saturation Degree greater than zero(still available timeslots and room) are assigned to any feasibletimeslot-room. All ejected events with Saturation Degree zero aremoved from the re-schedule list to the list of unscheduled events.After each re-assigning, we re-calculate the Saturation Degree forall ejected events in the re-schedule list. This process in Step onecontinues and if after some given computation time there are stillevents in the unscheduled list, these events are then assigned torandom timeslot-room without respecting conflicts. Steps one andtwo as <strong>de</strong>scribed above follow implementing the local and tabusearch respectively. In essence, compared to heuristic IH2, thisheuristic IH3 combines Saturation Degree and Largest Degree inStep one trying to re-scheduled ejected events with less resourcesfirst. Algorithm 1 shows the pseudo-co<strong>de</strong> for the hybrid heuristicIH3, which in a sense, is the most elaborate one among methodsIH1, IH2 and IH3.2.4. Constraint Relaxation Approach (IH4)In this fourth heuristic approach, we introduce extra dummy timeslotsto place events with zero Saturation Degree and in this wayenforce the no-conflicts constraint by relaxing the availability oftimeslots. The number of extra dummy timeslots nee<strong>de</strong>d is <strong>de</strong>terminedby the size of the problem instance. This heuristic works asfollows. First, we sort the events using Largest Degree. The eventwith the Largest Degree is chosen to be scheduled first. If theevent has zero Saturation Degree, the event is assigned randomlyto one of the extra dummy timeslots. Once the algorithm assignsall events in the valid timeslots plus the extra dummy timeslotswithout conflicts, we then perform great <strong>de</strong>luge search [6] usingmoves M1 and M2 to reduce the number of timeslots down to 45valid timeslots if necessary. In this local search, only the 45 validtimeslots are consi<strong>de</strong>red, so no events are allowed to move intoany of the extra dummy timeslots. This hybrid heuristic is muchslower that the other three methods above, mainly due to the great<strong>de</strong>luge search. Algorithm 2 shows the pseudo-co<strong>de</strong> for the hybridheuristic IH4, which in a sense, is the most different among allmethods <strong>de</strong>scribed here.3. RESULTS AND DISCUSSIONThe proposed hybrid heuristic initialisation methods were appliedto the Socha et al. [1] instances and also to the ITC 2002 instances[5]. We did not impose time limit as a stopping condition,each algorithm stops when it finds a feasible solution.All methods successfully generate initial solution for small instancesin just few seconds. The medium and large Socha et al. instancesare more difficult as well as all ITC 2002 instances. However,the proposed methods generated feasible solutions for all instances<strong>de</strong>monstrating that the hybridisation compensates weaknessin one component with strengths in another one in or<strong>de</strong>r toproduce feasible solutions in reasonable computation times.Table 1 and Table 2 compare the performance of each method onthe Socha et al. and the ITC 2002 instances respectively. The firstcolumn in each table indicates the problem instance. The next fourcolumns give the best objective function value (soft constraints violation)obtained by each heuristic. The last column in each tableindicates the best computation time in seconds and the correspondingheuristic.The results show that none of the heuristics clearly outperformsthe others in terms of the objective function value (soft constraintsviolation) obtained. Each of the four heuristics outperforms theother three in some of the problem instances. With respect to computationtime we can see in Table 1 that for the Socha et al. problems,the heuristic that achieved the best objective value was almostnever the fastest one (except in problem instance M2). However,for the ITC 2002 problems, we see in Table 2 that in severalcases the heuristic producing the best objective value was also theALIO-EURO <strong>2011</strong> – 225


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Algorithm 1: Initialisation Heuristic 3 (IH3)1 Input: List of Unscheduled events E;2 Sort E by non-increasing Largest Degree (LD);3 while (E is not empty) do4 Choose event e from E with LD (random tie-break);5 Calculate SD for event e;6 if (SD = 0) then7 Select a timeslot t at random;8 Move events scheduled (if any) in timeslot t thatconflict with event e (if any) to the Reschedule list;9 Assign event e to timeslot t;10 for (each event in Reschedule list with SD > 0) do11 Select feasible timeslot t for event e atrandom;12 Re-calculate SD for all events in Reschedulelist;13 end14 Move all events with SD = 0 that remain inRe-schedule list to the Unscheduled list E;15 end16 else17 Select a feasible timeslot t at random for event e;18 end19 if (Unscheduled list E is not empty and time haselapsed) then20 One by one, place events from the Unscheduledlist into any random selected timeslot withoutrespecting the conflict between the events;21 end22 end23 S = current solution;24 loop = 0;25 while (S not feasible ) do26 if (loop < 10) then27 if ( coin f lip()) then28 S ∗ = M1(S); // apply M1 to S29 end30 else31 S ∗ = M2(S); // apply M2 to S32 end33 if ( f (S ∗ ) ≤ f (s)) then34 S ← S ∗ // accept new solution;35 end36 end37 else38 EHC = set of events that violate hard constraints;39 e = randomly selected from EHC;40 S ∗ = M1(S, e); // perform one Tabu Searchiteration with move M1 using event e;41 if ( f (S ∗ ) < f (S) then42 S ← S ∗ ; // accept new solution43 end44 if (loop >= ts ) then45 loop = 0;46 end47 end48 loop + +;49 end50 Output: S feasible solution (timetable);Algorithm 2: Initialisation Heuristic 4 (IH4)1 Input: List of Unscheduled events E;2 Generate dummy timeslots according to problem instance;Sort events in E by non-increasing Largest Degree (LD);3 while (Unscheduled list E is not empty) do4 Choose event e from E with the LD (random tie-break);5 Calculate SD for event e;6 if (SD = 0) then7 Select dummy timeslot at random for event e;8 end9 else10 Chose any feasible timeslot for event e;11 Up<strong>da</strong>te the new solution;12 end13 end14 S = current solution;15 Calculate initial cost function f (S);16 Initial water level B = f (S);17 ∆B = 0.01;18 while (dummy timeslots are not empty) do19 if ( coin f lip()) then20 S ∗ = M1(S); // apply M1 to S21 end22 else23 S ∗ = M2(S); // apply M2 to S24 end25 if ( f (s ∗ ) ≤ f (s)) or ( f (s ∗ )(≤ B)) then26 S ← S ∗ ; // accept new solution27 end28 B = B − ∆B; // lower the water level29 if (B - f(S) ≤ 1) then30 B = B + 5; // increase the water level31 end32 end33 Output: S feasible solution (timetable);fastest. As indicated above, the hybrid initialisation heuristic (IH4)that uses dummy timeslots to <strong>de</strong>al with conflicts and then great <strong>de</strong>lugeas the local search to bring the solution to feasibility, is neverthe fastest approach. However, this heuristic IH4 was capable ofproducing the best solutions for two of the Socha et al. instancesand six of the ITC 2002 instances.In our preliminary experiments, we implemented a sequential heuristic(see [2, 3]) but were able to generate feasible timetables onlyfor the small instances of the Socha et al. <strong>da</strong>taset (in fact, thesesmall instances are consi<strong>de</strong>red to be easy). Even after consi<strong>de</strong>rablyextending the computation time, the sequential heuristic wasnot able to generate feasible solutions for the medium and largeSocha et al. instances or the ITC 2002 <strong>da</strong>tasets.4. CONCLUSIONSMany approaches have been proposed in the literature to tacklethe University course timetabling problem. In this exten<strong>de</strong>d abstractwe have outlined four variants of hybrid heuristics <strong>de</strong>signedto generate initial feasible solutions to this problem. These hybri<strong>da</strong>pproaches combine traditional graph colouring heuristics,like Largest Degree and Saturation Degree, with different typesof local search. The four hybrid variants were tested using twosets of benchmark problem instances, the Socha et al. [1] and theInternational Timetabling Competition 2002 [5] <strong>da</strong>tasets.All the hybrid initialisation heuristics <strong>de</strong>scribed here were capableof producing feasible timetables for all the problem instances.ALIO-EURO <strong>2011</strong> – 226


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Problem IH1 IH2 IH3 IH4 Min TimeS1 173 198 207 200 0.077 (IH2)S2 211 217 189 208 0.078 (IH2)S3 176 190 188 209 0.062 (IH2)S4 250 174 203 192 0.047 (IH1)S5 229 238 226 217 0.078 (IH2)M1 817 772 802 774 5.531 (IH3)M2 793 782 784 802 6.342 (IH2)M3 795 867 828 817 6.64 (IH3)M4 735 785 811 795 5.828 (IH2)M5 773 771 784 769 16.670 (IH1)L 1340 1345 1686 1670 300.0 (IH1)Table 1: Results obtained with each hybrid initialisation heuristic(IH1 to IH4) on the 11 Socha et al. problem instances, best resultsindicated in bold.Problem IH1 IH2 IH3 IH4 Min TimeCom01 805 786 805 805 1.93 (IH3)Com02 731 776 731 778 1.36 (IH3)Com03 760 812 760 777 1.14 (IH2)Com04 1201 1178 1201 1236 4.46 (IH2)Com05 1246 1243 1246 1135 2.11 (IH3)Com06 1206 1219 1206 1133 1.33 (IH3)Com07 1391 1388 1391 1265 2.10 (IH3)Com08 1001 968 1001 1006 1.81 (IH2)Com09 841 859 841 843 1.46 (IH1)Com10 786 816 786 799 4.64 (IH3)Com11 852 877 852 839 1.05 (IH1)Com12 814 831 814 788 2.21 (IH2)Com13 1008 1010 1008 1009 2.26 (IH1)Com14 1040 1032 1040 1355 3.71 (IH2)Com15 1165 1162 1165 1161 1.56 (IH3)Com16 887 911 887 888 1.09 (IH3)Com17 1227 1032 1227 1199 1.13 (IH2)Com18 793 724 793 763 1.29 (IH3)Com19 1184 1212 1184 1209 3.22 (IH3)Com20 1137 1161 1137 1205 0.08 (IH3)Table 2: Results obtained with each hybrid initialisation heuristic(IH1 to IH4) on the 20 ITC 2002 problem instances, best resultsindicated in bold.None of the approaches showed to be clearly better that the others.For a given instance, the heuristic producing the best quality initialtimetable is often not the fastest among the four approaches.However, for all the problem instances there is at least one hybridheuristic capable of generating a feasible timetable in very shorttime, from less than a second to few seconds <strong>de</strong>pending of theproblem instance. The exception is the largest Socha et al. instancewhich is still regar<strong>de</strong>d in the literature as a very challengingproblem. Having some methods capable of generating feasible solutionsfor the University course timetabling problem is importantbecause the effort of more elaborate methods can then be focusedon tackling the violation of soft constraints in or<strong>de</strong>r to improve thetimetable quality.In a following more <strong>de</strong>tailed <strong>de</strong>scription on this research, we intendto present a statistical comparison between the proposed initialisationheuristics, compare these approaches against other proceduresto generate feasible solutions to the University coursetimetabling problem and analyse the effect of each component inthe four hybrid heuristics.5. REFERENCES[1] K. Socha, J. Knowles, and M. Samples, “A max-min ant systemfor the university course timetabling problem,” in Ant Algorithms:<strong>Proceedings</strong> of the Third International Workshop(ANTS 2002), LNCS 2463. Springer, 2002, pp. 1–13.[2] E. Burke, B. A. McCollum, A. Meisels, S. Petrovic, andQ. Rong, “A graph based hyper-heuristic for educationaltimetabling problems,” European Journal of Operational Research,vol. 176, pp. 177–192, 2007.[3] P. Kostuch, “The university course timetabling problem witha three-phase approach,” in The Practice and Theory of AutomatedTimetabling V, LNCS 3616. Springer, 2005, pp. 109–125.[4] S. Abdullah, E. Burke, and B. McCollum, Using a RandomisedIterative Improvement Algorithm with CompositeNeighborhood Structures for University Course Timetabling.Springer, 2007, pp. 153–172.[5] B. Paechter, L. M. Gambar<strong>de</strong>lla, and O. Rossi-Doria. (2002)International timetabling competition 2002. MetaheuristicsNetwork. [Online]. Available: http://www.idsia.ch/Files/ttcomp2002/[6] J. H. Obit and D. Lan<strong>da</strong>-Silva, “Computational study of nonlineargreat <strong>de</strong>luge for university course timetabling,” in IntelligentSystems - From Theory to Practice, Studies in ComputationalIntelligence, Vol. 299, V. Sgurev, M. Hadjiski, andJ. Kacprzyk, Eds. Springer-Verlag, 2010, pp. 309–328.[7] J. H. Obit, “Developing novel meta-heuristic, hyper-heuristicand cooperative search for course timetabling problems,”Ph.D. dissertation, 2010.[8] M. Chiarandini, M. Birattari, K. Socha, and O. Rossi-Doria, “An effective hybrid algorithm for university coursetimetabling,” Journal of Scheduling, vol. 9, pp. 403–432,2006.ALIO-EURO <strong>2011</strong> – 227


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Lower and upper bounds for large size instances of the optimal diversitymanagement problemAgostinho Agra ∗ Jorge Orestes Cer<strong>de</strong>ira † Cristina Requejo ∗∗ Department of Mathematics, University of Aveiro3810-193 Aveiro, Portugal{aagra, crequejo}@ua.pt† Department of Sciences and Engineering of BiosystemsInstituto Superior <strong>de</strong> Agronomia, Technical University of Lisbon (TULisbon)1349-017 Lisboa, Portugalorestes@isa.utl.ptABSTRACTWe give procedures to <strong>de</strong>rive lower and upper bounds for the optimaldiversity management problem, especially conceived to <strong>de</strong>alwith real instances that occur in the production of wire harness forthe automotive industry. We report computational results to assessthe quality of these bounds.Keywords: Integer programming, Duality, Heuristics, P-median1. INTRODUCTIONIn the production of wire harness for the automotive industry <strong>de</strong>cisionshave to be ma<strong>de</strong> on the configurations that should be manufacturedin or<strong>de</strong>r to satisfy, within reasonable production costs,a possible large variety of customers’ requests. Specifically, carsare assembled with the necessary wire connections to activate theset of requested options such as airbags, air conditioned, etc. Aconfiguration is the aggregate of minimum connections allowingto activate a given group of options. The set of requested optionsvary greatly <strong>de</strong>pending on client’s preferences. In theory, therecan be millions of different combinations of options. Since wireharness is mainly manually assembled, in practice only a smallnumber p of different configurations is settled and customers areoften supplied with cars having wire harness including unnecessarywire connections. Clearly, this gives rise to production extracosts (associated with copper wires waste) making the selection ofthe p configurations to produce an important issue.This problem, called the Optimal Diversity Management Problem(ODMP), is a special case of the well known p-median problem.The p-median problem [1, 2] seeks to select p vertices (the medians)of a digraph with weights on the arcs, in or<strong>de</strong>r to minimizethe sum of the weights of the arcs linking each non-median vertexto one of the selected medians.The ODMP, that was shown to be NP-hard [3, 4], is the p-medianfor transitive digraphs. This is the case of the graph resulting fromthe wire harness application above, which, in addition, usuallyconsists of several connected components.An extensive study on the ODMP is <strong>de</strong>veloped in the PhD thesis ofBriant [3], which is the first substantial work on this problem. Briant[3] pointed out that the large size of instances of real problemsis a serious barrier to the efficiency of the algorithms. Dealingwith the huge size instances that appear in real problems is themain concern in studies on the ODMP [5, 6, 7, 4].In this study we give ways to obtain lower and upper bounds onthe values of optimal ODMP solutions, specifically meant to <strong>de</strong>alwith the huge graphs arising from the wire harness application,and exploiting the fact that these graphs have several components.Computational results are reported to assess the quality of thesebounds on real instances.2. FORMULATIONTo formulate the ODMP consi<strong>de</strong>r a weighted transitive digraphG = (V,A,c), where the vertices of V = {1,...,n} represent theconfigurations, and (u,v) is an arc of A if and only if every optionthat configuration u allows to activate could also be activated by v.We say that v covers u (or that u is covered by v). Each configurationv can be interpreted as the subset of options that v activates.Each arc a = (u,v) of G has a cost c a , which is the cost of usingconfiguration v to substitute u.An important property of real ODMP instances is that graph G hasseveral connected components. We <strong>de</strong>note by K = {1,...,m} theset of indices of the connected components, and by G k = (V k ,A k )the subgraph induced by component k, with k ∈ K.Let, for v ∈ V , y v be a 0-1 variable indicating whether vertex v isselected (y v = 1) or not (y v = 0) to be a median. Let, for (u,v) ∈ A,x uv be a 0-1 variable indicating whether configuration representedby vertex v will replace (x uv = 1) or not (x uv = 0) the configurationrepresented by u. Consi<strong>de</strong>r, in addition, for k ∈ K, a positive integervariable p k , that indicates the number of medians in componentk. With these variables the ODMP can be mo<strong>de</strong>led as follows.min∑subject to∑c vu x vuv∈V u∈δ + (v)(1)∑ x vu + y v = 1 v ∈ V (2)u∈δ + (v)∑ y v = p kv∈V kk ∈ K (3)∑ p k = p (4)k∈Kx vu ≤ y u v ∈ V,u ∈ δ + (v) (5)y v ∈ {0,1} v ∈ V (6)x vu ∈ {0,1} v ∈ V,u ∈ δ + (v) (7)p k ∈ N k ∈ K (8)where δ + (v) = {u ∈ V : (v,u) ∈ A}.ALIO-EURO <strong>2011</strong> – 228


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Equations (2) state that either v is a median or v must be linkedto some vertex. Equations (3) and (4) guarantee that the mediansin the m connected components sums p. Inequalities (5) expressthat if a no<strong>de</strong> is not a median, its in<strong>de</strong>gree is equal to zero. Finally(6)-(8) <strong>de</strong>fine the range of the variablesNote that if the values of p k , say p ∗ k , of an optimal ODMP solutionwere known, an optimal ODMP solution would result from theunion of optimal p ∗ k-medians of each component k.Finding p ∗ 1 , p∗ 2 ,..., p∗ m, with p ∗ 1 + p∗ 2 + ... + p∗ m = p, and such thatthe union of optimal p ∗ k-medians is ODMP optimal, is the <strong>de</strong>compositionproblem for the ODMP [6].The <strong>de</strong>composition problem can be mo<strong>de</strong>led as a particular case ofthe multiple choice knapsack problem ([8, 9]), and can be solve<strong>de</strong>fficiently ([9, 10]). Hence, the ODMP problem can be <strong>de</strong>composedin smaller similar subproblems. However, since p ∗ k is notknown in advance, each subproblem k has to be solved, in principal,with a number of medians equal to 1,..., p − m + 1.3. LOWER BOUNDSLower bounds for the ODMP are usually obtained by Lagrangeanrelaxation techniques [5, 11]. We use a heuristic for the dual of thelinear relaxation of formulation (1) - (8), which is similar to theprocedure proposed in [12] to solve the dual of a linear relaxationof the p-median problem.Let λ v , with v ∈ V , be the dual variables associated with equations(2); s k , with k ∈ K, dual variables associated to (3); γ associatedto (4) and t vu , with (v,u) ∈ A, the non-negative dual variablescorresponding to inequalities (5).The dual of the linear programming relaxation of formulation (1)- (8) is as followsmax ∑ λ v − pγv∈Vsubject to∑(9)λ v − s k + t uv ≤ 0 v ∈ V k ,k ∈ K (10)(u,v)∈Aλ v − c vu ≤ t vu v ∈ V,u ∈ δ + (v) (11)s k ≤ γ k ∈ K (12)t vu ≥ 0 v ∈ V,u ∈ δ + (v) (13)The dual variables s k and t vu can be removed from the mo<strong>de</strong>l, yieldingmax ∑ λ v − pγv∈Vsubject to∑(14)γ ≥ λ v + (λ u − c uv ) v ∈ V (15)(u,v)∈AThe heuristic solution is <strong>de</strong>fined assigning to γ the value of ther.h.s. of (15), with λ v := min u∈δ + (v) c vu .4. UPPER BOUNDSThe greedy algorithm has been used to solve large size instancesof the ODMP, and studies refer that the resulting solutions are normallyquite good [3, 6, 4].It is worth mention that in [9] it is shown that the following procedure:Step 1: run the greedy algorithm for each connected componentand for all the possible choices of medians;Step 2: solve the <strong>de</strong>composition problem using the values of greedysolutions obtained on each component,provi<strong>de</strong>s the same objective function value than running the greedyalgorithm over the entire graph.We consi<strong>de</strong>r running a genetic algorithm for the ODMP in eachconnected component k of graph G, for different number of mediansp k , and use the approach in [10] to solve the resulting restricted<strong>de</strong>composition problem to obtain a final ODMP solution.Instead of consi<strong>de</strong>ring all possible number of medians for componentk, we restrict p k to vary in the interval <strong>de</strong>fined by the minimumand maximum number of medians that the greedy has <strong>de</strong>terminedfor component k when solving the ODMP with p, p−1 andp + 1 medians.In or<strong>de</strong>r to take advantage from the knowledge of the greedy solution,in the implementation of the genetic algorithm, we inclu<strong>de</strong>din the initial population (i.e., a collection of random selected p kmedians) for component k, what we call the modified k componentgreedy solution.The modified k component greedy solution is a set of p k medianson the connected component k resulting from adding (or <strong>de</strong>leting)uniformly selected vertices of component k to (from) the set ofmedians <strong>de</strong>fined by the greedy for that component.5. COMPUTATIONAL RESULTSHere we report some computational experience carried out to evaluatethe quality of the proposed lower and upper bounds on theoptimal ODMP values.All the computational tests were performed on a PC running onan Intel(R) Core(TM)2 Duo CPU 2.00 GHz processor and 1.99Gbof RAM. We used real <strong>da</strong>ta instances from the Yazaki Saltano <strong>de</strong>Portugal, a branch of Yazaki, the world’s largest producer of wireharness, consisting of graphs with 3072, 10848, 15360, 22080,and 51840 vertices and, for each one, we tested p ∈ {50, 100, 150,200}.In or<strong>de</strong>r to obtain the optimal values we use the optimization softwareXpress 7.1 with a limit for the computations on each instanceequal to three CPU hours.Table 1 reports the main computational results. The first two columnsindicate the number of vertices (n) and the number of connectedcomponents of the graph (m). The third column specifies the numberof medians (p). The remaining columns indicate the valuesfound for the lower bounds (LB), the optimal solutions (OPT)when they were found, the greedy solutions (Greedy) and the upperbounds corresponding to the values of the solutions producedby the genetic algorithm (UB-Genetic). The values in brackets referto the computational times, in CPU seconds, to <strong>de</strong>termine thecorresponding values.It can be conclu<strong>de</strong>d from Table 1 that both greedy and dual solutionsprovi<strong>de</strong> tight bounds for the real instances consi<strong>de</strong>red. Thegenetic algorithm was capable in most cases to slightly improvethe greedy solutions.It should be mentioned that the inclusion of the (modified) greedysolution in the initial population proved to be essential to obtaingood solutions. Computational tests showed that the genetic algorithmworking on an initial randomized population not includingthe greedy solution provi<strong>de</strong>s, in general, poor solutions when comparedwith the greedy solutions, and with a larger computationaleffort.It is also worth noting that, when the greedy solution is comparedwith the optimal solution (for those instances it was obtained), inALIO-EURO <strong>2011</strong> – 229


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>n m p LB OPT Greedy UB-Genetic3072 8 50 141 088 (1) 143 800 (24) 144 698 (1) 144 209 (40)3072 8 100 79 838 (2) 82 035 (73) 82 365 (1) 82 074 (61)3072 8 150 54 963 (1) 55 472 (69) 56 022 (1) 55 682 (85)3072 8 200 40 155 (1) 40 829 (72) 41 066 (1) 40 908 (147)10848 46 50 10 520 027 ( 6) 10 528 136 (41) 10 528 136 (1) 10 528 136 (1)10848 46 100 3 216 662 (16) 3 389 989 (647) 3 508 534 (1) 3 431 425 (44)10848 46 150 2 032 277 (16) 2 158 828 (2085) 2 253 169 (1) 2 203 339 (89)10848 46 200 1 469 477 (19) 1 576 485 (3184) 1 657 811 (1) 1 617 045 (113)15360 14 50 2 933 (626) - 3 239 ( 8) 3 197 (5694)15360 14 100 1 653 (688) - 1 874 (17) 1 847 (406)15360 14 150 1 174 (543) - 1 314 (27) 1 298 (3442)15360 14 200 925 (639) - 1 013 (37) 1 000 (1047)22080 16 50 2 099 492 ( 6374) - 2 279 053 ( 63) 2 241 344 (1421)22080 16 100 1 101 529 ( 3945) - 1 248 647 (143) 1 212 547 (3199)22080 16 150 752 955 ( 4020) - 863 155 (202) 845 006 (1943)22080 16 200 584 859 (12013) - 662 480 (300) 653 190 (10295)51840 60 100 2 354 812 ( 824) - 2 396 102 (33) 2 358 739 (1049)51840 60 150 1 364 717 ( 950) - 1 432 499 (51) 1 401 920 (334)51840 60 200 824 682 (1938) - 1 043 544 (74) 1 018 434 (497)Table 1: Computational results.most cases, the numbers of medians per connected component coinci<strong>de</strong>,and for most of the remaining cases, the differences do notexceed one. This means that greedy solutions give reliable estimativefor the number of medians in each component of optimalsolutions. Thus, solving the <strong>de</strong>composition problem only consi<strong>de</strong>ringa few number of medians close to the number of medians<strong>de</strong>termined by the greedy solution on each connected component,is likely to be a good strategy for solving the ODMP.6. CONCLUSIONThe ODMP is a combinatorial optimization problem arising inthe production industry of wire harness for the automotive. Withspecial attention to the fact that the graphs arising from this applicationare very large and have several connected components,we proposed ways of obtaining lower and upper bounds. Lowerbounds were obtained through a heuristic for the dual of the linearrelaxation of a mo<strong>de</strong>l for the ODMP. Upper bounds were obtainedthrough a genetic algorithm running in each component ofthe graph, and benefiting from the knowledge of a greedy solutionto combine the partial solutions into a feasible ODMP result. Weintend to extend this approach further, exploiting a specific behaviorof the ODMP objective function with respect to the number ofmedians, to <strong>de</strong>termine a narrow range for the number of mediansin the sub-problems that inclu<strong>de</strong> an optimal <strong>de</strong>composition.7. ACKNOWLEDGEMENTSThe research of the second author was supported by Forest ResearchCentre (Centro <strong>de</strong> Estudos Florestais), the research of theother authors was supported by Center for Research and Developmentin Mathematics and Applications (CIDMA) both from thePortuguese Foun<strong>da</strong>tion for Science and Technology (FCT), cofinancedby the European Community Fund FEDER/POCI 2010.8. REFERENCES[2] J. Reese, “Solution methods for the p-median problem: anannotated bibliography,” Networks, vol. 48, pp. 125–142,2006.[3] O. Briant, “Étu<strong>de</strong> théorique et numérique du problème <strong>de</strong> lagestion <strong>de</strong> la diversité,” Ph.D. dissertation, Institute NacionalPolytechnique <strong>de</strong> Grenoble, Grenoble, France, 2000.[4] A. Agra, D. Cardoso, J. Cer<strong>de</strong>ira, M. Miran<strong>da</strong>, and E. Rocha,“Solving huge size instances of the optimal diversity managementproblem,” Journal of Mathematical Sciences, vol.161, pp. 956–960, 2009.[5] O. Briant and D. Nad<strong>de</strong>f, “The optimal diversity managementproblem,” Operations Research, vol. 52, no. 4, pp. 515–526, 2004.[6] P. Avella, M. Boccia, C. D. Martino, G. Oliviero, A. Sforza,and I. Vasil’ev, “A <strong>de</strong>composition approach for a very largescale optimal diversity management problem,” 4OR, vol. 3,pp. 23–37, 2005.[7] P. Avella, A. Sassano, and I. Vasil’ev, “Computational studyof large-scale p-median problems,” Mathematical Programming,vol. 109, no. 1, pp. 89–114, 2007.[8] H. Kellerer, U. Pferschy, and D. Pisinger, Knapsack Problems.Springer, 2004.[9] A. Agra and C. Requejo, “The linking set problem: a polynomialspecial case of the multiple-choice knapsack problem,”Journal of Mathematical Sciences, vol. 161, pp. 919–929,2009.[10] D. Cardoso and J. Cer<strong>de</strong>ira, “Minimum weight t-<strong>de</strong>composition of an integer,” to appear in Journal of MathematicalSciences.[11] A. Santos, “Solving large p−median problems using a lagrangeanheuristic,” 2009, Optimization Online.[12] M. Captivo, “Fast primal and dual heuristics for the p-medianlocation problem,” European Journal of Operational Research,vol. 52, pp. 65–74, 1991.[1] P. Mirchan<strong>da</strong>ni and R. Francis, Eds., Discrete Location Theory.John Wiley & Sons, 1990.ALIO-EURO <strong>2011</strong> – 230


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Continuous Ant Colony System Applied to Optimization Problems with FuzzyCoeficientsLuiza Amalia Pinto Cantão ∗ Ricardo Coelho Silva † Akebo Yamakami †∗ UNESP – Univ Estadual Paulista, Campus of Sorocaba, Environmental Engineering Dept.Av. Três <strong>de</strong> Março, 511, 18087-180, Sorocaba – SP, Brazilluiza@sorocaba.unesp.br† UNICAMP – Univ. Estadual of Campinas, School of Electrical and Computer EngineeringP.O. Box 6011, 13083-970, Campinas – SP, Brazil{rcoelhos, akebo}@dt.fee.unicamp.brABSTRACTHeuristic algorithms based in ant colonies (named ant system –AS for short) were <strong>de</strong>veloped by Marco Dorigo to solve combinatorialoptimization problems as the traveling salesman problem.This class of algorithms was also a<strong>da</strong>pted by Seid H. Pourtakdoustand Hadi Nobahari for continuous optimization problems (ContinuousAnt Colony Optimization Systems – CACS). In this work, animplementation of CACS was used for nonlinear continuous optimizationproblems with coefficients represented by fuzzy numbers.The fuzzy numbers are mo<strong>de</strong>lled through symmetric triangularmembership functions, Possibility Measure — based on DidierDubois and Henri Pra<strong>de</strong>’s work for comparison of functions withfuzzy values — and centroid <strong>de</strong>fuzzification methods to obtain theordinary value from function values in the pheromone evaluationstep. Experiments with nine benchmark functions show a goo<strong>da</strong>greement — consi<strong>de</strong>ring the imprecise nature of the problem —between the fuzzy optima and their real counterparts.Keywords: Ant Colony System, Optimization, Fuzzy Theory, PossibilityTheory1. INTRODUCTIONAnt Colony System was <strong>de</strong>veloped based in the Traveling SalesmanProblem. The i<strong>de</strong>as behind the system make it suitable forhigh complexity combinatorial optimization problems <strong>de</strong>mandingdiscrete solutions. The heuristic algorithm inspired in ant colonieswas <strong>de</strong>veloped by M. Dorigo and coleagues as we can see in [1],[2], for example.An extension of this algorithm for continuous function optimizationwas proposed by several authors, as in [9], [12] and [13],among others. The work of S.H. Pourtakdoust and H. Nobahari asin [8] and [7] is an example of such an extension, with the ad<strong>de</strong>dbonus of a simpler structure for the application of fuzzy parameterson its formulation.The purpose of this work is the introduction of fuzzy parametersinto an Ant Colony System heuristic applied to Fuzzy MathematicalProgramming. The fuzzy parameters are treated as fuzzy numbers(see [6]) with a double intent here: (i) to mo<strong>de</strong>l the fuzzy parametersand (ii) make the fuzzy algebrical operations. Of courseother changes are required in or<strong>de</strong>r to accomo<strong>da</strong>te the fuzzy numbers,namely a convenient comparison operation between fuzzyquantities, an approach to evaluate the fuzzy function through rankingpresented in [3] and [4], and finally a <strong>de</strong>fuzzification process,as in [11].The results are satisfactory, showing that Continuous Ant ColonySystems can be a valid alternative to treat Fuzzy Mathematical Programmingproblems.2. PRELIMINARIESIn this section, we explain some topics of Fuzzy Theory used inthis work.Definition A fuzzy set ˜C on R is a fuzzy number, if its membershipfunction is <strong>de</strong>fined as follows:0 if x ≥ cx−c⎧⎪ ⎨ c−c if x ∈ [c,c]µ ( ˜C) (x) = c−xc−c if x ∈ [c,c]⎪ ⎩0 if x ≥ cwhere µ ( ˜C) (x) : R → [0,1], c is the mo<strong>da</strong>l value, i. e., µ (c) = 1,( ˜C)c and c are the inferior and superior limits, respectively.We suppose that f L˜c : [c,c] → [0,1] and f R˜c : [c,c] → [0,1] are twocontinuous mappings from the real line R to the closed interval[0,1]. The former is a strictly increasing function and the lateris a monotonically <strong>de</strong>creasing function. In this case, we assumethat <strong>de</strong> fuzzy number is represented by a triangular function, i. e.,˜C = (c,c,c).In or<strong>de</strong>r to facilitate the operations with fuzzy numbers, we assumethat an exact membership function can be approximated by usingpiecewise linear functions based on α-level sets.Definition [11] Let ˜C be a fuzzy number. Its α-level sets ˜C α orα-cuts are <strong>de</strong>fined as˜C α = {x ∈ R| µ ˜C(x) ≥ α}= [min{x ∈ R| µ ˜C (x) ≥ α}, max{x ∈ R| µ (x) ≥ α}]˜C= [(x) L α,(x) U α ] 0 < α ≤ 1(2)Acording to Za<strong>de</strong>h’s extension principle [4], the fuzzy number ˜Ccan also be expressed as(1)˜C = ⋃ α · ˜C α , 0 < α ≤ 1. (3)αThe α-levels representation is used to operate with fuzzy numbers,as shown in [6]; all other operations follow the structure presentedin this reference. They are also useful in the estimation of a representativeordinary number — a process known as <strong>de</strong>fuzzification.In this particular setting we used the centroid <strong>de</strong>fuzzificationALIO-EURO <strong>2011</strong> – 231


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>method, as in [11]. This method <strong>de</strong>fines the centroid of ˜C as thex-axis value of the centroid as its <strong>de</strong>fuzzification value, which canbe expressed as:∫ cc xµD( ˜C) = ˜C (x)dx∫ cc µ ˜C (x)dx (4)where∫ cc µ ˜C (x)dx = 1 2n∫ cc xµ ˜C (x)dx = 1 6n[((x)U α0− (x) L ) n−1α 0 + 2 ∑i=1n−1+ ∑i=1[((x)2U α0− (x) 2L α 0)+ 2n−1∑((x) U α i− (x) L α i) ] ,((x) 2Uα i)− (x) 2Lα ii=1() ](x) U α i · (x) U α i+1− (x) L α i · (x) L α i+1.In or<strong>de</strong>r to compare (or rank) fuzzy numbers, ˜C 1 and ˜C 2 for instance,we can apply a comparison measure built upon the PossibilityMeasure, as presented in [3] and [4]. In this context, if wewant to <strong>de</strong>ci<strong>de</strong> wether ˜C 1 > ˜C 2 or not, we use the following mesure(remembering that ˜C 1 = (c 1 ,c 1 ,c 1 ) and ˜C 2 = (c 1 ,c 2 ,c 2 )):( ())Pos( ˜C 1 ≥ ˜C 2 ) = max 0,min 1,1 + (c 1−c 2 )c 1 +c 2(PSE)( ( ))Pos( ˜C 1 > ˜C 2 ) = max 0,min 1, c 1−c 2 +c 1c 1 +c (PS)2(6)where PSE stands for excee<strong>da</strong>nce possibility and PS for strict excee<strong>da</strong>ncepossibility. Acording to [3], these formulas hold exceptwhen the sums of the spreads in the <strong>de</strong>nominators are zero, whichoccurs when ˜C 1 and ˜C 2 are ordinary numbers.So we assume that ˜C 1 > ˜C 2 when:and(5)Pos( ˜C 1 ≥ ˜C 2 ) ≥ α, α ∈ (0,1] (7)[]min Pos( ˜C 1 > ˜C 2 ), Pos( ˜C 2 > ˜C 1 ) < 1 (8)Condition (8) guarantees that ˜C 1 ≠ ˜C 2 .The topics presented in this section were directly used in the computationalimplementation, so their <strong>de</strong>scriptions are brief and without<strong>de</strong>tails about the theorethical foun<strong>da</strong>tions.3. THE PROBLEMAs <strong>de</strong>scribed in [5], a fuzzy function is used when some <strong>da</strong>ta aboutthe problem is not precisely known. Here a function with fuzzyparameters can be <strong>de</strong>noted by:minf (˜c,x)x i ∈ [a i ,b i ] i = 1 : n(9)x ∈ R nwhere x ∈ R and ˜c is a vector whose entries are fuzzy numbers,such that ˜c ∈ F(R), F(R) is a fuzzy set over R and f (˜c,x) : [F(R)×R n ] → F(R). The interval [a,b] is the region which the minimumvalue of the function, namely x, occurs.Even though problem (9) is an irrestrict on, we <strong>de</strong>termine a searchregion for vector x where the nonlinear function is evaluated.4. CONTINUOUS ANT COLONY SYSTEM (CACS) WITHFUZZY PARAMETERSThe heuristic method <strong>de</strong>veloped by [8] is a modification over theheuristic Ant Colony System — ACS, preserving all of its majorcharacteristics. Some important aspects are related here.4.1. Continuous Pheromone Mo<strong>de</strong>lAs reported in [8] and [7], the pheromone <strong>de</strong>position occurs overa continuous space. For fuzzy problem (9), this step involve onlyordinary numbers, because it concerns only information about thevector x.Consi<strong>de</strong>r a food source surroun<strong>de</strong>d by several ants. The ant’s aggregationaround the food source causes the highest pheromoneintensity to occur at the food source position. Then, increasing thedistance of a sample point from the food source will <strong>de</strong>crease itspheromone concentration. This mo<strong>de</strong>l uses a Probability DistributionFunction (PDF), which <strong>de</strong>termines the probability of choosingeach point x within the interval [a,b].The normal PDF can be used at the state transition rule since thecenter of which is the last best global solution and its variance <strong>de</strong>pendson the aggregation of the promising areas around the bestone, so it contains exploitation behavior. In the other hand, a normalPDF permits all points of the search space to be chosen, eitherclose to or far from the current solution, so it also contains explorationbehavior.4.2. Pheromone Up<strong>da</strong>teAt the start of the algorithm, there is no information availableabout the minimum point and the ants chose their <strong>de</strong>stination onlyby exploration.During each iteration, pheromone distribution over the search spacewill be up<strong>da</strong>ted using the acquired knowledge of the evaluatedpoints by the ants. This process gradually increases the exploitationbehavior of the algorithm, while its exploration behavior will<strong>de</strong>crease, i. e, the value of objective function is evaluated for thenew selected points by the ants. Then, the best point found fromthe beginning of the trial is assigned to x min . Also the value ofσ is up<strong>da</strong>te based on the evaluated points during the last iterationand the aggregation of those points around x min . Then a conceptof weighted variance is <strong>de</strong>fined as follows:(( ) )σ 2 = ∑k 1j=1 D( f j )−D( f min ) x j − x min() , (10)∑ k 1j=1 D( f j )−D( f min )for all j in which D( f j ) ≠ D( f min ), D(·) meaning the <strong>de</strong>fuzzificationpresented in equation (4) and k is the number of ants. Thisstrategy means that the center of the region discovered during thesubsequent iterations is the last best point and the narrowness ofits width is <strong>de</strong>pen<strong>de</strong>nt on the aggregation of the other competitorsaround the best one. The closer the better solutions go to the bestone, the smaller σ is assigned to the next iteration.4.3. AlgorithmThe algorithm <strong>de</strong>scription, based in [7], is shown below, includingthe modification for the fuzzy problem (9).Step 1 choose randomly the initially guessed minimum point x minover the space and calculate the value of the function f (˜c,x) =f min , calcule D( f min ) using (4). For each x i use a uniformPDF over the interval [a i ,b i ].Step 2 Set the inicial value of weighted variance for each pheromoneintensity distribution function: σ i = 3(b i − a i ), i = 1 : n. Itwill be large enough to approximately generate uniformlydistributed initial values of x i within the interval [a i ,b i ].Step 3 Send ants to points (x 1 ,x 2 ,...,x n ) j , j = 1 : k. To generatethese random locations, a random generator with normalPDF is utilized for each x i , where its mean and variance areALIO-EURO <strong>2011</strong> – 232


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>(x j ) min and σ i respectively. If x i is outsi<strong>de</strong> the given interval[a i ,b i ], it is discar<strong>de</strong>d and a new x i is generated again.Step 4 Evaluate f , at each discovered point, namely f 1 , f 2 ,..., f k .Determine the minimum f m and compare these value using(7) and (8) with the current minimum value f min and <strong>de</strong>terminethe up<strong>da</strong>ted f min and its associated (x 1 ,x 2 ,...,x n ) min .Step 5 If a stopping criterion is satisfied (usually the number ofiterations) then stop, else up<strong>da</strong>te the weighted variance parameterσ i for each variable x i using (10); go back to step 3).5. EXPERIMENTSThe following experiments employ test functions from [8] and onefunction from [10]. Also, each function will be presented on itsown table, together the original result from their respective references.All fuzzy numbers have 10% of uncertainty. They are represente<strong>da</strong>s [c,c−c,c+c], where c is the mo<strong>da</strong>l value of the number (µ˜c (x) =1), c − c and c + c are the inferior and superior values for the numberµ˜c (x) = 0, respectively.Each table is composed by four columns, with the first one beingthe number of ants, followed by the best crisp result obtainedon the cited reference, then the fuzzy objective function value andfinally, on the last column, the <strong>de</strong>fuzzified objective function number.The table was constructed upon an implementation usingScilab (www.scilab.org) version 5.3.0.5.1. Function 1f 1 (˜c,x) = ˜ 3905.93 − ˜10(x 2 1 − x 2) 2 − (1 − x 1 ) 2 , −2.048 ≤ x 1 ,x 2 ≤2.048.k [10] f 1 D( f 1 )100 −3905.93 [−3905.93,−3944.99,−3866,87] −3905.93200 [−3905.93,−3944.99,−3866,87] −3905.93500 [−3905.93,−3944.99,−3866,87] −3905.93Table 1: Results for function f 1 .Note that, in [10] don’t have information about the number of antsused in the tests, only the optimum value.5.2. Function 2f 2 (˜c,x) = ˜100(x 2 1 − x 2) 2 + (1 − x 1 ) 2 and −2.05 ≤ x 1 ,x 2 ≤ 2.05.5.3. Function 3k [8] f 2 D( f 2 )100 1.6e − 33 [1.9036e − 19,−0.01,0.01] 0200 3.2e − 22 [2.743e − 21,−0.01,0.01] 0500 1.7e − 12 [2.299e − 20,−0.01,0.01] 0Table 2: Results for function f 2 .f 3 (˜c,x) = ˜1x 2 1 +˜1x 2 2 +˜1x 2 3 , −5.12 ≤ x 1,x 2 ,x 3 ≤ 5.12.k [8] f 3 D( f 3 )100 3.6e − 37 [1.394e − 34,1.255e − 34,1.534e − 34] 1.394D − 34200 1.5e − 20 [9.094D − 36,8.184e − 36,1.000 − 35] 9.094e − 36500 3.0e − 09 [3.761e − 35,3.385e − 35,4.137e − 35] 3.761e − 355.4. Function 4Table 3: Results for function f 3 .k [8] f 4 D( f 4 )100 7.8e − 3 [0,−0.1611111,0.1409091] −0.0050564200 7.7e − 3 [0,−0.1611111,0.1409091] −0.0050564500 1.4e − 2 [0,−0.1611111,0.1409091] −0.00505645.5. Function 5f 5 (˜c,x) = ˜50 + ∑ 5 i=15.6. Function 6f 6 (˜c,x) = ˜1 + ∑ 2 i=1Table 4: Results for function f 4 .()xi 2 − ˜10cos(˜2˜πx i ) , −5.12 ≤ x i ≤ 5.12.k [8] f 5 D( f 5 )100 4.9 [0,−10,10] 1.108e − 14200 7.1 [0,−10,10] 1.108e − 14500 9.4 [0,−10,10] 1.108e − 14Table 5: Results for function f 5 .(xi24000 ˜ − ∏2 i=1 cos √i xi), −5.12 ≤ x i ≤ 5.12.The number ˜ 4000 has only 1% of fuzzy uncertainty.k [8] f 6 D( f 6 )100 4.1e − 3 [−9.375e − 8,−0.1000001,0.0999999] −9.376e − 8200 2.7e − 3 [−9.375e − 8,−0.1000001,0.0999999] −9.376e − 8500 1.1e − 3 [−9.375e − 8,−0.1000001,0.0999999] −9.376e − 85.7. Function 7f 7 (˜c,x) = ˜1 + ∑ 5 i=1Table 6: Results for function f 6 .(xi24000 ˜ − ∏5 i=1 cos √i xi), −5.12 ≤ x i ≤ 5.12.The number ˜ 4000 has only 1% of fuzzy uncertainty.k [8] f 7 D( f 7 )100 7.8e − 3 [−9.375e − 8,−0.1000001,0.0999999] −9.376e − 8200 7.7e − 3 [−9.375e − 8,−0.1000001,0.0999999] −9.376e − 8500 1.4e − 2 [−9.375e − 8,−0.1000001,0.0999999] −9.376e − 85.8. Function 8Table 7: Results for function f 7 .f 8 (˜c,x) = (x 2 1 + x2 2 )0.25 (˜1 + sin 2 (˜50(x 21 + x 2 2 )0.1)) , −100 ≤ x i ≤100.k [8] f 8 D( f 8 )100 2.5e − 3 [2.176e − 02,1.935e − 02,2.176e − 02] 0.0176200 5.9e − 2 [3.509e − 03,1.815e − 03,2.219e − 03] 0.00202500 3.8e − 1 [4.228e − 02,1.945e − 02,2.377e − 02] 0.021685.9. Function 9f 9 (˜c,x) = −˜20exp˜20 + ẽ, −32 ≤ x i ≤ 32.Table 8: Results for function f 8 .( √ ))1 − ˜0.2 n ∑30 i=1 x2 i −exp( 1n∑ 30i=1 cos(˜2˜πx i ) +k f 9 D( f 9 )100 [1.421e − 14,−4.27,4.27] 5.547e − 17200 [1.421e − 14,−4.27,4.27] 5.547e − 17500 [1.421e − 14,−4.27,4.27] 5.547e − 17Table 9: Results for function f 9 . Optimum crisp solution f 9 (0) = 0.f 4 (˜c,x) = ˜0.5 + sin2 (x 2 1 +x2 2 )1/2˜1+ ˜0.001(x 2 1 +x2 2 ) , −100 ≤ x 1,x 2 ≤ 100.ALIO-EURO <strong>2011</strong> – 233


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>6. CONCLUSIONSFuzzy theory was proposed by L. A. Za<strong>de</strong>h in 1965, as a toolto help the quantification of inherent imprecisions on the subjectbeing studied. This theory quickly sprea<strong>de</strong>d to several differentfields, ranging from Engineering, Informatics and Mathematics tomedical diagnosis, plague control e so on.Mathematical Programming is by itself a very important <strong>de</strong>cisiontool in several application areas. Aggregation of Fuzzy characteristicsto it allows the mo<strong>de</strong>lling of uncertainties when the avaliable<strong>da</strong>ta is not exactly know or has inherent inaccuracies, makingMathematical Programming an even more powerful resource.In this sense, the quest for optimization methods that succesfullyembraces Fuzzy Theory has been the focus of this work. Here,we have introduced uncertainty in the coefficients of the objectivefunction. These uncertainties were mo<strong>de</strong>lled by fuzzy numbers.Its application on the Continuous Ant Colony System heuristicsrequired some a<strong>da</strong>ptations, as outlined here.We tested 9 functions from the literature, with 77% of them givingequivalent or better results when compared with their crisp counterparts.Even the 23% of the test cases that performed worse, werekept insi<strong>de</strong> <strong>de</strong> viable solution space and also relatively close to thecrisp solution.So, <strong>de</strong>spite the fact that this implementation still needs some improvements— for instance, incorporating different ant colony heuristics,as proposed in [9] and [12] — its results are compatible withthe ordinary ones, allowing a flexibilization in the mo<strong>de</strong>lling ofreal case, where crisp Mathematical Programming is not directlyapplicable.7. ACKNOWLEDGEMENTSTo FUNDUNESP for the financial support and to Dr. Renato Fernan<strong>de</strong>sCantão for some invaluable hints.8. REFERENCES[1] M. Dorigo and L. M. Gambar<strong>de</strong>lla. Ant colony system: a cooperativelearning approach to the traveling salesman problem.IEEE Transactions on Evolutionary Computation, vol.1(1), 53–66, 1997.[2] M. Dorigo, V. Maniezzo, A. Colorni. Ant system: optimizationby a colony of cooperative agents. IEEE Transaction onSystems, Man and Cybernetics, vol. 26(1), 29–41, 1996.[3] D. Dubois and H. Pra<strong>de</strong>. Possibility theory: an approach tocomputerized processing of uncertainty. Plenum Press, 1986.[4] D. Dubois and H. Pra<strong>de</strong>. Ranking fuzzy numbers in the settingof possibility theory. Information Science, vol. 30, 183-224, 1983.[5] K. D. Jamison and W. A. Lodwick. Minimizing constrainedfuzzy functions. Fuzzy Sets and Systems, vol. 103, 457-464,1999.[6] A. Kaufmann and M. M. Gupta. Introduction to fuzzy arithmetic.Van Nostrand Reinhold, 1991.[7] H. Nobahari and S. H. Pourtakdoust. Optimization of fuzzyrule bases using continous ant colony system. Proceeding ofthe First International Conference on Mo<strong>de</strong>ling, Simulationand Applied Optimization – ICMSAO, 2005.[8] S. H. Pourtakdoust and H. Nobahari. An extension of antcolony system to continous optimization problems. In Proc.ANTS, vol. 3172, LNCS, M. Dorigo, M. Birattari, C. Blum,L. M. Gambar<strong>de</strong>lla, F. Mon<strong>da</strong>ta and T. Sützle, Eds., 294-301,2004.[9] K. Socha and M. Dorigo. Ant colony optimization for continuousdomains. European Journal of Operational Research,vol. 185(3), 1155-1173, 2008.[10] A. <strong>de</strong> Vicente. O processo <strong>de</strong> otimização Ant System comredução no raio <strong>de</strong> busca. TEMA Tend. Mat. Apl. Comput.,vol. 7(1), 159-168, 2006 (in portuguese).[11] Y.-M. Wang. Centroid <strong>de</strong>fuzzification and the maximizingset and minimizing set ranking based on alpha level sets.Computers & Industrial Engineering, Vol. 57, 228-236,2009.[12] X.-M. Hu, J. Zhang, H. S.-Hung, Y. Li and O. Liu. SamACO:variable sampling ant colony optmization algorithm for continuousoptimization. IEEE Transactions on Systems, Man,and Cybernetics – Part B: Cybernetics, vol. 40(6), 1555-1566, 2010.[13] X.-M. Hu, J. Zhang and Y. Li. Orthogonal methods based antcolony search for solving continuous optimization problems.Journal of Computer Science and Technology, vol. 23(1), 2-18, 2008.ALIO-EURO <strong>2011</strong> – 234


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>A tree search procedure for forest harvest scheduling problems addressingaspects of habitat availabilityTeresa Neto ∗ Miguel Constantino † João Pedro Pedroso ‡ Isabel Martins §∗ Escola Superior <strong>de</strong> Tecnologia <strong>de</strong> Viseu do Instituto Politécnico <strong>de</strong> Viseu3504-510 Viseu, Portugaltneto@estv.ipv.pt† Centro <strong>de</strong> Investigação Operacional e <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> Ciências <strong>da</strong> Universi<strong>da</strong><strong>de</strong> <strong>de</strong> LisboaCi<strong>da</strong><strong>de</strong> Universitária, 1749-016 Lisboa, Portugalmiguel.constantino@fc.ul.pt‡ INESC Porto e <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> Ciências <strong>da</strong> Universi<strong>da</strong><strong>de</strong> do PortoRua do Campo Alegre, 4169-007 Porto, Portugaljpp@fc.up.pt§ Centro <strong>de</strong> Investigação Operacional e Instituto Superior <strong>de</strong> Agronomia <strong>da</strong> Universi<strong>da</strong><strong>de</strong> Técnica <strong>de</strong> LisboaTapa<strong>da</strong> <strong>da</strong> Aju<strong>da</strong>, 1349-017 Lisboa, Portugalisabelinha@isa.utl.ptABSTRACTIn the literature, the most referenced approaches for forest harvestingscheduling problems addressing environmental protectionissues have focused mainly on including constraints on clearcutarea. Nevertheless, these restrictions may not be sufficient to preventthe loss of habitat availability that en<strong>da</strong>ngers the survival ofmany wild species. This work presents a tree search procedurefor finding good feasible solutions, in reasonable time, to forestharvest scheduling problems with constraints on clearcut area andhabitat availability. We use two measures for habitat availability:the area of all habitats and the connectivity between them. Forsolving the problem, we use a tree search procedure: a processinspired in branch-and-bound, specifically <strong>de</strong>signed for this problem.In each branch, a partial solution leads to two children no<strong>de</strong>s,corresponding to harvesting or not a given stand in a given period.Pruning is based on constraint violations or on unreachable objectivevalues. Preliminary computational results are reported.Keywords: Forest management, Harvest scheduling, Habitat availability,Tree search1. INTRODUCTIONForest management problems for timber production have been addressingconcerns with resources other than timber, such as wildlife,soil, water and aesthetics values. Mo<strong>de</strong>ling approaches to confrontthese concerns have mainly involved the use of restrictions on themaximum clearcut area. However, the solution generated by theseapproaches typically has a dispersion of smaller clearcuts acrossthe forest; it is known that forest fragmentation may have significantnegative impacts on some wildlife species. In<strong>de</strong>ed, forestfragmentation generally implies a reduction of habitat availabilitythat is, the total area of habitats (mature patches meeting a minimumtarget area or with an usable interior or core space with minimumarea requirements) and the connections between them [1, 2].Core area of a mature patch is <strong>de</strong>termined by its size and shape andimmediate surrounding conditions. Some animal species are more<strong>de</strong>pen<strong>de</strong>nt on core area than total area of mature patches [3]. Connectivitybetween habitats enables wildlife movement through theforest, thus enhancing the probability of survival. It is consi<strong>de</strong>red akey issue for the biodiversity conservation and for the maintenanceof natural ecosystems stability and integrity [4].There are several works in forest planning that inclu<strong>de</strong> maturepatch size requirements, using exact integer programmingapproaches [5, 6, 7, 8, 9, 10, 11] or heuristic methods [12, 13, 14,15, 16, 17, 18, 19, 20, 21]. To <strong>da</strong>te, as far as we know, no methodfor forest harvest scheduling problems explicitly addressing theinter-habitat connectivity issue has been reported.When full search is possible in reasonable time exact solution takeadvantage over heuristics, as they <strong>de</strong>termine proved optimal solutions.When the problems are too large to be solved exactly, exactmethods may be interrupted in the middle of the search. Treesearch can used as an exact method, especially to solve aca<strong>de</strong>micproblems [22, 23], but it also can be used as a heuristic [24].This work presents a tree search approach for finding good feasiblesolutions, in reasonable time, to forest harvest schedulingproblems with constraints on clearcut area and habitat availability.Every mature patch meeting a minimum target area is consi<strong>de</strong>re<strong>da</strong> habitat (i.e. core area is not consi<strong>de</strong>red). Several connectivityindices have been proposed for landscape conservation planning;we use the probability of connectivity in<strong>de</strong>x proposed by [25].We report computational tests involving both real forests and generatedbenchmark instances.2. PROBLEMBasic forest harvest scheduling problems generally encompass themaximization of the net present value of timber harvested within atemporal horizon, subject to several non-spatial constraints. In thiswork, we consi<strong>de</strong>r lower and upper bounds on the volume of timberharvested in each period (constraints R l 1 and Ru 1 , respectively)and a minimum average age for the forest at the end of the planninghorizon (constraints R 2 ). Constraints on clearcut area and habitatavailability are consi<strong>de</strong>red. Constraints R 3 impose a maximumin the area of each clearcut; constraints R 4 concern the minimumnumber of periods in which stands adjacent to a clearcut can notALIO-EURO <strong>2011</strong> – 235


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>be harvested, the so-called greenup restrictions. Constraints onhabitat availability impose, in each period, a minimum in the totalarea of habitats (constraints R 5 ) and a minimum value for theprobability of connectivity (constraints R 6 ). It is assumed that astand may be harvested only once, and that a harvested stand maybecome mature within the time horizon.To i<strong>de</strong>ntify clearcuts or mature patches (maximal groups of contiguousstands) it is necessary to <strong>de</strong>fine adjacency between stands.For clearcuts, we consi<strong>de</strong>r that two stands are adjacent if they sharea boun<strong>da</strong>ry that is not a discrete set of points (strong adjacency).For mature patches, we consi<strong>de</strong>r that it is sufficient to share at leasta single point (weak adjacency).Many indices have been used for connectivity analysis [26, 27, 28,29, 30, 25]. The authors in [25] encourage the use of the probabilityof connectivity, an in<strong>de</strong>x that is based on the availabilityconcept, dispersal probabilities between habitats and graph structures.This in<strong>de</strong>x uses an indicator p hr of the possibility of a directmovement occurrence (without passing by any other intermediatehabitat) between habitats h and r, obtained by a negative exponentialfunction:p hr = e −Cd hr,where C is a constant greater than zero called the coefficient of dispersion(species <strong>de</strong>pen<strong>de</strong>nt), and d hr is the edge-to-edge distancebetween h and r (in km). This indicator expresses the possibility ofan animal to disperse among habitats. The closer the indicator is to1, the smaller is the inter-habitat distance, and the more favorableis the occurrence of a movement. In this work, the distance betweentwo stands (represented as polygons) is simply computed asthe minimum Eucli<strong>de</strong>an distance between their vertices; the edgeto-edgedistance between two mature habitats is approximated bythe minimum distance between their stands.A path between two habitats h and r, h ≠ r, is ma<strong>de</strong> up of a sequenceof direct movements from h to r in which no habitat isvisited more than once. The connectivity of a path is given by theproduct of the indicators of direct movements that form the path.The largest connectivity among all paths between h to r is <strong>de</strong>notedby g hr , and indicates the path with greatest chance of dispersion.Let H t be the set of all habitats in period t, s h be the area of habitath,∀h ∈ H t and H t be the total habitat area. The probability ofconnectivity for period t is given by:∑∑s h s r g hrh∈HI t =t r∈H tHt2 . (1)I t expresses the possibility of two animal randomly placed into twohabitats to fall into interconnected habitats. I t ranges from 0 to 1,and increases with improving connectivity. It is equal to 1 whenthe forest is composed by a single habitat, and is equal to zerowhen there are no habitats, or all habitats are completely isolated(by being too distant).3. TREE SEARCHThe tree search proposed in this work is inspired in a branch-andbound<strong>de</strong>signed specifically for this problem. The procedure consistsof successive branching on partial solutions; more specifically,in each branch a partial solution can lead to two children solutions,corresponding to the <strong>de</strong>cision of harvesting or not a standin a given period.Let T be the number of periods within the time horizon and n bethe number of stands. The first step is to initialize a queue Q withthe tree’s root no<strong>de</strong>, <strong>de</strong>fined by the following elements:• S 0 , the set of all pairs (stand i,period t) such that i is availableto be harvested in t, sorted by <strong>de</strong>scending or<strong>de</strong>r of thenet present value corresponding to i and t;• a solution x 0 where no <strong>de</strong>cision is taken (x i = T + 1 for allstands i);• the net present value of x 0 , f npv(x 0 ) = 0;• an upper bound ub 0 to the net present value of an optimalsolution to the problem.The maximum cardinality of S 0 is n×T, which happens when allstands are old enough to be harvested in any period.At each tree no<strong>de</strong> k, the first element (i k ,t k ) of S k is selected. Thepartial solution x k leads to two new partial solutions, correspondingto the <strong>de</strong>cision of harvesting or not harvesting stand i k in periodt k (left and right branches, respectively):• x k+1 , where we fix x k+1i k= t k and x k+1i = x k i , for all i ≠ i k;• x k+2 , with x k+2i = x k i for all i.The sets corresponding to the two new branches are S k+1 andS k+2 , initialized by removing (i k ,t k ) from S k . The set S k+1 isup<strong>da</strong>ted by removing any pair (i,t) such that harvesting stand i inperiod t violates the following restrictions: R u 1 ; R 2; R 3 and R 4 ifstands i and i k are adjacent.At any no<strong>de</strong> k ′ , restrictions R l 1 can only be fully checked when S k ′is empty (all the <strong>de</strong>cisions were taken). In this case, if the correspondingsolution x k′ does not satisfy constraints R l 1 , k′ is infeasible,otherwise k ′ is feasible. However, when S k ′ is not empty, wecheck for period t k ′, and the next green-up periods, whether harvestingall stands still available gives a volume of timber greateror equal than the lower bound (infeasibility test). If not, no<strong>de</strong> k ′ isinfeasible, as can not lead to solutions meeting R l 1 . Otherwise, noconclusion is drawn about the infeasibility of k ′ .We check no<strong>de</strong>s k + 1 and k + 2 with the infeasibilty test. If wedo not conclu<strong>de</strong> that no<strong>de</strong> k + 1 is infeasible, more up<strong>da</strong>tes arema<strong>de</strong>: f npv(x k+1 ) is equal to f npv(x k ) plus the net present valueof stand i k in period t k , and the upper-bound ub k+1 is calculated. Ifno conclusion is drawn about the infeasibility of k + 2, the upperboundub k+2 is calculated. Any upper bound is on the optimal netpresent value of the forest harvest scheduling problem addressingaspects of habitat availability where the <strong>de</strong>cisions already taken areincorporated.Any no<strong>de</strong> k ′ can be pruned by one of the following three reasons:• k ′ is infeasible (either S k ′ is empty or not);• S k ′ is empty and x k′ is feasible;• the upper-bound ub k ′ at no<strong>de</strong> k ′ is not greater than the bestnet present value found so far.The new (non-pruned) no<strong>de</strong>s are inserted into queue Q and theprocess continues from these elements. Tree search ends when Qis empty, or a certain CPU time limit is reached.In this work, several types of upper-bounds are tested.The method can be represented by a tree, as shown in figure 1. Thetree has a maximum height of n × T + 1 and a maximum numberof no<strong>de</strong>s of 2 (n×T +1) − 1.4. TREE SEARCH IMPLEMENTATIONThree strategies to gui<strong>de</strong> the search on the tree were implemented:<strong>de</strong>pth-first, best-first and beam search.In <strong>de</strong>pth-first search (DFS), the search <strong>de</strong>scends on the tree untila leaf (pruned no<strong>de</strong>) is reached. This is implemented though aALIO-EURO <strong>2011</strong> – 236


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Figure 1: Tree Search.last-in-first-out (LIFO) process on the queue Q. The right branchsolution is inserted into Q first, followed by the left branch solution.In best-first search (BFS), branch is ma<strong>de</strong> on the Q element thathas the highest upper-bound.In breadth-first search, all the solutions at the same level are searchedbefore exploring the next level. In beam search (BS), breadth-firstsearch is parameterized, by limiting the number of solutions tobranch per level. At each level, the generated partial solutions aresorted by ascending or<strong>de</strong>r of the upper-bound, and the β last solutionsare branched; the other solutions are pruned. This strategyreduces the memory requirements of breadth-first search.On the first two strategies, when Q is empty the whole tree hasbeen explored; in these cases, the best feasible solution is an optimalsolution. Tree search is used as a heuristic when only a part ofthe tree is explored.5. PRELIMINARY RESULTSWe report results for WLC and El Dorado instances (also availableat the web site www.unbf.ca/fmos/), with 73 and 1363 stands,respectively. The El Dorado forest in the U.S.A. El Dorado is referredto in [31]. The test problem runs were ma<strong>de</strong> on a <strong>de</strong>sktopcomputer with an Intel Core 2 - 2 GHz processor and 2 GB RAM.Tree search was implemented with Python language.Different values are used for the minimum value of the probabilityof connectivity. The DFS and BFS strategies were allowed to runfor two hours at most. The results show that the strategies with thedifferent types of upper bounds were able to give feasible solutionsfor the instances. BS largely <strong>de</strong>pends on the value of the parameterβ.6. REFERENCES[1] L. D. Harris, “The fragmented forest: Island biogeographytheory and the preservation of biotic diversity,” University ofChicago, Chicago, Tech. Rep., 1984.[2] M. Kurtilla, T. Pukkala, and J. Loikkanen, “The performanceof alternative spatial objective types in forest planning calculations:a case for flying squirrel and moose,” Forest Ecologyand Management, vol. 166, pp. 245–260, 2002.[3] E. Z. Baskent and G. A. Jor<strong>da</strong>n, “Characterizing spatialstructure of forest landscapes,” Canadian Journal of ForestResearch, vol. 25, no. 11, pp. 1830–1849, 1995.[4] P. Taylor, L. Fahrig, K. Henein, and G. Merriam, “Connectivityis a vital element of landscape structure,” Oikos, vol. 68,no. 3, pp. 571–573, 1993.[5] J. Hof, M. Bevers, L. Joyce, and B. Kent, “An integer programmingapproach for spatially and temporally optimizingwildlife populations,” Forest Science, vol. 40, no. 1, pp. 177–191, 1994.[6] I. Martins, M. Constantino, and J. G. Borges, “Forestmanagement mo<strong>de</strong>ls with spatial structure constraints,”C.I.O./<strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> CiÍncias <strong>de</strong> Lisboa, Working Paper no.2/1999, 1999.[7] S. Rebain and M. E. McDill, “A mixed-integer formulationof the minimum patch size problem,” Forest Science, vol. 49,no. 4, pp. 608–618, 2003.[8] S. Rebain and M. McDill, “Can mature patch constraints mitigatethe fragmenting effect of harvest opening size restrictions?”Int. Trans. Operations Research, vol. 10, no. 5, pp.499–513, 2003.[9] S. F. Tóth, M. E. McDill, and S. Rebain, “Finding the efficientof a bi-criteria, spatially-explicit, harvest schedulingproblem,” Forest Science, vol. 52, no. 1, pp. 93–107, 2006.[10] Y. Wei and H. M. Hoganson, “Scheduling forest core areaproduction using mixed integer programming,” CanadianJournal of Forest Research, vol. 37, no. 10, pp. 1924–1932,2007.[11] K. Öhman and P. Wikström, “Incorporating aspects of habitatfragmentation into long-term forest planning using mixed integerprogramming,” Forest Ecology and Management, vol.255, pp. 440–446, 2008.[12] K. Öhman and L. Eriksson, “The core area concept in formingcontiguous areas for long term forest planning,” CanadianJournal of Forest Research, vol. 28, no. 7, pp. 1032–1039, 1998.[13] K. Öhman, “Creating continuous areas of old forest in longterm forest planning,” Canadian Journal of Forest Research,vol. 30, no. 11, pp. 1817–1823, 2000.[14] A. Falcao and J. Borges, “Combining random and systematicsearch heuristic procedures for solving spatially constrainedforest management scheduling mo<strong>de</strong>ls,” Forest Science,vol. 48, no. 3, pp. 608–621, 2002.[15] K. Öhman, Multi-objective forest planning. Kluwer Aca<strong>de</strong>micPublishers, 2002, ch. Spatial optimisation in forestplanning: a review of recent Swedish research, pp. 153–172.[16] F. Caro, M. Constantino, I. Martins, and A. Weintraub, “A 2-opt tabu search procedure for the multiperiod forest harvestingproblem with adjacency, greenup, old growth, and evenflow constraints,” Forest Science, vol. 49, no. 5, pp. 738–751,2003.[17] H. M. Hoganson, Y. Wei, and R. H. Hokans, “Integratingspatial objectives into forest plans for minnesota nationalforests,” in Systems analysis in forest resources October 7-9Stevenson WA, <strong>Proceedings</strong> of the 2003 Symposium, M. Beversand T. Barrett, Eds., USDA Forest Service - Rocky MountainResearch Station, 2004.[18] I. Martins, M. Constantino, and J. G. Borges, “A column generationapproach for solving a non-temporal forest harvestmo<strong>de</strong>l with spatial structure constraints,” European Journalof Operational Research, vol. 161, no. 2, pp. 478–498, 2005.[19] K. Öhman and T. Lämås, “Reducing forest fragmentation inlong-term forest planning by using the shape in<strong>de</strong>x,” ForestEcology and Management, vol. 212, pp. 346–357, 2005.ALIO-EURO <strong>2011</strong> – 237


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>[20] A. H. Mathey, E. Kremar, and I. Vertinsky, “Re-evaluatingour approach to forest management planning: a complexjourney,” Forestry Chronicle, vol. 81, no. 3, pp. 359–364,2005.[21] Y. Wei and H. M. Hoganson, “Tests of a dynamicprogramming-based heuristic for scheduling forest core areaproduction over large landscapes,” Forest Science, vol. 54,no. 3, pp. 367–380, 2008.[22] A. Sbihi, “A best first search exact algorithm for the multiplechoicemultidimensional knapsack problem,” Journal ofCombinatorial Optimization, vol. 13, no. 4, pp. 337–351,2007.[23] C. Artigues, M. Gendreau, L. M. Rousseau, andA. Vergnaud, “Solving an integrated employee timetablingand job-shop scheduling problem via hybrid branch-andbound,”Computers & Operations Research, vol. 36, no. 8,pp. 2330–2340, 2009.[24] J. P. Pedroso and M. Kubo, “Heuristics and exact methodsfor number partitioning,” European Journal of OperationalResearch, vol. 202, pp. 73–81, 2010.[25] S. Saura and L. Pascual-Hortal, “A new habitat availabilityin<strong>de</strong>x to integrate connectivity in landscape conservationplanning: comparison with existing indices and applicationto a case study,” Landscape and Urban Planning, vol. 83, pp.91–103, 2007.[26] “Fragstats: spatial pattern analysis program for categoricalmaps,” 1995. [Online]. Available: http//www.umass.edu/lan<strong>de</strong>co/research/fragstats/fragstats.html[27] N. H. Shumaker, “Using landscape indices ti predict habitatconnectivity,” Ecology, vol. 77, no. 4, pp. 1210–1225, 1996.[28] T. H. Keitt, D. L. Urban, and B. Milne, “Detecting criticalscales in fragmented landscapes,” Conservation Ecology,vol. 1, no. 1, 1997. [Online]. Available: http//www.consecol.org/Journal/vol1/iss1/art4.[29] A. G. Bunn, D. L. Urban, and T. H. Keitt, “Landscape connecivity:a conservation application of graph theory,” Journalof Environmental Management, vol. 2, no. 10, pp. 265–278, 2000.[30] S. S. L.P. Hortal, “Comparison and <strong>de</strong>velopment of newgraph-based connectivity indices: towards the priorization ofhabitat patches and corridors for conservation,” LandscapeEcology, vol. 21, no. 7, pp. 959–967, 2006.[31] M. Goycoolea, A. T. Murray, F. Barahona, R. Epstein, andA. Weintraub, “Harvest scheduling subject to maximum arearestrictions: exploring exact approaches,” Operations Research,vol. 53, no. 3, pp. 490–500, 2002.ALIO-EURO <strong>2011</strong> – 238


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Automatic Configuration of TPLS+PLS Algorithms for Bi-objective Flow-ShopScheduling ProblemsJérémie Dubois-Lacoste ∗ Manuel López-Ibáñez ∗ Thomas Stützle ∗∗ IRIDIA-CoDE, Université Libre <strong>de</strong> BruxellesBrussels, Belgium{dubois-lacoste, manuel.lopez-ibanez, stuetzle}@ulb.ac.beABSTRACTThe automatic configuration of algorithms is a hot research topicnowa<strong>da</strong>ys, and it is rapidly having an increasing impact on theway algorithms are <strong>de</strong>signed and evaluated. The main focus ofautomatic configuration tools has been so far the configuration ofsingle-objective algorithms. However, these tools may be appliedto the automatic configuration of multi-objective algorithms forPareto-optimization by means of unary quality measures such asthe hypervolume. This study shows that such an approach is ableto outperform state-of-the-art multi-objective optimizers that weremanually configured. The results presented here on five variantsof multi-objective flow-shop problems show that the automaticallyconfigured algorithm reaches at least the same and often better finalquality than the current state-of-the-art algorithm.Keywords: Automatic configuration, Multi-objective, Flow-shopscheduling1. INTRODUCTIONThis paper presents a study of automatic algorithm configurationtools for improving the performance of multi-objective algorithms.Very recently, López-Ibáñez and Stützle [1] applied automatic configurationtechniques to the configuration of multi-objective algorithms.In particular, they automatically configured a multiobjectiveant colony optimization (MACO) framework, leadingto new MOACO algorithms that outperform previously proposedMOACO algorithms for the bi-objective traveling salesman problem(bTSP). Despite the inherent interest for the research on MO-ACO algorithms, the results obtained by the new MOACO algorithmsare still behind state-of-the-art algorithms for the bTSP. Inthis study, our aim is to configure in an automatic fashion a newstate-of-the-art multi-objective optimizer for an N P-hard problem.In particular, we tackle five bi-objective variants of the multiobjectiveflow-shop problem. The current state-of-the-art algorithmfor these five bi-objective permutation flow-shop problems(bPFSPs) was already shown to outperform by a substantial marginall previously available algorithms for these problems [2] and,hence, we expected little room for improvement. Nonetheless, theresults reported here show that automatic configuration leads toa significant improvement over the current state-of-the-art algorithm.The current state-of-the-art algorithm for these five bPFSPs is a hybri<strong>da</strong>lgorithm combining the two-phase local search (TPLS) [3]and the Pareto local search (PLS) frameworks [4]. TPLS tacklesmulti-objective problems by using efficient single-objective algorithmsto solve a sequence of scalarizations (weighted sum aggregations)of the multi-objective problem. PLS is a local searchmethod for multi-objective problems that uses the Pareto dominancecriterion as an acceptance criterion in the local search. Fromthese two frameworks, we have build a hybrid TP+PLS softwareframework.The flow-shop scheduling problem (FSP) [5] is one of the mostwi<strong>de</strong>ly studied scheduling problems. In this work we study the biobjectivevariants that arise from the minimisation of the followingobjectives: the makespan (C max , that is, the completion time of thelast job), sum of flowtimes (SFT, that is the sum of the completiontimes of all jobs), weighted tardiness (WT, that is, the sum ofthe amount of time a job is late weighted by each job’s priority)and the total tardiness (TT, that is the same as WT but all prioritiesare equal). We tackle the bi-objective PFSPs that result from fivepossible pairings of objectives (we do not consi<strong>de</strong>r the combinationof the total and weighted tardiness): (C max , SFT), (C max , TT),(C max , WT), (SFT, TT) and (SFT, WT). These bi-objective problemshave been the focus of intensive research, which is summarisedin a recent review [6].In bi-objective combinatorial optimization problems, candi<strong>da</strong>te solutionsare evaluated according to an objective function vector ⃗f =( f 1 , f 2 ). Given two vectors ⃗u,⃗v ∈ R 2 , we say that ⃗u dominates ⃗v(⃗u ≺ ⃗v) iff ⃗u ≠ ⃗v and u i ≤ v i , i = 1,2. Without preference informationabout the objectives, the aim is, without loss of generality,to minimize the objective functions in terms of Pareto-optimality,that is, to find the set of solutions that are not dominated by anyother feasible solution. This set is called the Pareto set, and itsimage in the objective space is called the Pareto front. Since thisgoal is in many cases intractable, the goal becomes to find a set ofnon-dominated solutions that approximates well the Pareto front.The assessment of the relative quality of different Pareto front approximationsis a difficult problem, since they are often incomparablein the Pareto sense. For this purpose, several unary qualityindicators have been proposed that try to summarise the qualityof a Pareto front approximation into a single scalar value. In thispaper, we use one of the most wi<strong>de</strong>ly used indicators, the hypervolume[7, 8]. In two dimensions, the hypervolume of a Paretofront approximation is the area dominated by at least one of its solutions,and boun<strong>de</strong>d by a point that is larger in all objectives thanall points in the Pareto front.In what follows, we first <strong>de</strong>scribe the outline of the hybrid algorithmthat we use, and we explain how we automatically configureit. We perform an experimental analysis that shows that the automaticallyconfigured versions reach state-of-the-art performance.2. ALGORITHM DESIGNThe TP+PLS framework consists of the sequential execution of theTPLS and PLS algorithms. TPLS uses effective single-objectivealgorithms to solve a sequence of scalarized problems, that is,weighted sum aggregations of the multiple objective functions. Weuse a recent version called A<strong>da</strong>ptive Anytime Two-Phase LocalSearch (AA-TPLS) [9]. Contrary to TPLS, PLS does not rely onALIO-EURO <strong>2011</strong> – 239


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>weights, but is a local search method purely based on Pareto dominance.We briefly <strong>de</strong>scribe these two algorithms, and how wecombine them into a final hybrid algorithm.A<strong>da</strong>ptive Anytime Two-Phase Local Search TPLS in its originalversion [3] consists of two main phases. In the first phase, ahigh-quality solution is generated for one objective using an effectivesingle-objective algorithm. This high-quality solution is theseed that initializes the second phase. During this second phase,a sequence of scalarizations are tackled. A scalarized single-objectiveproblem is <strong>de</strong>fined from the bi-objective one as follows: anormalized weight vector ⃗ λ = (λ,1 − λ), λ ∈ [0,1] ⊂ R is usedto compute the scalar value of a solution s with objective functionvector ⃗f (s) = ( f 1 (s), f 2 (s)) as f λ (s) = λ · f 1 (s) + (1 − λ) ·f 2 (s). The single-objective algorithm that tackles these scalarizationsuses as a seed the best solution found for a previous scalarization.In this way, TPLS can take advantage of any known effectivealgorithm for each single objective.In a recent work [9], we have shown that TPLS can be effectivebut that it has few drawbacks. First, the computation time mustbe known in advance in or<strong>de</strong>r to distribute the computational effortequally in all directions; otherwise, if stopped earlier, the approximationto the Pareto front will be very poor in some regions.Second, it cannot a<strong>da</strong>pt the computational effort to different Paretofront shapes. We proposed AA-TPLS [9] as an improved versionof TPLS that has the anytime property, that is, it aims at producingan as high as possible performance at any moment of its execution;moreover, this improved version a<strong>da</strong>pts to the shape of the Paretofront, focusing the search on those regions that would improve theoverall quality of the Pareto front approximation. Here, we usethis new AA-TPLS as a component of the TP+PLS algorithm.We use an iterated greedy (IG) as the un<strong>de</strong>rlying algorithm of AA-TPLS. IG is a stochastic local search method, originally proposedfor permutation flow-shop scheduling problems to minimize themakespan, for which it is state-of-the-art [10]. In recent work [11],we a<strong>da</strong>pted IG to minimize other objectives, that is, total tardiness(weighted or not), sum of flowtimes, and scalarized problemsarising from all possible pairwise combinations of these three objectives.Automatic configuration tools were used to find efficientparameter settings of IG for each problem. We use these settingshere for IG, to focus on the automatic configuration of six parametersthat control the behavior of our AA-TPLS framework, that is,the multi-objective part of the combination AA-TPLS &IG.Pareto Local Search Pareto Local Search (PLS) can be seen asthe extension of iterative improvement algorithms from the singleto the multi-objective case [12]. In PLS, an acceptance criterionbased on Pareto dominance replaces the usual single-objective acceptancecriterion.Given an initial archive of non-dominated solutions, which are initiallymarked as unvisited, PLS iteratively applies the followingsteps. A solution s is randomly chosen among the ones in thearchive that are still unvisited. Then, the neighborhood of s isfully explored and all neighbors that are not weakly dominatedby s or by any solution in the archive are ad<strong>de</strong>d to the archive.Solutions in the archive dominated by the newly ad<strong>de</strong>d solutionsare removed in or<strong>de</strong>r to keep only non-dominated solutions in thearchive. Once the neighborhood of s has been fully explored, sis marked as visited. When all solutions in the archive have beenvisited, the algorithm stops. Despite its relative simplicity, PLS isan important component of state-of-the-art algorithms for the biobjectivetraveling salesman problem (bTSP) [13] and bi-objectivepermutation flow-shop scheduling problems (bPFSP) [11, 2]. Asthe neighborhood operator of PLS, in [2] we reported experimentsusing three different operators: two being based on either insertionor exchange moves, and the third being a combination of both(thus consi<strong>de</strong>ring more solutions but requiring more time to do it).In this work we automatically configure the choice of this operator.The computation time required by PLS is unpredictable, and may<strong>de</strong>pend on the instance and even on the or<strong>de</strong>r unvisited solutions inthe archive are chosen. The version of PLS used in the final hybri<strong>da</strong>lgorihm is time boun<strong>de</strong>d, that is, it simply stops if the time limitis reached.Hybrid TP+PLS Algorithm Our framework for the hybrid algorithmin this work is based on the two algorithmic schemes introduce<strong>da</strong>bove and it is the same as the one proposed in [2].First single-objective algorithms (in our case, IG algorithms) find ahigh-quality initial solution for each single objective. Then we useAA-TPLS to perform a series of scalarizations that produces a setof high-quality, non-dominated solutions. This set is then furtherimproved by a time-boun<strong>de</strong>d PLS that uses appropriate neighborhoodoperators; in the specific case of the problems tackled in thispaper, these are an insertion, an exchange operator, or a combinationof both. The result is a hybrid TP+PLS algorithm. Throughthe particular choices of the un<strong>de</strong>rlying single-objective algorithmsand the neighborhoods of PLS, we can instantiate the frameworkof the hybrid algorithm for virtually any bi-objective optimizationproblem. Here, these problems are five bi-objective PFSPs.The seven parameters (six for AA-TPLS and one for PLS) of theTP+PLS framework are those that <strong>de</strong>fine the specific settings usedby TPLS and PLS, that is, the multi-objective part of the final algorithm,and the relative duration of these phases. For more <strong>de</strong>tailson these parameters we refer to [2].Automated Hybrid Configuration The automatic configurationtool that we use is I/F-Race [14]. More specifically, we use a new,improved implementation provi<strong>de</strong>d by the irace software [15].This tool handles several parameter types: continuous, integer, categorical,and or<strong>de</strong>red. Continuous and integer parameters take valueswithin a range specified by the user. Categorical parameterscan take any value among a set of possibles ones explicitly givenby the user, while an or<strong>de</strong>red parameter is similar to a categoricalparameter with a pre-<strong>de</strong>fined strict or<strong>de</strong>r of its possible values.As proposed by López-Ibáñez and Stützle [1], I/F-Race may beused to automatically configure multi-objective algorithms by integratingthe hypervolume indicator as the evaluation criterion.For the automatic configuration process, we generated 500 traininginstances of each size, 50 jobs and 20 machines (50x20) and100 jobs and 20 machines (100x20). These instances were producedfollowing the same procedure <strong>de</strong>scribed in [6]. I/F-Race isstopped after 5000 runs of TP+PLS, and each run is given a timelimit proportional to the instance size of 0.1 · n · m seconds, thatis, 100 seconds for instances of size 50x20 and 200 seconds forinstances of size 100x20.We compare the configuration of TP+PLS found by I/F-Race withthe configuration reported in the original publication, that are basedon a careful experimental analysis to find the best possible parametrizationof the algorithm “by hand” [2], to un<strong>de</strong>rstand the effect of eachalgorithm component, and the best <strong>de</strong>sign choice for each of them.We call these original configurations conf hand . In addition, we alsorun I/F-Race adding conf hand to the initial set of candi<strong>da</strong>te configurations.We call conf tun−rnd the best configuration obtained fromrunning I/F-Race without knowledge of the conf hand configuration,and we call conf tun−ic the best configuration obtained fromrunning I/F-Race using conf hand as an initial configuration.ALIO-EURO <strong>2011</strong> – 240


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>3. EXPERIMENTAL ANALYSISWe compare the original configurations proposed in [2] (conf hand ),where it is shown that the hybrid algorithm using this parametrizationgreatly improves upon previous state-of-the-art algorithms,with the configurations obtained from the automatic configurationprocess (conf tun−rnd and conf tun−ic ).For the experimental analysis of the configurations, we use 10 instancesproduced in the same way as the training instances, of size50x20 and 100x20. Each experiment is run until a time limit of0.1 · n · m seconds to allow a computation time proportional to theinstance size, as suggested by Minella et al. [6]. We repeat eachexperiment 10 times with different random seeds.To normalize the hypervolume value across all instances, we firstnormalize all non-dominated points to the range [1,2], and wecompute the hypervolume of the set of normalized points, usingas reference point (2.1,2.1).Graphical analysis To explore graphically the performance ofeach configuration, we examine their empirical attainment functions(EAF). The EAF of an algorithm provi<strong>de</strong>s an estimation ofthe probability for an arbitrary point in the objective space to beattained by (that is, dominated by or equal to) a solution obtainedby a single run of the algorithm. Thus, examining the EAF allowsto know with which frequency a region of the objective spaceis attained by a multi-objective algorithm. By examining the differencesbetween the EAFs of two algorithms, one can not onlyi<strong>de</strong>ntify regions of the objective space where one algorithm performsbetter than another but also know by which magnitu<strong>de</strong>. Thedifferences in favor of each algorithm can be plotted si<strong>de</strong>-by-si<strong>de</strong>and the magnitu<strong>de</strong> of the differences be enco<strong>de</strong>d in gray levels (the<strong>da</strong>rker the color, the higher the difference). For more <strong>de</strong>tails, werefer to López-Ibáñez et al. [16].Figure 1 presents the differences of the EAFs for conf hand andconf tun−rnd for 2 instances of size 50x20. Other instances andobjectives show the same trend: each algorithm performs better indifferent regions, but one can hardly assess that conf hand or oneof the automatically <strong>de</strong>rived configurations outperforms the otheracross the whole non-dominated front.Statistical analysis To assess whether the performance differencesamong the configurations are significant, we perform a statisticaltest on the overall results. Table 1 presents the mean andstan<strong>da</strong>rd <strong>de</strong>viation of the hypervolume for each problem and eachconfiguration, for instances of size 50x20. We perform a pairedt-test with the null hypothesis of equal performance and a confi<strong>de</strong>ncelevel of 0.95, between the conf hand configuration and eachof the other two. A bold face indicates that the difference is statisticallysignificant in favor of one of the automatically <strong>de</strong>rivedconfigurations, and an italic face indicates that the difference isstatistically significant in favor of conf hand . The same test is performedfor instances of size 100x20, and results are reported inTable 2.In all cases except one (PFSP-(SFT, WT) on Table 2), conf handobtains the worst results of all the three configurations, the differencebeing often statistically significant. In particular, conf tun−icimproves in nine out of the ten cases significantly over conf hand(see Tables 1 and 2). Even if the absolute differences in hypervolumeare not very large, this is a noteworthy result given the excellentperformance that the hybrid TPLS+PLS using the conf handconfiguration achieved when compared to previous state-of-the-artalgorithms [2].Table 1: Mean and stan<strong>da</strong>rd <strong>de</strong>viation of the normalized hypervolumeobtained by each configuration, evaluated over 10 runs and10 instances of size 50x20. A bold face indicates that there is astatistically significant difference (see text for <strong>de</strong>tails) in favor ofa given configuration versus conf hand , and an italic face that thedifference is in favor of conf hand .conf hand conf tun−rnd conf tun−icmean sd mean sd mean sd(C max , SFT) 0.974 0.036 0.982 0.038 0.984 0.034(C max , TT) 0.999 0.039 1.005 0.038 1.002 0.035(C max , WT) 1.037 0.026 1.045 0.024 1.045 0.023(SFT, TT) 0.954 0.038 0.955 0.039 0.96 0.04(SFT, WT) 1.022 0.028 1.024 0.03 1.029 0.026Table 2: Mean and stan<strong>da</strong>rd <strong>de</strong>viation of the normalized hypervolumeobtained for each configuration, evaluated over 10 runs and10 instances of size 100x20. A bold face indicates that there is astatistically significant difference (see text for <strong>de</strong>tails) in favor ofa given configuration versus conf hand , and an italic face that thedifference is in favor of conf hand .conf hand conf tun−rnd conf tun−icmean sd mean sd mean sd(C max , SFT) 0.943 0.058 0.968 0.056 0.971 0.058(C max , TT) 1.005 0.043 1.008 0.045 1.012 0.038(C max , WT) 1.013 0.043 1.028 0.039 1.025 0.04(SFT, TT) 0.621 0.129 0.755 0.117 0.761 0.133(SFT, WT) 0.951 0.037 0.922 0.051 0.962 0.0484. CONCLUSIONIn this work, we automatically configured a new state-of-the-artalgorithm for five bi-objective flow-shop problems. The hybridTP+PLS algorithm that we automatically configure is the same asin [2]. In this previous study we proposed a new state-of-the-artalgorithm for bi-objective permutation flow-shop scheduling, togetherwith a highly effective parametrization that should be usedfor each instance size. In this work, we automatically configuredthis hybrid algorithm and showed that the configuration we obtaine<strong>da</strong>re as-good or even slightly better than the ones originallyproposed. The hybrid multi-objective framework that we configureis generic and the same <strong>de</strong>sign procedure could be applied todifferent bi-objective combinatorial problems, potentially improvingover the current state-of-the-art for different problems.5. REFERENCES[1] M. López-Ibáñez and T. Stützle, “Automatic configurationof multi-objective ACO algorithms,” in Ant Colony Optimizationand Swarm Intelligence, 7th International Conference,ANTS 2010, ser. Lecture Notes in Computer Science,M. Dorigo et al., Eds. Springer, Hei<strong>de</strong>lberg, Germany, 2010,vol. 6234, pp. 95–106.[2] J. Dubois-Lacoste, M. López-Ibáñez, and T. Stützle, “A hybridTP+PLS algorithm for bi-objective flow-shop schedulingproblems,” Computers & Operations Research, vol. 38,no. 8, pp. 1219–1236, <strong>2011</strong>.[3] L. Paquete and T. Stützle, “A two-phase local search forthe biobjective traveling salesman problem,” in EvolutionaryMulti-criterion Optimization (EMO 2003), ser. LectureNotes in Computer Science, C. M. Fonseca et al., Eds.Springer, Hei<strong>de</strong>lberg, Germany, 2003, vol. 2632, pp. 479–493.[4] ——, “Stochastic local search algorithms for multiobjectivecombinatorial optimization: A review,” in Handbook of ApproximationAlgorithms and Metaheuristics, T. F. Gonzalez,ALIO-EURO <strong>2011</strong> – 241


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>C max3800 4000 4200 4400C max3750 3850 3950 4050 4150 4250∑ w iTi4e+04 8e+04 1.2e+05 1.6e+05[0.8, 1.0][0.6, 0.8)[0.4, 0.6)[0.2, 0.4)[0.0, 0.2)3800 4000 4200 4400C maxhand4e+04 8e+04 1.2e+05 1.6e+05∑ w iTi∑ w iTi2e+04 6e+04 1e+05 1.4e+05[0.8, 1.0][0.6, 0.8)[0.4, 0.6)[0.2, 0.4)[0.0, 0.2)3750 3850 3950 4050 4150 4250C maxhand2e+04 6e+04 1e+05 1.4e+05∑ w iTituning−1tuning−1Figure 1: Differences of the empirical attainment functions estimated over 10 runs for conf hand and conf tun−rnd , for two instances of size50x20, for conf hand (left) versus conf tun−rnd (right). The problem is PFSP-(C max , WT).Ed. Boca Raton, FL: Chapman & Hall/CRC, 2007, pp. 29–1—29–15.[5] D. S. Johnson, “Optimal two- and three-stage productionscheduling with setup times inclu<strong>de</strong>d,” Naval Research LogisticsQuarterly, vol. 1, pp. 61–68, 1954.[6] G. Minella, R. Ruiz, and M. Ciavotta, “A review and evaluationof multiobjective algorithms for the flowshop schedulingproblem,” INFORMS Journal on Computing, vol. 20, no. 3,pp. 451–471, 2008.[7] E. Zitzler, L. Thiele, M. Laumanns, C. M. Fonseca, andV. Grunert <strong>da</strong> Fonseca, “Performance assessment of multiobjectiveoptimizers: an analysis and review,” IEEE Transactionson Evolutionary Computation, vol. 7, no. 2, pp. 117–132, 2003.[8] C. M. Fonseca, L. Paquete, and M. López-Ibáñez, “An improveddimension-sweep algorithm for the hypervolume indicator,”in <strong>Proceedings</strong> of the 2006 Congress on EvolutionaryComputation (CEC 2006). Piscataway, NJ: IEEE Press,Jul. 2006, pp. 1157–1163.[9] J. Dubois-Lacoste, M. López-Ibáñez, and T. Stützle, “A<strong>da</strong>ptive“anytime” two-phase local search,” in Learning and IntelligentOptimization, 4th International Conference, LION4, ser. Lecture Notes in Computer Science, C. Blum andR. Battiti, Eds. Springer, Hei<strong>de</strong>lberg, Germany, 2010, vol.6073, pp. 52–67.[10] R. Ruiz and T. Stützle, “A simple and effective iteratedgreedy algorithm for the permutation flowshop schedulingproblem,” European Journal of Operational Research, vol.177, no. 3, pp. 2033–2049, 2007.[11] J. Dubois-Lacoste, M. López-Ibáñez, and T. Stützle, “Effectivehybrid stochastic local search algorithms for biobjectivepermutation flowshop scheduling,” in Hybrid Metaheuristics,ser. Lecture Notes in Computer Science, M. J.Blesa, C. Blum, L. Di Gaspero, A. Roli, M. Sampels, andA. Schaerf, Eds. Springer, Hei<strong>de</strong>lberg, Germany, 2009, vol.5818, pp. 100–114.[12] L. Paquete, M. Chiarandini, and T. Stützle, “Pareto local optimumsets in the biobjective traveling salesman problem:An experimental study,” in Metaheuristics for MultiobjectiveOptimisation, ser. Lecture Notes in Economics and MathematicalSystems, X. Gandibleux et al., Eds. Springer, 2004,vol. 535, pp. 177–200.[13] T. Lust and J. Teghem, “Two-phase Pareto local searchfor the biobjective traveling salesman problem,” Journal ofHeuristics, vol. 16, no. 3, pp. 475–510, 2010.[14] P. Balaprakash, M. Birattari, and T. Stützle, “Improvementstrategies for the F-race algorithm: Sampling <strong>de</strong>sign and iterativerefinement,” in Hybrid Metaheuristics, ser. LectureNotes in Computer Science, T. Bartz-Beielstein, M. J. Blesa,C. Blum, B. Naujoks, A. Roli, G. Rudolph, and M. Sampels,Eds. Springer, Hei<strong>de</strong>lberg, Germany, 2007, vol. 4771, pp.108–122.[15] M. López-Ibáñez, J. Dubois-Lacoste, T. Stützle, and M. Birattari,“The irace package, iterated race for automatic algorithmconfiguration,” IRIDIA, Université Libre <strong>de</strong> Bruxelles,Belgium, Tech. Rep. TR/IRIDIA/<strong>2011</strong>-004, <strong>2011</strong>.[16] M. López-Ibáñez, L. Paquete, and T. Stützle, “Exploratoryanalysis of stochastic local search algorithms in biobjectiveoptimization,” in Experimental Methods for the Analysis ofOptimization Algorithms, T. Bartz-Beielstein, M. Chiarandini,L. Paquete, and M. Preuss, Eds. Springer, Berlin,Germany, 2010, pp. 209–222.ALIO-EURO <strong>2011</strong> – 242


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Efficient paths by local searchL. Paquete ∗ J.L. Santos † D.J. Vaz ∗∗ CISUC, Department of Informatics Engineering, University of CoimbraPólo II, 3030-290 Coimbrapaquete@<strong>de</strong>i.uc.pt, dvaz@stu<strong>de</strong>nt.<strong>de</strong>i.uc.pt† CMUC, Department of Mathematics, University of Coimbra3001-454 Coimbrazeluis@mat.uc.ptABSTRACTIn this article, we <strong>de</strong>scribe an experimental analysis on a givenproperty of connectedness of optimal paths for the multicriteriashortest path problem. Moreover, we propose a local search thatexplores this property and compare its performance with an exactalgorithm in terms of running time and number of optimal pathsfound.Keywords: Multicriteria Optimization, Routing, Local Search,Shortest Path1. INTRODUCTIONMulticriteria shortest path problems arise in many applications.For instance, GPS systems allow choosing different criteria such astime or cost. However, there is no shortest path that optimizes allcriteria since the fastest path may not be the cheapest. For instance,highways are fast but expensive since they are tolled, whereas nationalroads are free of charge but slow. Hence, one has to <strong>de</strong>velopalgorithms that output a set of optimal paths representing the optimaltra<strong>de</strong>-off between the several criteria, from which the userchooses the most preferable.This work <strong>de</strong>scribes a large experimental analysis to un<strong>de</strong>rstandthe structure of the efficient paths that can be exploited from analgorithmic point of view. In particular, we aim to know whetherthose efficient paths are close to each other, according to a proper<strong>de</strong>finition of “closeness”. To know whether this holds for most ofthe instances is highly relevant, since we could use this informationto <strong>de</strong>velop even more effective algorithms [5]. Our experimentalresults reported indicate that a large number of instances presentsuch property. Therefore, we propose a local search that exploresthis property and compare it against an exact approach <strong>de</strong>scribedin the literature.2. NOTATION AND DEFINITIONSLet G be a network, G = (V,A) and w a mapping that <strong>de</strong>fines eacharc’s weight, w : A ↦→ Z Q . For the simplicity of notation, we willsay that a path is a sequence of arcs or no<strong>de</strong>s, <strong>de</strong>pending of thecontext. Let us also <strong>de</strong>note the set of feasible paths as P. The goalof this problem is to find the efficient set of paths as follows)min(∑f (p) := w 1 (a),..., ∑ w Q (a) (1)p∈P a∈pa∈pD.J. Vaz acknowledges its grant BII-2009 from Fun<strong>da</strong>ção <strong>de</strong> Ciênciae Tecnologia.The meaning of operator min is as follows: We say that a feasiblepath p dominates another feasible path p ′ if and only if f j (p) ≤f j (p ′ ) for j = 1,...,Q, with at least one strict inequality. If there isno feasible path that dominates p, then we say that p is an efficientpath. The set of all efficient paths is <strong>de</strong>noted by N E . The imageof the feasible set P forms a set of distinct points in the criterionspace. We say that a vector z is non-dominated if it is the image ofsome efficient path p ∈ N E . The set of all non-dominated vectorsis called the non-dominated set. In Eq. (1), operator min finds thenondominated set.A label correcting algorithm to solve this problem (or to find theefficient set) is given by Paixão and Santos [4], which consists ofan a<strong>da</strong>ptation of the algorithm given by Vincke [7]. Although thisalgorithm finds the efficient set, it is too slow for large networks. Inthis work, we propose a new local search algorithm that explores agiven property of the efficient paths that may improve the runningtime. We say that two paths, p 1 and p 2 , are adjacent if and onlyif, after removing the arcs in common, we obtain a single cycle inthe resulting undirected graph [3]. Also, we <strong>de</strong>fine the adjacencygraph G ′ , such that G ′ has a vertex for each efficient path p ∈ N Eand an edge between two vertices if and only if the correspondingpaths are adjacent. The algorithm that is reported here explores theconnectedness of efficient paths, which is <strong>de</strong>fined as the connectednessof G ′ . Although it is not necessarily true that the efficientset for a given network is connected [3], a large fraction of networksmay satisfy this condition. To the knowledge of the authors,connectedness of the efficient set only holds for particular cases ofknapsack problems [1, 6].3. CONNECTEDNESS ANALYSISIn the experimental investigation mentioned above, we used benchmarkinstances <strong>de</strong>scribed in the literature [4]. Those instances aregrouped in three categories according to their size: small, mediumand large. In each of the categories, there are 7 classes: RandomN:Random network (randomly generated arc), with the number ofno<strong>de</strong>s varying, and having constant <strong>de</strong>nsity and number of criteria;RandomD: Random network with constant number of no<strong>de</strong>sand number of criteria, but varying <strong>de</strong>nsity; RandomK: Randomnetwork with constant number of no<strong>de</strong>s and <strong>de</strong>nsity, but varyingthe number of criteria; CompleteN: Complete network with constantnumber of criteria, but varying number of no<strong>de</strong>s; CompleteK:Complete network with constant number of no<strong>de</strong>s, but varyingnumber of criteria; GridN: Grid (square mesh) with constant numberof criteria, but varying number of no<strong>de</strong>s; GridK: Grid (squaremesh) with constant number of no<strong>de</strong>s, but varying number of criteria.Each group corresponds to 50 distinct instances. For eachclass, there are 15-20 groups of 50 instances each. There are 19950instances, from which 6600 are small, 6550 are medium and 6800ALIO-EURO <strong>2011</strong> – 243


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Algorithm 1: Local Search AlgorithmInput: Network G = (V,A), s,t ∈ V .Output: Set S.T,S := /0Let v : P ↦→ Vfor each crit. q = 1,...,Q doa) Find T q , the reversed shortest path tree with root t oncrit. q.b) Let p ∈ T q be path from s to t.c) Flag p as not visited.d) v(p) := se) S := S ∪ pend forfor each path p in S that is not visited doa) Flag p as visited.for each no<strong>de</strong> i ∈ p from v(p) to t dofor each arc (i, j) ∈ A dofor each crit. q ′ = 1,...,Q doa) Let p (s,i) be a path from no<strong>de</strong> s to i and p (s,i) ⊆ pb) Let r ∈ T q ′ be the path from no<strong>de</strong> j to t in crit. q ′c) p ′ := p (s,i) ∪ {(i, j)} ∪ rd) v(p ′ ) := je) Flag p ′ as not visitedf) S := Filter(S ∪ {p ′ })end forend forend forend forare large. For each of those instances, the weight of each arc foreach criterion is generated randomly according to an uniform distributionin the range of [1,1000].We <strong>de</strong>veloped an algorithm for <strong>de</strong>tecting connectedness for a givenset of efficient paths. This algorithm outputs the number of connectedcomponents of the adjacency graph. For <strong>de</strong>tecting whethera given instance is connected according to the notion of connectedness<strong>de</strong>scribed in Section 2 we ran the algorithm for finding the setof efficient paths as <strong>de</strong>scribed by Paixão and Santos [4], and thenused this set as input to the algorithm <strong>de</strong>scribed above to <strong>de</strong>termineif the set of efficient paths was connected. All the small andmedium instances, along with some large instances that have beentested, were found to have the set of efficient paths connected.4. LOCAL SEARCH ALGORITHMThe local search algorithm presented in this section generates candi<strong>da</strong>teefficient paths that are neighbors with respect to the <strong>de</strong>finitionof adjacency given in Section 2. Note that the number ofefficient and neighbor paths can be exponentially large [2]. Therefore,we focus on a subset of neighbors whose size only <strong>de</strong>pendslinearly on the number of criteria, number of no<strong>de</strong>s and/or arcs.The local search works as follows. First, all the shortest pathsfrom every no<strong>de</strong> to the target and for each criterion are generatedby using Dijkstra’s algorithm. Then, for each one of these shortestpaths, new paths are generated from a path p as follows: for eachno<strong>de</strong> i of p from s to the target t, <strong>de</strong>viate from p at no<strong>de</strong> i throughan arc (i, j) and then, for each criterion, follow the shortest paththat was previously computed from no<strong>de</strong> j to the target. With thisprocedure, further new candi<strong>da</strong>tes for efficient paths are generated.The algorithm iterates over the procedure above for all paths thatare generated. To avoid generating repeated paths, at each newpath p ′ generated from path p, the algorithm starts from the firstno<strong>de</strong> in p ′ where the <strong>de</strong>tour occured. This no<strong>de</strong> will be <strong>de</strong>notedpercentage of solutions foundratio of CPU−time1008060402001e+041e+021e+001e−02●●●●●●K N K N K N DComplete Grid Random●●●●●●●●●●●●●K N K N K N DComplete Grid RandomFigure 1: Percentage of efficient solutions found (top) and ratio ofCPU-time between label correcting and local search (bottom). Theratio is shown in logarithmic scale. The white and grey boxplotsrepresent small and medium instances, respectively.by v(p) and for the shortest path initially <strong>de</strong>termined, we <strong>de</strong>finev(p) = s. We also <strong>de</strong>note a path that follows path p from no<strong>de</strong> s tono<strong>de</strong> i by p (s,i) . The resulting algorithm is shown in Algorithm 1.The procedure Filter(S) in the final step removes the dominatedpaths from set S At each iteration of the second loop, the algorithmuses a LIFO strategy to choose the next path from S. In or<strong>de</strong>r to<strong>de</strong>fine a stopping criterion, we use the following technique [5]:The algorithm flags each new path found as not visited; the pathbecomes visited when it is chosen to generate new paths. Thisalgorithm stops when all paths in S are flagged as visited.Figure 1 presents the experimental results obtained by using the localsearch algorithm as compared to the label correcting approach.The plot in the top gives a boxplot for the percentage of efficientpaths found by the local search algorithm for each instance typeand size. The plot in the bottom shows a boxplot for the ratioof CPU-time between the label correcting approach and the localsearch algorithm. The experimental results indicate that the localsearch algorithm behaved well in Random and Complete instances,where it finds over 80% of the efficient paths in RandomK andRandomN instances and between 50% and 90% in the remaining.For these classes of problems, the local search algorithm takes lessthan one tenth of the run-time of the exact approach. Additionally,in GridK instances, the local search finds more than 80% of theefficient paths, but it is slower than the exact approach. Finally,only a few portion of efficient paths was found by the local searchalgorithm in GridN instances.5. CONCLUDING REMARKSIn this article, we performed an experimental analysis of connectednessfor the multicriteria shortest path problem. The positiveresults obtained in this study suggest that local search algorithmsmay be an effective approach. We propose a local search algorithmthat explores a stricter version of the neighborhood consi<strong>de</strong>red for●●●●●●●●●ALIO-EURO <strong>2011</strong> – 244


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>connectedness. From our point of view, the results obtained by ourapproach were quite positive for most of the instance types, bothin terms of number of efficient paths found and running time.This approach can even be improved in terms of solution quality,mainly for instances of type Grid with increasing size, by consi<strong>de</strong>ringan extension of the neighborhood that is explored by ourapproach. However, it is an open question whether it would still beefficient in terms of running time as compared to exact algorithmsfor this problem. As for instances of type Grid with increasingnumber of objectives, a more efficient dominance check may improveour approach. Finally, we remark that this local search explorationcan be also applied for other problems <strong>de</strong>fined over networks,such as the multicriteria minimum spanning tree problem.6. REFERENCES[1] J. Gorski, L. Paquete, F. Pedrosa, Greedy algorithms for aclass of knapsack problems with binary weights, Computers& Operations Research, <strong>2011</strong>, in press.[2] P. Hansen, Bicriterion path problems, In G. Fan<strong>de</strong>l and T. Gal(Eds.), Multiple Criteria Decision Making Theory and Application,LNEMS 177, Springer, pp. 109–127, 1979.[3] M. Ehrgott, K. Klamroth, Connectedness of efficient solutionsin multiple criteria combinatorial optimization. EuropeanJournal of Operational Research, 97: 159–166, 1997.[4] J.P. Paixão and J.L. Santos, Labelling methods for the generalcase of the multiobjective shortest path problem - a computationalstudy. Working paper CMUC 07-42, University ofCoimbra, 2007.[5] L. Paquete, T. Stützle, On local optima in multiobjectivecombinatorial optimization problems. Annals of OperationsResearch, 156(1): 83–97, 2007.[6] F. Seipp, S. Ruzika, L. Paquete, On a cardinality constrainedmulticriteria knapsack problem, Report in WirtschaftsmathematikNr. 133/<strong>2011</strong>, University of Kaiserslautern. <strong>2011</strong>.[7] P. Vincke, Problémes multicritères, Cahiers du Centred’Etu<strong>de</strong>s <strong>de</strong> Recherche Opérationelle 16, 425–436, 1974ALIO-EURO <strong>2011</strong> – 245


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Solving a Multiobjective Flowshop Scheduling Problem by GRASP withPath-RelinkingIryna Yevseyeva ∗ Jorge Pinho <strong>de</strong> Sousa ∗ † Ana Viana ∗ ‡∗ INESC Porto, FEUP campus,Rua Dr. Roberto Frias, 378, 4200-465 Porto, Portugaliyevseyeva@inescporto.pt† <strong>Facul<strong>da</strong><strong>de</strong></strong> <strong>de</strong> <strong>Engenharia</strong> <strong>da</strong> Universi<strong>da</strong><strong>de</strong> do Porto,Rua Dr. Roberto Frias, s/n, 4200-465 Porto, Portugaljsousa@inescporto.pt‡ Instituto Superior <strong>de</strong> <strong>Engenharia</strong> do Porto,Rua Dr. António Bernardino <strong>de</strong> Almei<strong>da</strong>, 431, 4200-072 Porto, Portugalaviana@inescporto.ptABSTRACTIn this work, a hybrid metaheuristic for solving the biobjectiveflowshop problem with makespan and tardiness objectives is proposed.It is based on the well-known greedy randomized a<strong>da</strong>ptivesearch procedure (GRASP) with path-relinking a<strong>da</strong>pted to themultiobjective case. The proposed approach is tested on severalflowshop instances and compared to existing results from literaturewith the hypervolume performance measures.Keywords: Multiobjective, GRASP, Path-relinking, Scheduling,Flowshop1. INTRODUCTIONTraditionally, scheduling problems are solved with one objectiveat a time. For instance, minimization of makespan or tardiness.However, in reality several usually conflicting objectives need tobe optimized simultaneously. Solving multiobjective problems isnot easy, since there exists no single optimal schedule, and theschedule which provi<strong>de</strong>s the minimal makespan has a larger tardinesswhen compared to the schedule with minimal tardiness butlarger makespan. The multiobjective nature of the problem leadsto the search of the Pareto set of nondominated solutions.Optimizing any of these objectives is NP-hard [1] and exact methodsare able to solve only small size problems. On the other hand,metaheuristics are able to find approximate good quality solutionswithin feasible computational time. In metaheuristics, the searchis directed towards good (at least near optimal) approximate solutionsby applying some exploitation (or intensification) techniques,e.g. local search. At the same time, the search space is studied indifferent directions. This is done by applying some exploration(or diversification) techniques, e.g., multi-start with different randominitial points. Powerful metaheuristics are at the core of solvingNP-hard multiobjective problems. However, there is a largeexperience in the <strong>de</strong>velopment of heuristics for single objectivescheduling problems. In this work, an attempt to benefit fromboth of these approaches is ma<strong>de</strong> by <strong>de</strong>veloping a multiobjectivegreedy randomized a<strong>da</strong>ptive search procedure (GRASP) with pathrelinkingfor flowshop scheduling.Typically, in multiobjective optimization instead of the one bestperforming solution, a set of Pareto optimal solutions, relativelygood according to all objectives, is of interest. Similarly to EvolutionaryAlgorithms and Scatter Search, GRASP can work witha population of solutions initialized by randomized heuristic(s).The quality of solutions constructed with some heuristic may stillbe improved with local search techniques. On the other hand, interlinkinglocal optima with path-relinking contributes to the explorationof the search space between good solutions. The combinationof multi-start local search that performs exploitation ofthe objective space with path-relinking that explores the objectivespace results in a powerful multiobjective metaheuristic discussedin this work.2. BACKGROUND2.1. Flowshop scheduling mo<strong>de</strong>lIn a flowshop problem, n jobs have to be processed on m machineswith processing time p i j of each job j ∈ J on each machine i ∈I. Then, the total number of all possible schedules is equal to(n!) m . The goal is to find the schedule that is optimal accordingto some objective function(s). Usually, it is supposed that eachmachine can process only one job at a time without interruptionsand the or<strong>de</strong>r of jobs is the same on each machine. When theor<strong>de</strong>r of jobs in permutation is known, only n! possible schedulescan be constructed. This problem is called permutation flowshopscheduling problem (PFSP) and is the one consi<strong>de</strong>red in this paper.In this work, the schedule is constructed such that it minimizes themaximum makespan (completion time of the last job) C max andminimizes the tardiness T j , simultaneously (see [1] for examplesof other possible objectives). Processing of a job j on a machinei can start only after processing the same job on a machine i − 1is finished. Being C j the completion time of a job j on the lastmachine, C i j the completion time of a job j on a machine i, andd j a due <strong>da</strong>te of a job j, C i j = max{C i−1, j ; C i, j−1 } + p i j ; C max =max{C j }(∀ j), is the completion time of the last job on the lastmachine; and T j = max{C j − d j ,0} is the tardiness of a job j.2.2. Multiobjective optimizationAt the outcome of multiobjective optimization there is a set ofnon-dominated solutions. Each of such solutions is "optimal" inthe sense that improvements in one objective causes <strong>de</strong>gra<strong>da</strong>tionin some other one(s). The Pareto set of solutions can be foundALIO-EURO <strong>2011</strong> – 246


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>by evaluating all solution vectors in the objective space and findingnondominated solutions based on the Pareto dominance relation.This relation states that solution x s is better than solutionx p (x s ,x p ∈ R d ), if it is strictly better in at least one objective andnot worse on the rest of the objectives. Assuming minimization ofall objectives this can be written as follows: f k (x s ) ≤ f k (x p )∀k ∈{1,...,o} and ∃l ∈ {1,...,o} : f l (x s ) < f l (x p ), where f k (x s ), f k (x p ),k ∈ {1,...,o} are evaluations of x s , x p in the objective space R o .2.3. GRASPThe greedy randomized a<strong>da</strong>ptive search procedure (GRASP) was<strong>de</strong>veloped by Feo and Resen<strong>de</strong> in 1989 [2]. It is a multi-startheuristic that iteratively performs construction of solutions, e.g.,with some heuristic(s), and makes a local search around solutionsprovi<strong>de</strong>d by the construction phase. Restarting these two consecutivephases with random initial solutions provi<strong>de</strong>s diversity of thefinal results. On the other hand, local search insures exploitation ofneighbourhoods of the solutions found at the construction phase.GRASP with path-relinking was successfully applied to a singleobjective jobshop scheduling problems in [3], [4]. In this work,GRASP with path-relinking is a<strong>da</strong>pted to the multiobjective flowshop.3. MULTIOBJECTIVE GRASP WITH PATH-RELINKINGFOR THE PFSPOver the last <strong>de</strong>ca<strong>de</strong>s, many multiobjective metaheuristics havebeen <strong>de</strong>veloped, such as multiobjective tabu search, multiobjectivesimulated annealing, multiobjective genetic algorithms, etc.Most of them have already been applied to flowshop schedulingproblems. For a recent and comprehensive survey on solving multiobjectivePFSP the rea<strong>de</strong>r is referred to [5].The successful application of GRASP with path-relinking to singleobjective scheduling problems see e.g., [3], [4], has motivated theextension of GRASP to multiobjective PFSP in this work. GRASPhas already been applied to the multiobjective PFSP in [6]. However,their approach is based on the aggregation of multiple objectivesinto a single one, and, finally, solving a single objectiveGRASP. On contrary, the multiobjective GRASP with path-relinking(moGRASP-PR) proposed in this work allows searching the Paretoset of non-dominated solutions.3.1. Construction phaseThe original GRASP works with one solution at each iteration,on contrary, here, a population of initial solution is constructe<strong>da</strong>ccording to some heuristics. The i<strong>de</strong>a of using a population ofinitial solutions is common to evolutionary algorithms and scattersearch, and assumes working in parallel with some set of diversesolutions. Obtaining such solutions with heuristics guarantees thatinitial solutions are feasible and good according to at least one ofthe objectives.In the scheduling literature, there is a long time tradition of <strong>de</strong>velopingefficient heuristics for different types of objectives based ondispatching rules. For instance, for the makespan objective, theshortest processing time (SPT) heuristic stands as the best performingone, while for the tardiness objective, the earliest due<strong>da</strong>te (EDD) heuristics is reported to be the best [1]. On the otherhand, more complicated heuristics were <strong>de</strong>veloped for each typeof scheduling problems. For instance, for the flowshop, Nawaz,Du<strong>de</strong>k and Ham (NEH) heuristic [7] is consi<strong>de</strong>red to be the mostefficient one [8].In this work, several heuristics were selected for constructing aninitial population. For the makespan and tardiness objectives thebest solutions were obtained with the NEH heuristics with jobsinitially or<strong>de</strong>red according to the LPT and EDD rules respectively.Then, to diversify the initial population, usually at the cost of qualityof solutions, the rest of solutions is selected from two RestrictedCandi<strong>da</strong>te Lists (RCLs), constructed for the makespan objectiveaccording to the SPT rule and for tardiness with respect to the EDDrule.Assuming minimization of the maximum makespan C j = C max ,∀ j, only some jobs with the smallest makespan are selected intothe RCL. The α-part of the best jobs is <strong>de</strong>fined between the jobswith the minimal C j and maximal C j values of makespan. Consequently,C j = min(C j | j ∈ J a ) and C j = max(C j | j ∈ J a ), where J ais the set of unscheduled jobs. Then, the RCL can be <strong>de</strong>fined asfollowsRCL = { j ∈ J a |C j ≤ C j ≤ C j + α(C j −C j )}, (1)where the α parameter, such as 0 ≤ α ≤ 1 is selected <strong>de</strong>pendingon the <strong>de</strong>gree of randomness <strong>de</strong>sired. Similarly, the RCL for EDDis constructed.Selection from the RCL of the job to be scheduled is random andguarantees diversification of solutions selected in the initial populationfrom the same list. Half of the initial population is selectedfrom the RCL constructed based on SPT and the other half is takenfrom the RCL constructed based on EDD. Such construction createsgood solutions according to only one objective at a time, butnot with respect to both objectives simultaneously.3.2. Local search phaseConstruction with a greedy-randomized approach does not guaranteeoptimality of the solutions, that is why local search can stillimprove the quality of solutions by exploring the neighbourhoodsof the best solutions obtained at the construction phase. A neighbourhoodN(x) of a solution x ∈ X is a set of solutions that areobtained by slightly changing x (by an operation called move) insome specific way for each particular type of problem. When comparedto single objective optimization, where either the first improvingsolution better than the current one or the best among allpossible solutions in the whole neighbourhood is accepted, in themultiobjective case, all non-dominated solutions with respect tothe neighbourhood are accepted.Due to the importance of the or<strong>de</strong>r of jobs in the flowshop problem,the most efficient neighbourhoods for it are those that <strong>de</strong>stroy theor<strong>de</strong>r of jobs as less as possible. In this sense, the less <strong>de</strong>structiveis the insertion neighbourhood that removes a job from its currentposition and inserts in some other random position. The swappingneighbourhood that exchanges positions of two randomly selectedjobs is also shown to be efficient. Due to the quadratic growthof both neighbourhoods, when increasing the size of the problemsolved, usually, only some fixed-size sub-neighbourhoods of solutionsselected randomly are consi<strong>de</strong>red for evaluation. Exploringsub-neighbourhoods does not guarantee i<strong>de</strong>ntification of local optima.However, it is compensated by consuming less of the availablecomputation time.3.3. Path-relinkingIn the original GRASP, the local search exploits a region of thesearch space around some starting point, and exploration is compensatedby the multiple restarts. However, from evolutionarycomputation it is known that taking two good solutions and swappingparts of them may result in new good solutions. Such operationis known as crossover. Glover suggested a more <strong>de</strong>terministicapproach to trace solutions that are located on trajectories (orALIO-EURO <strong>2011</strong> – 247


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>paths) connecting good (or elite) solutions [9]. The basic i<strong>de</strong>a isto transform one of the solutions (initial one) in the direction ofthe other (guiding) solution. Small changes to the initial solutionallow to obtain new solutions located in between of the two relinkedsolutions. In [10], path-relinking was applied together withGRASP, and in [4], this approach was successfully applied to jobshop scheduling.There are different ways to perform path-relinking [11]. Here, anew solution is obtained by swapping elements of the initial solutionwith respect to the position of one of them in the guidingsolution. The new solution is evaluated on objectives and comparedto the initial solution. In case the new solution dominatesthe initial one it gets the chance to be ad<strong>de</strong>d to the archive, seenext section. In [12], it was <strong>de</strong>monstrated that for some problems,the solutions located near local optima are better than those locatedon the rest of the paths. This principle is used in the truncatingpath-relinking, where only sub-paths (and, consequently,sub-neighbourhoods) near to local optima are explored.Depending on the size of the set to be interlinked, the neighbourhoodcreated by path-relinking can be large, and, consequently,time-consuming. That is why here to reduce computational complexityand still keep the quality of solutions high, the truncatingpath-relinking is applied for all possible pairs of nondominated solutions.3.4. Archive of solutionsAll nondominated solutions found after construction, local searchand path-relinking phases are stored in an external archive. The solutionis ad<strong>de</strong>d to the archive if it is non-dominated with respect tothe solutions already stored in the archive. On the other hand, solutionsstored in the archive that become dominated by the currentsolution are removed from it. An archive of solutions has alreadybeen used in the elitist multiobjective GRASP for the quadraticassignment problem in [13].To reduce computational efforts, each new neighbour solution isfirst compared for the non-dominance with the original solutionfrom which it stemmed from, either initial solution of the localsearch (LS) or initial and/or guiding solutions in case of pathrelinking(PR). In case the new solution appears to be non-dominated,it is compared to all solutions stored in the archive for non-dominance.Such approach was suggested in [14] to avoid redun<strong>da</strong>nt computations.3.5. moGRASP-PR routineThe general scheme of the main loop of the moGRASP-PR cycleapplied to the current population is presented in Algorithm 1.Algorithm 1: moGRASP-PR main loopRequire: Number of generation NGEN, population size NINDgen = 0archive = {}while gen < NGEN doinit_pop = construct(NIND)archive ← non_dominated(init_pop ⋃ archive)neighbours = LS(archive)archive ← non_dominated(neighbours ⋃ archive)neighbours = PR(archive)archive ← non_dominated(neighbours ⋃ archive)neighbours = LS(archive)archive ← non_dominated(neighbours ⋃ archive)gen = gen + 1end whileAt the beginning, a population of solutions with size NIND = 102is constructed. The two solutions are constructed with the NEHheuristics for makespan and tardiness, respectively, initially or<strong>de</strong>re<strong>da</strong>ccording to the LPT and EDD heuristics. Then, for thepre<strong>de</strong>fined number of generations, NGEN = 20, the rest of thepopulation is composed of 50 solutions, constructed from the bestjobs selected randomly from the RCL based on the SPT rule, and50 solutions selected from the RCL based on the EDD rule. Thenon-dominated solutions are selected from all constructions andpreserved in the archive. Then, the archive is up<strong>da</strong>ted at each iterationafter new neighbours are created with either local search orpath-relinking.3.6. Performance assessmentAmong different performance measures available in the literature,the hypervolume [15] is consi<strong>de</strong>red to have good convergence properties[16]. The main disadvantages of the hypervolume is its computationalcomplexity. Recently, several efficient algorithms thattry to reduce computational time for calculating the hypervolumehave been proposed. In this work, the hypervolume is computedwith the improved dimension-sweep algorithm proposed in [17].4. EXPERIMENTAL RESULTSIn this work, we compare the performance of the multiobjectiveGRASP with path-relinking applied to the PFSP with the resultsof 21 best algorithms obtained in [5]. The experiments presentedin table 1 were performed on the well-known Taillard’s benchmarkfor the PFSP proposed in [18] and modified for the makespan andtardiness objectives in [5]. The set of 110 instances is availablefor download at http://soa.iti.es/files/Taillard_DueDates.7z. In thiswork, the instances with different combinations [n × m] of n jobsand m machines are used, such as [20,50] × [5,10,20] and [100 ×5]. The moGRASP-PR algorithm runs for 1 (for large size problems)to 3 (for small size problems) times on each instance (in total226 problems are solved) with 20 GRASP iterations in each run.The results presented in table 1 are the averages of hypervolumesfor each algorithm on all runs.Table 1: Comparative results with the hypervolume criterionMethod Hypervol Method HypervolMOSA_Varadhar 0.927 ε-NSGAII 0.71MOGALS_Arroyo 0.861 moGRASP-PR 0.703PESA 0.851 (µ + λ)-PAES 0.605PESAII 0.848 ε-MOEA 0.621PGA_ALS 0.815 PAES 0.588MOTS 0.795 MOSA_Suresh 0.851MOGA_Murata 0.755 SA_Chakrav 0.515CMOGA 0.741 PILS 0.43NSGAII 0.725 ENGA 0.426SPEA 0.724 A-IBEA 0.159CNSGAII 0.722 SPEAII 0.159A more <strong>de</strong>tailed analysis shows that the algorithm proposed inthis paper performs best compared to the rest of the algorithmsfor small <strong>da</strong>ta sets. However, for the large <strong>da</strong>ta sets it convergesprematurely. That is why, the overall hypervolume is not as highas expected. Some diversity preservation mechanism is planned tobe integrated into the algorithm in the near future.The moGRASP-PR algorithm is implemented in Python 2.6 on asingle Intel Core 2 Duo T9550 processor running at 2.66 GHz withALIO-EURO <strong>2011</strong> – 248


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>4 GB of RAM.5. CONCLUSIONS AND FUTURE WORKIn this work, a multiobjective GRASP with path-relinking is appliedto a biobjective flowshop problem with the makespan and thetardiness objectives. The approach was tested on several flowshopinstances and compared to existing methods with the hypervolumeperformance measures. The initial results are promising forthe small flowshop instances, but the performance of the algorithmshould be investigated for the larger ones.6. ACKNOWLEDGEMENTSThe authors are grateful to Nicolau Santos and Rui Rei for assistingwith the Python co<strong>de</strong> and to Gerardo Minella for comments onhis paper.7. REFERENCES[1] M. L. Pinedo, Scheduling: Theory, Algorithms, and Systems.Springer, 2008.[2] T. Feo and M. G. C. Resen<strong>de</strong>, “A probabilistic heuristic for acomputationally difficult set covering problem,” OperationsResearch Letters, vol. 8, pp. 67–71, 1989.[3] S. Binato, W. Hery, D. M. Loewenstern, and M. G. C. Resen<strong>de</strong>,“A grasp for job shop scheduling,” in Essays and Surveyson Metaheuristics. Kluwer Aca<strong>de</strong>mic Publishers, 2000,pp. 59–79.[4] R. M. Aiex, S. Binato, and M. G. C. Resen<strong>de</strong>, “Parallel graspwith path-relinking for job shop scheduling,” Parallel Comput.,vol. 29, no. 4, pp. 393–430, 2003.[5] G. Minella, R. Ruiz, and C. M., “A review and evaluation ofmultiobjective algorithms for the flowshop scheduling problem,”INFORMS Journal on Computing, vol. 20, no. 3, pp.451–471, 2008.[6] B. S. H. Khan, G. Prabhaharan, and P. Asokan, “A grasp algorithmfor m-machine flowshop scheduling problem withbicriteria of makespan and maximum tardiness,” Int. J. Comput.Math., vol. 84, no. 12, pp. 1731–1741, 2007.[7] M. Nawaz, J. E. E. Enscore, and I. Ham, “A heuristic algorithmfor the m-machine, n-job flow-shop sequencing problem,”Omega, vol. 11, no. 1, pp. 91 – 95, 1983.[8] E. Taillard, “Some efficient heuristic methods for the flowshop sequencing problem,” European Journal of OperationalResearch, vol. 47, no. 1, pp. 65–74, July 1990.[9] F. Glover, “Tabu search and a<strong>da</strong>ptive memory programing –advances, applications and challenges,” in Interfaces in ComputerScience and Operations Research. Kluwer, 1996, pp.1–75.[10] M. Laguna and R. Martí, “Grasp and path relinking for 2-layer straight line crossing minimization,” INFORMS Journalon Computing, vol. 11, pp. 44–52, 1999.[11] M. G. C. Resen<strong>de</strong> and C. C. Ribeiro, “Grasp with pathrelinking:recent advances and applications,” in Metaheuristics:Progress as Real Problem Solvers, T. Ibaraki,K. Nonobe, and M. Yagiura, Eds. Springer, 2005, pp. 29–63.[12] M. G. C. Resen<strong>de</strong>, R. Martí, M. Gallego, and A. Duarte,“Grasp and path relinking for the max-min diversity problem,”Comput. Oper. Res., vol. 37, no. 3, pp. 498–508, 2010.[13] H. Li and D. Lan<strong>da</strong>-Silva, “An elitist grasp metaheuristic forthe multi-objective quadratic assignment problem,” in EvolutionaryMulti-Criterion Optimization, ser. Lecture Notes inComputer Science, M. Ehrgott, C. Fonseca, X. Gandibleux,J.-K. Hao, and M. Sevaux, Eds. Springer, 2009, vol. 5467,pp. 481–494.[14] L. Paquete, M. Chiar, and T. Stützle, “Pareto local optimumsets in the biobjective traveling salesman problem: An experimentalstudy,” in Metaheuristics for Multiobjective Optimization,Lecture. Springer, 2004, pp. 177–200.[15] E. Zitzler and L. Thiele, “Multiobjective evolutionary algorithms:A comparative case study and the strength pareto approach,”IEEE Transactions on Evolutionary Computation,vol. 3, no. 4, pp. 257–271, 1999.[16] J. Knowles and D. Corne, “On metrics for comparing nondominatedsets,” in <strong>Proceedings</strong> of the 2002 Congress onEvolutionary Computation (CEC 2002), Honolulu, Hawaii,2002, pp. 711–716.[17] C. M. Fonseca, L. Paquete, and M. López-Ibáñez, “An improveddimension-sweep algorithm for the hypervolume indicator,”in IEEE Congress on Evolutionary Computation,Vancouver, Cana<strong>da</strong>, 2006, pp. 1157–1163.[18] E. Taillard, “Benchmarks for basic scheduling problems,”European Journal of Operational Research, vol. 64, no. 2,pp. 278–285, 1993.ALIO-EURO <strong>2011</strong> – 249


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Stabilized Column Generation for the Rooted Delay-Constrained Steiner TreeProblemMarkus Leitner ∗ Mario Ruthmair ∗ Günther R. Raidl ∗∗ Institute of Computer Graphics and Algorithms, Vienna University of TechnologyFavoritenstr. 9-11, 1040 Vienna, Austria{leitner, ruthmair, raidl}@ads.tuwien.ac.atABSTRACTWe consi<strong>de</strong>r the rooted <strong>de</strong>lay-constrained Steiner tree problemwhich arises for example in the <strong>de</strong>sign of centralized multicastingnetworks where quality of service constraints are of concern.We present a path based integer linear programming formulationwhich has already been consi<strong>de</strong>red in the literature for thespanning tree variant. Solving its linear relaxation by columngeneration has so far been regar<strong>de</strong>d as not competitive due tolong computational times nee<strong>de</strong>d. In this work, we show how tosignificantly accelerate the column generation process using twodifferent stabilization techniques. Computational results indicatethat due to the achieved speed-up our approach outperforms so-farproposed methods.Keywords: Network <strong>de</strong>sign, Stabilized column generation,Delay-constrained Steiner tree1. INTRODUCTIONWhen <strong>de</strong>signing a communication network with a central serverbroadcasting or multicasting information to all or some of the participantsof the network, some applications such as vi<strong>de</strong>o conferencesrequire a limitation of the maximal <strong>de</strong>lay from the serverto each client. Besi<strong>de</strong> this <strong>de</strong>lay-constraint minimizing the costof establishing the network is in most cases an important <strong>de</strong>signcriterion. As another example, consi<strong>de</strong>r a package shipment organizationwith a central <strong>de</strong>pot guaranteeing its customers a <strong>de</strong>liverywithin a specified time horizon. Naturally the organization aimsat minimizing the transportation costs but at the same time has tohold its promise of being in time. Such network <strong>de</strong>sign problemscan be mo<strong>de</strong>led as rooted <strong>de</strong>lay-constrained Steiner tree problem(RDCSTP), which is an NP-hard combinatorial optimization problem[1]. The objective is to find a minimum cost Steiner tree ofa given graph with the additional constraint that the total <strong>de</strong>layalong each path from a specified root no<strong>de</strong> to any other requiredno<strong>de</strong> must not exceed a given <strong>de</strong>lay bound.More formally, we are given an undirected graph G = (V,E) witha set V of n no<strong>de</strong>s, a fixed root no<strong>de</strong> s ∈ V , a set T ⊆ V \{s} of terminalor required no<strong>de</strong>s, a set S = V \(T ∪{s}) of optional Steinerno<strong>de</strong>s, a set E of m edges, a cost function c : E → Z + , a <strong>de</strong>layfunction d : E → Z + , and a <strong>de</strong>lay bound B ∈ Z + . A feasible solutionto the RDCSTP is a Steiner tree G S = (V S ,E S ), s ∈ V S , T ⊆V S ⊆ V, E S ⊆ E satisfying the constraints ∑ e∈PS (t) d e ≤ B, ∀t ∈ T ,where P S (t) ⊆ E <strong>de</strong>notes the edge set of the unique path from roots to terminal t. An optimal solution G ∗ S is a feasible solution withminimum costs c(G ∗ S ) = ∑ e∈E Sc e .2. PREVIOUS & RELATED WORKThere are many recent publications <strong>de</strong>dicated to this problem andits more special variants. Several metaheuristics have been appliedto the RDCSTP, such as GRASP [2, 3], path-relinking [4] and variableneighborhood search [3]. More heuristic approaches can befound for the spanning tree variant with T = V \ {s}, e.g. GRASPand variable neighborhood <strong>de</strong>scent (VND) in [5] and ant colonyoptimization and variable neighborhood search in [6]. Furthermore,preprocessing methods are presented in [6] to reduce the sizeof the graph significantly in or<strong>de</strong>r to speed up the solving process.Exact methods based on integer linear programming (ILP) havebeen explored by Leggieri et al. [7] who <strong>de</strong>scribe a compact exten<strong>de</strong>dno<strong>de</strong>-based formulation using lifted Miller-Tucker-Zemlininequalities. Since the used Big-M inequalities usually yield ratherlow linear programming (LP) relaxation bounds this formulationis improved by separating directed connection cuts. Several ILPapproaches for the spanning tree variant have been examined byGouveia et al. in [8] based on a path formulation solved by twodifferent methods. Stan<strong>da</strong>rd column generation (CG) turns out tobe computationally inefficient while a Lagrangian relaxation approachtogether with a fast primal heuristic exhibits better performance.A third approach reformulates the constrained shortestpath problem on a layered graph and solves it using a multicommodity flow (MCF) formulation. Since the size of the layeredgraph and therefore the efficiency of the according mo<strong>de</strong>l heavily<strong>de</strong>pends on the number of achievable discrete <strong>de</strong>lay values thisapproach can in practice only be used for instances with a quite restrictedset of achievable <strong>de</strong>lay values. Additionally a MCF mo<strong>de</strong>lusually suffers from the huge amount of flow variables used altogetherleading to a slow and memory-intensive solving process.Nevertheless solving these layered graph mo<strong>de</strong>ls turned out to bevery effective on certain classes of instances.3. PATH FORMULATIONIn this section we present a path based ILP formulation for theRDCSTP which is a straightforward modification of the mo<strong>de</strong>ldiscussed by Gouveia et al. [8] for the spanning tree variant ofthe RDCSTP. In our directed formulation we use arc set A containingan arc (s, j) for each edge {s j} ∈ E inci<strong>de</strong>nt to the rootno<strong>de</strong> and two oppositely directed arcs (i, j), ( j,i) for all otheredges {i j} ∈ E, i, j ≠ s. We further assume the edge cost and<strong>de</strong>lay functions to be <strong>de</strong>fined on the set of arcs too, i.e. c i j = c eand d i j = d e , ∀(i, j) ∈ A,e = {i j} ∈ E. The integer master problem(IMP) <strong>de</strong>fined by (1)–(6) is based on variables x i j ∈ {0,1},∀(i, j) ∈ A, which indicate arcs inclu<strong>de</strong>d in the directed solution.We further use path variables λ p ∈ {0,1}, ∀p ∈ P = ⋃ t∈T P t , whereP t ⊆ 2 A is the set of feasible paths from the root no<strong>de</strong> s to terminalt. Each path is represented by its arc set. A path p ∈ P t to terminalt ∈ T is feasible if and only if it satisfies the <strong>de</strong>lay bound, i.e.ALIO-EURO <strong>2011</strong> – 250


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>∑ (i, j)∈p d i j ≤ B. Variable λ p is set to one if path p ∈ P is realized.(IMP) min ∑ c i j x i j (1)(i, j)∈As.t. ∑ λ p ≥ 1p∈P t∀t ∈ T (2)∑x i j − λ p ≥ 0 ∀t ∈ T, ∀(i, j) ∈ A (3)p∈P t |(i, j)∈p∑ x i j ≤ 1 ∀ j ∈ V (4)(i, j)∈Ax i j ∈ {0,1} ∀(i, j) ∈ A (5)λ p ∈ {0,1} ∀p ∈ P (6)Since the number of feasible paths for each terminal t ∈ T andthus the total number of variables in the mo<strong>de</strong>l is in general exponentiallylarge, we apply CG – see e.g. [9, 10] – to solve theLP relaxation. We start with a small subset ˜P t ⊆ P t , ∀t ∈ T , ofpath variables λ p in the restricted master problem (RMP) wherethe integrality conditions on arcs (5) and paths (6) are replaced by(7) and (8), respectively. Further variables are ad<strong>de</strong>d on <strong>de</strong>man<strong>da</strong>ccording to the solution of the pricing subproblem.x i j ≥ 0 ∀(i, j) ∈ A (7)λ p ≥ 0 ∀p ∈ P (8)Let µ t ≥ 0, ∀t ∈ T , <strong>de</strong>note the dual variables associated to theconvexity constraints (2) and πi t j ≥ 0, ∀t ∈ T , ∀(i, j) ∈ A, <strong>de</strong>notethe dual variables associated to the coupling constraints (3). Thenthe pricing subproblem is <strong>de</strong>fined as(t ∗ , p ∗ ) = argmin t∈T,p∈Pt − µ t + ∑(i, j)∈pπi t j . (9)Hence we need to solve a resource constrained shortest path problemon a graph (V,A) with nonnegative arc costs πi t j , ∀(i, j) ∈ A,for each terminal t ∈ T . We solve each such problem in pseudopolynomialtime O(B · |A|) using the dynamic programming base<strong>da</strong>lgorithm from [8]. As long as path variables λ p , p ∈ P t , t ∈ T withnegative reduced costs ¯c = −µ t +∑ (i, j)∈p πi t j exist, we need to ad<strong>da</strong>t least one of them and resolve the RMP. This process is repeateduntil no further variable with negative reduced costs exists.In each iteration we add for each terminal t ∈ T multiple path variablesusing the approach from [8]: We consi<strong>de</strong>r all no<strong>de</strong>s v ∈V thatare adjacent to terminal t and all <strong>de</strong>lay bounds b = 0,...,B − d vtfor which a path from s to v in conjunction with arc (v,t) is a feasiblepath to t. In case a shortest path p to v of total <strong>de</strong>lay b,b = 0,...,B − d vt , exists and p ′ = p ∪ {(v,t)} yields negative reducedcosts, the corresponding variable is ad<strong>de</strong>d to the RMP.4. COLUMN GENERATION STABILIZATIONIt is well known that basic CG approaches typically suffer fromcomputational instabilities such as <strong>de</strong>generacy or the tailing-offeffect [11] which often increase the nee<strong>de</strong>d computational effortfor solving them dramatically. Stabilization techniques to reducethe effects of these instabilities are usually classified intoproblem specific approaches such as the usage of dual-optimalinequalities [12, 13] and problem in<strong>de</strong>pen<strong>de</strong>nt approaches, seee.g. [14, 15]. The latter are often based on the concept of stabilitycenters and <strong>de</strong>viations from a current stability center are penalized,e.g. by using piecewise linear penalty functions. Recently,we showed how to significantly accelerate the CG process for asurvivable network <strong>de</strong>sign problem without modifying the RMPby choosing alternative dual optimal solutions when solving thepricing subproblem [16, 17, 18]. In the following we will a<strong>da</strong>ptthis technique for the RDCSTP before we discuss an alternativestabilization approach based on piecewise linear penalty functions.4.1. Alternative Dual Optimal SolutionsLet γ j ≤ 0, ∀ j ∈ V , be the dual variables associated to constraints(4) and ˜P = ⋃ t∈T ˜P t <strong>de</strong>note the set of paths for which correspondingvariables have already been inclu<strong>de</strong>d in the RMP. Then the dualof the RMP is given by (10)–(15).s.t.max ∑ µ t + ∑ γ j (10)t∈T j∈V∑ πi t j + γ j ≤ c i j ∀(i, j) ∈ A (11)t∈Tµ t − ∑(i, j)∈pπi t j ≤ 0 ∀t ∈ T, ∀p ∈ ˜P t (12)µ t ≥ 0 ∀t ∈ T (13)πi t j ≥ 0 ∀t ∈ T, ∀(i, j) ∈ A (14)γ j ≤ 0 ∀ j ∈ V (15)Let (µ ∗ ,π ∗ ,γ ∗ ) <strong>de</strong>note the current dual solution computed by theused LP solver when solving the RMP. It is easy to see that forarcs A ′ not part of any so far inclu<strong>de</strong>d path – i.e. A ′ = {(i, j) ∈A | ∄p ∈ ˜P : (i, j) ∈ p} – any values for the dual variables π i j areoptimal as long as ∑ t∈T π t i j∗ + γ j ∗ ≤ c i j , ∀(i, j) ∈ A ′ , since theydo not occur in inequalities (12). Dual variable values π t i j∗ , t ∈ T ,may also be increased for arcs (i, j) ∈ A\A ′ if inequalities (11) arenot binding. We conclu<strong>de</strong> that any values π t i j ≥ πt i j∗ , ∀(i, j) ∈ A,∀t ∈ T , are dual optimal if ∑ t∈T π t i j ≤ ∑ t∈T π t i j∗ + δi j , ∀(i, j) ∈ Aholds, where δ i j = c i j + |γ j | − ∑ t∈T π t i j∗ , ∀(i, j) ∈ A. Note thatstate-of-the-art LP solvers such as IBM CPLEX, which we use inour implementation, usually choose minimal optimal dual variablevalues, i.e. π t i j∗ = 0, ∀t ∈ T , ∀(i, j) ∈ A ′ .On the contrary to most other stabilization approaches we do notmodify the RMP. Instead we aim to choose alternative dual optimalsolutions which facilitate the generation of those path variablesrelevant for solving the LP relaxation of the IMP by increasingthe dual variable values used as arc costs in the pricing subproblem.Obviously, we can simply split the potential increase δ i jequally to all relevant dual variables, i.e. use alternative dual variables¯π i t j = πt ∗ δi j + i j|T | , ∀t ∈ T , ∀(i, j) ∈ A. In our previous workfor a survivable network <strong>de</strong>sign problem [16, 17, 18], however, itturned out to be beneficial to initially use different dual optimalsolutions, one for each terminal t, which finally converge towards¯π i t j , ∀t ∈ T , ∀(i, j) ∈ A. Hence, we consi<strong>de</strong>r two additional variantswhose correspondingly used dual variables will be <strong>de</strong>noted as˜π i t j and ˆπt i j , ∀t ∈ T , ∀(i, j) ∈ A, respectively. Equation (16) <strong>de</strong>finesdual variable values ˜π i t′j used in the pricing subproblem when consi<strong>de</strong>ringterminal t ′ ∈ T . Parameter q ∈ N, 1 ≤ q ≤ |T |, is initiallyset to one and gradually incremented by max{1, |T |10 } in case nonegative reduced cost path has been found. After at most ten suchmajor steps ˜π i t′j = ¯πt′ i j , for each terminal t′ . Thus, we can terminatethe CG process if q = |T | and no path variables have been ad<strong>de</strong>d.{˜π i t′j = πi t ∗ δj + i jq if t = t ′πi t ∗,∀(i, j) ∈ A. (16)j otherwiseDual variable values ˆπ i t j correspond to ˜πt i j except for the fact thatq is directly set to |T | once no new negative reduced cost path canbe found when using q = 1.4.2. Piecewise Linear StabilizationAs mentioned above other successful stabilization techniques areoften based on penalizing <strong>de</strong>viations from a current stability cen-ALIO-EURO <strong>2011</strong> – 251


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>g(π)γ (l)ɛ + ζɛδ (l)π (l)Figure 1: Piecewise linear dual penalty function g(π).ter by adding piecewise linear penalty functions to the dual problem.Among these, especially five-piecewise linear function haveshown to frequently yield good results if all parameters are chosencarefully; compare [19]. In our case, however, preliminary testswith various settings and concrete parameter values showed thatdue to a large number of relatively time consuming up<strong>da</strong>tes of thestability center this concept does not seem to pay off. Since highdual variable values facilitate the generation of good path variablesit is reasonable to penalize only small dual variable values. Hencewe use a modified version of the approach from [14] where in eachmajor iteration l ∈ N, l ≥ 1, only dual variable values smaller thanthe current stability center π (l) |T |·|A|∈ R + are penalized accordingto vectors δ (l) ,γ (l) |T |·|A|∈ R + , see Figure 1. Let πi t j∗ , ∀t ∈ T ,∀(i, j) ∈ A, <strong>de</strong>note the dual variable values after the CG approachon the penalized mo<strong>de</strong>l at major iteration l terminates. If there existsat least one dual variable value in the penalized region – i.e. if∃t ∈ T ∧ (i, j) ∈ A : πi t j∗ < δ t (l)i j – we need to up<strong>da</strong>te the stabilitycenter according to the current dual solution – i.e. π (l+1) = π ∗– and correspondingly set δ (l+1) and γ (l+1) and continue the CGprocess. As has been shown previously [14] this process, whichneeds to be repeated until each dual variable value lies within anunpenalized region, terminates yielding the LP relaxation value ofthe IMP after finitely many steps.According to preliminary tests, the following settings have beenchosen for our computational experiments. We set ε = 0.3 andζ = 1, the size of the inner trust region T (l) = π (l) −δ (l) = 1 whileπ (l) − γ (l) = 5 · T (l) for all dimensions, i.e. ∀t ∈ T , ∀(i, j) ∈ A. LetA ′ <strong>de</strong>note the set of arcs used by the paths inclu<strong>de</strong>d in the initialmo<strong>de</strong>l. We set π t i j(1) = π ti j∗ + δi j , ∀t ∈ T , ∀(i, j) ∈ A ′ , π t i j(1) =πi t ∗ δj + i j|T | , ∀t ∈ T ,∀(i, j) /∈ A′ .5. COMPUTATIONAL RESULTSAll computational experiments have been performed on a singlecore of a multi-core system consisting of Intel Xeon E5540 processorswith 2.53 GHz and 3 GB RAM per core. We used IBMCPLEX 12.2 as LP solver and applied an absolute time limit of10000 CPU-seconds to all experiments. All preprocessing methodsmentioned in [6] are used to reduce the input graphs priorto solving. To build an initial set of paths a simple constructionheuristic is applied on Steiner tree instances: the <strong>de</strong>lay constrainedshortest paths to all terminal no<strong>de</strong>s are iteratively ad<strong>de</strong>d to the treedissolving possible cycles. On instances where T = V \ {s} weapply the Kruskal-based heuristic followed by a VND both introducedin [5]. Tables (1) and (2) report average CPU-times inseconds and nee<strong>de</strong>d iterations for different instance sets. In bothtables π ∗ <strong>de</strong>notes the unstabilized CG approach, and ¯π, ˆπ, and˜π refer to the three strategies discussed in Section 4.1 for usingalternative dual optimal solutions in the pricing subproblem. Thepiecewise linear stabilization approach from Section 4.2 is <strong>de</strong>notedby PL, Lag G and CG G <strong>de</strong>note the Lagrangian and CG approachfrom [8], respectively. The results of the latter two have, however,been computed on a different hardware using an ol<strong>de</strong>r CPLEX versionfor the CG approach and are thus not directly comparable.πCPU time [s]IterationsSet B Lag G CG G π ∗ ¯π ˆπ ˜π PL CG G π ∗ ¯π ˆπ ˜π PLr,100 100 493 4752 314 13 15 10 72 1041 189 25 39 92 115150 639 8215 111 10 8 8 48 12561 357 26 42 98 144200 288 10001 123 4 4 8 46 18736 904 28 41 102 238250 526 10001 261 5 4 9 71 24881 1676 32 44 115 325c,100 100 809 10026 38 10 9 12 78 480 176 31 44 96 171150 544 10034 135 26 15 18 142 329 346 41 56 118 187200 711 10061 1151 50 37 21 367 314 697 58 69 123 311250 1066 10076 3779 43 27 25 500 327 2702 68 78 141 444e,100 100 976 10033 481 90 75 25 598 239 208 40 64 115 307150 1817 10106 3980 705 356 66 2927 193 364 52 84 138 403200 2972 10096 9297 5148 2670 177 8607 209 397 92 123 172 459250 4008 10104 10000 7013 3489 142 9090 195 357 98 160 203 339r,1000 1000 971 8064 25 7 6 11 25 891 119 22 39 96 841500 1744 8538 112 12 10 16 60 4240 253 27 43 112 1182000 869 10002 220 11 11 20 70 15600 716 28 42 114 1252500 790 10007 535 14 12 20 89 18233 1527 34 48 124 156c,1000 1000 668 8186 60 26 24 18 82 869 91 26 38 84 1091500 942 10024 112 30 25 33 111 418 163 37 46 104 1222000 2389 10037 788 68 57 34 235 451 401 36 58 109 1882500 1256 10037 1272 70 44 48 425 437 953 53 62 122 261e,1000 1000 2846 10065 137 52 34 25 474 615 129 34 56 107 1651500 3041 10031 4540 711 378 71 2787 469 266 53 70 130 2962000 5882 10083 8423 1814 897 134 6418 396 254 71 95 172 4432500 5726 10070 10000 4583 2222 183 9468 385 176 88 136 181 439Table 1: Results for instance sets from [8] consisting of five completegraphs with 41 no<strong>de</strong>s, T = V \{s}, different graph structures(r, c, e), <strong>de</strong>lay ranges (100, 1000), and bounds B.CPU time [s]Iterations|T ||V \{s}| B π ∗ ¯π ˆπ ˜π PL π ∗ ¯π ˆπ ˜π PL0.3 30 19 6 6 10 36 143 30 36 92 8450 139 15 16 23 55 413 41 50 124 102100 2849 97 89 55 509 1345 44 55 149 1940.7 30 77 29 29 34 171 142 32 46 93 19850 727 112 107 80 1091 561 51 62 130 475100 7942 819 923 253 7557 1361 79 92 182 9581.0 30 213 77 62 67 630 184 34 54 98 79750 1807 302 328 172 5769 614 56 81 142 2039100 9615 2615 2196 837 10000 851 86 123 214 694Table 2: Results for 30 randomly generated complete graphs with|V | = 100, different sets of terminal no<strong>de</strong>s, <strong>de</strong>lays and costs uniformlydistributed in [1,99] and <strong>de</strong>lay bounds B.We conclu<strong>de</strong> that all stabilization methods based on alternativedual-optimal solutions lead to an enormous reduction of the necessaryCPU-time. While ˆπ performs best for easier instances, ˜πclearly outperforms all other approaches on har<strong>de</strong>r instances, i.e.on those which generally need more time. Stabilization based onpiecewise linear penalty functions outperforms unstabilized CG inthe majority of cases, but is clearly not competitive to our three approachesbased on alternative dual-optimal solutions. We furtherobserve that our unstabilized CG variant needs significantly lessiterations than the conceptually i<strong>de</strong>ntical one discussed by Gouveiaet al. [8]. We believe that next to a different CPLEX version,these differences are mainly based on choosing a better set of initialpath variables, more sophisticated graph preprocessing, andthe fact that we use the dual simplex algorithm which turned out toperform better than the primal one in our case. Comparing the relativecomputational times of the Lagrangian approach from [8] totheir CG approach with the speed-up achieved by our stabilizationmethods, we conclu<strong>de</strong> that the proposed stabilized CG method alsooutperforms this method. All approaches based on dual-optimalsolutions terminated before the time limit was met in all but one ofthe experiments reported in Table 1, while both unstabilized CGvariants and the piecewise linear stabilization approach failed todo so for a number of experiments.ALIO-EURO <strong>2011</strong> – 252


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>6. CONCLUSIONS & FUTURE WORKIn this paper we showed how to significantly accelerate a columngeneration approach based on a path formulation for the RDC-STP using alternative dual-optimal solutions in the pricing subproblem.We conclu<strong>de</strong> that this method does further outperforma stabilization method based on piecewise linear penalty functionsas well as a previously presented approach based on Lagrangianrelaxation [8]. We are currently extending the presented stabilizedcolumn generation towards a branch-and-price approach in or<strong>de</strong>rto compute proven optimal solutions to medium sized instances ofthe RDCSTP. In future, we also plan to consi<strong>de</strong>r additional pricingstrategies – e.g. by restricting the total number of path variables tobe inclu<strong>de</strong>d in each pricing iteration – and want to compare our approachto further stabilization techniques such as e.g. interior pointstabilization [15]. Finally, we also want to study the impact ofchoosing better initial columns computed by metaheuristics whichmay lead to further significant speed-up as well as implement theLagrangian relaxation approach from [8] for a fair comparison.7. REFERENCES[1] V. P. Kompella, J. C. Pasquale, and G. C. Polyzos, “Multicastingfor multimedia applications,” in INFOCOM ’92.Eleventh Annual Joint Conference of the IEEE Computer andCommunications Societies, IEEE, 1992, pp. 2078–2085.[2] N. Skorin-Kapov and M. Kos, “A GRASP heuristic for the<strong>de</strong>lay-constrained multicast routing problem,” TelecommunicationSystems, vol. 32, no. 1, pp. 55–69, 2006.[3] Y. Xu and R. Qu, “A GRASP approach for the <strong>de</strong>layconstrainedmulticast routing problem,” in <strong>Proceedings</strong> ofthe 4th Multidisplinary International Scheduling Conference(MISTA4), Dublin, Ireland, 2009, pp. 93–104.[4] N. Ghaboosi and A. T. Haghighat, “A path relinking approachfor <strong>de</strong>lay-constrained least-cost multicast routingproblem,” in 19th IEEE International Conference on Toolswith Artificial Intelligence, 2007, pp. 383–390.[5] M. Ruthmair and G. R. Raidl, “A kruskal-based heuristic forthe rooted <strong>de</strong>lay-constrained minimum spanning tree problem,”in EUROCAST 2009, ser. LNCS, R. Moreno-Díazet al., Eds., vol. 5717. Springer, 2009, pp. 713–720.[6] ——, “Variable neighborhood search and ant colony optimizationfor the rooted <strong>de</strong>lay-constrained minimum spanningtree problem,” in PPSN XI, Part II, ser. LNCS, R. Schaeferet al., Eds., vol. 6239. Springer, 2010, pp. 391–400.[7] V. Leggieri, M. Haouari, and C. Triki, “An exact algorithmfor the Steiner tree problem with <strong>de</strong>lays,” Electronic Notes inDiscrete Mathematics, vol. 36, pp. 223–230, 2010.[8] L. Gouveia, A. Paias, and D. Sharma, “Mo<strong>de</strong>ling and solvingthe rooted distance-constrained minimum spanning treeproblem,” Computers & Operations Research, vol. 35, no. 2,pp. 600–613, 2008.[9] C. Barnhart, E. L. Johnson, G. L. Nemhauser, M. W. P.Savelsbergh, and P. H. Vance, “Branch-and-price: Columngeneration for solving huge integer programs,” OperationsResearch, vol. 46, pp. 316–329, 1998.[10] G. Desaulniers, J. Desrosiers, and M. M. Solomon, Eds., ColumnGeneration. Springer, 2005.[11] F. Van<strong>de</strong>rbeck, “Implementing mixed integer column generation,”in Column Generation, G. Desaulniers, J. Desrosiers,and M. M. Solomon, Eds. Springer, 2005, pp. 331–358.[12] H. B. Amor, J. Desrosiers, and J. M. V. Carvalho, “Dualoptimalinequalities for stabilized column generation,” OperationsResearch, vol. 54, no. 3, pp. 454–463, 2006.[13] J. M. V. <strong>de</strong> Carvalho, “Using extra dual cuts to accelerateconvergence in column generation,” INFORMS Journal onComputing, vol. 17, no. 2, pp. 175–182, 2005.[14] H. B. Amor and J. Desrosiers, “A proximal trust-region algorithmfor column generation stabilization,” Computers &Operations Research, vol. 33, pp. 910–927, 2006.[15] L.-M. Rousseau, M. Gendreau, and D. Feillet, “Interior pointstabilization for column generation,” Operations ResearchLetters, vol. 35, no. 5, pp. 660–668, 2007.[16] M. Leitner, G. R. Raidl, and U. Pferschy, “Accelerating columngeneration for a survivable network <strong>de</strong>sign problem,” in<strong>Proceedings</strong> of the International Network Optimization Conference2009, M. G. Scutellà et al., Eds., Pisa, Italy, 2009.[17] M. Leitner and G. R. Raidl, “Strong lower bounds for a survivablenetwork <strong>de</strong>sign problem,” in ISCO 2010, ser. ElectronicNotes in Discrete Mathematics, M. Haouari and A. R.Mahjoub, Eds., vol. 36. Elsevier, 2010, pp. 295–302.[18] M. Leitner, G. R. Raidl, and U. Pferschy, “Branch-and-pricefor a survivable network <strong>de</strong>sign problem,” Vienna Universityof Technology, Vienna, Austria, Tech. Rep. TR 186–1–10–02, 2010, submitted to Networks.[19] H. B. Amor, J. Desrosiers, and A. Frangioni, “On the choiceof explicit stabilizing terms in column generation,” DiscreteApplied Mathematics, vol. 157, pp. 1167–1184, 2009.ALIO-EURO <strong>2011</strong> – 253


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Heuristics for Discrete Power Control - A Case-Study in Multi-Carrier DSLNetworksMartin Wolkerstorfer ∗ Tomas Nordström ‡ †∗ Telecommunications Research Center Vienna (FTW)Donau-City-Straße 1, A-1220 Vienna, Austria{wolkerstorfer, nordstrom}@ftw.atABSTRACTThe performance of multi-user digital subscriber line (DSL) networksis limited by the electro-magnetic coupling between twistedpair cables. The adverse effect of this coupling can be reducedby controlling the transmit powers of all lines. The correspondingmulti-user, multi-carrier power control problem can be mo<strong>de</strong>le<strong>da</strong>s a multi-dimensional nonlinear Knapsack problem whichhas previously motivated the application of various mathematical<strong>de</strong>composition methods. These methods <strong>de</strong>compose the probleminto a large number of combinatorial per-subcarrier problems. Ourmain contribution lies in the proposal and analysis of various lowcomplexityheuristics for these combinatorial problems. We provi<strong>de</strong>insights in the parameter setting as well as simulation resultson a large set of 6 and 30-user DSL scenarios. These show thatsimple randomized greedy heuristics perform well even in case ofa very stringent complexity budget and that the heuristics’ averagesuboptimality is <strong>de</strong>pen<strong>de</strong>nt on the targeted <strong>da</strong>ta-rate.Keywords: Power Control, DSL, Metaheuristics, Column Generation1. MOTIVATIONDigital subscriber lines (DSL) are the most wi<strong>de</strong>ly <strong>de</strong>ployed broadban<strong>da</strong>ccess technology to<strong>da</strong>y, with more than 320 million customersworld-wi<strong>de</strong> in 2010 [1]. DSL systems suffer from theelectro-magnetic coupling between the twisted pair cables whichinduces so called “far-end crosstalk” noise at the DSL receivers.This in turn is the main limiting factor for the <strong>da</strong>ta-rate performanceof current DSL mo<strong>de</strong>ms. Furthermore, to<strong>da</strong>y’s DSL systemsare based on discrete multi-tone (DMT) modulation whichsplits the available frequency bandwidth into in<strong>de</strong>pen<strong>de</strong>nt subchannels(“subcarriers”). We consi<strong>de</strong>r the problem of optimally controllingthe transmitted power levels on each of these subchannelsand hence the crosstalk noise as well as the conveyable total <strong>da</strong>tarate.This problem is also fun<strong>da</strong>mental in multi-carrier wirelessnetworks [2]. The classical objective is the maximization of aweighted sum of <strong>da</strong>ta-rates. Recently this technique has also beendiscovered useful for reducing the system power consumption inDSL [3, 4]. Current state-of-the-art multi-carrier power control algorithmsfor tens of subscriber lines are based on techniques suchas user-iterative power up<strong>da</strong>tes, dual relaxation of transmissionrate and sum-power constraints, as well as successive continuousand convex approximation, cf. [3, 5, 6] and the references therein.Dual relaxation results in the in<strong>de</strong>pen<strong>de</strong>nt optimization of a largenumber of per-subcarrier problems. The distinctive feature of thenonlinear Dantzig-Wolfe <strong>de</strong>composition [7, Ch. 23] based schemeThis work has been supported in parts by the Austrian Governmentand the City of Vienna within the competence center program COMET.in [8] is that it allows for the suboptimal solution of the in<strong>de</strong>pen<strong>de</strong>ntper-subcarrier problems.Our main contribution is the proposal of various heuristics forcomplexity reduction of solving the combinatorial per-subcarrieroptimization subproblems, thereby expanding upon the work in[8]. We begin in Section 2 by reviewing the optimization problemof controlling the transmit power in DSL. In Section 3 wethen turn to the main focus of this paper, namely the combinatorialper-subcarrier problems and various heuristics for their solution.Section 4 gives an example of the heuristics’ performance whenapplied in conjunction with the framework in [8] to solve the mainproblem from Section 2. Our conclusions are summarized in Section5.2. BACKGROUND - GLOBAL PROBLEMWe <strong>de</strong>note the in<strong>de</strong>x sets of users and subcarriers by U = {1,...,U}and C = {1,...,C}, respectively, where U and C are the totalnumber of users and subcarriers, respectively. The optimizationvariables are the power levels p c u of user u on subcarrier c,where we will compactly write p c ∈ R+ U for the power allocationof all users on subcarrier c. The <strong>da</strong>ta-rate of user u on subcarrierc is a nonlinear function ru c (p c ) [9] which notably <strong>de</strong>pendson the power allocation of all users on that subcarrier. Again, wewill compactly write r c (p c ) ∈ R U to <strong>de</strong>note all users’ rates onsubcarrier c. Reversely, the power allocation p c (r c ) for rates r ccan be computed as the unique [10] solution of a system of linearequations of size U × U. Power levels are constrained by aregulatory power mask constraint p c u ≤ ˆp c u and the implicit constraintru c (p c ) ∈ B,∀u ∈ U ,c ∈ C , motivated by practical modulationschemes which only support a discrete set of <strong>da</strong>ta-rates B.Altogether we may compactly write the set of feasible power allocationson subcarrier c asQ c = {p c |r c u (p c ) ∈ B, 0 ≤ p c u ≤ ˆp c u,∀u ∈ U }. (1)Additional to these per-subcarrier constraints the U users haveminimum target-rates R ∈ R+ U <strong>de</strong>pen<strong>de</strong>nt on the accepted servicelevel, as well as technology-<strong>de</strong>pen<strong>de</strong>nt maximum sum-power levelsP ∈ R+ U . Our optimization objective is <strong>de</strong>fined as the sum ofper-subcarrier objectives ¯f c (p c ,ŵ, ˘w),c ∈ C . These are given asthe following weighted sum of users’ transmit powers and rates¯f c (p c ,ŵ, ˘w) = ŵ ⊺ p c − ˘w ⊺ r c (p c ), ∀c ∈ C , (2)where the weights ŵ, ˘w ∈ R+ U allow us to tra<strong>de</strong>-off between rateand energy optimization, i.e., we can consi<strong>de</strong>r rate-maximizationand energy-minimization as special cases. We are now ready toformally write the optimization problem for multi-user power controlin DSL as the following multi-dimensional nonlinear Knap-ALIO-EURO <strong>2011</strong> – 254


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>sack problem [11]minimizep c ∈Q c ,∀c∈Csubject to∑c∈C¯f c (p c ,ŵ, ˘w)∑ r c (p c ) ≽ R,c∈C∑ p c ≼ P.c∈C3. THE COMBINATORIAL SUBPROBLEM3.1. Subproblem Formulation(3a)(3b)(3c)Name Abbr. ReferenceJoint Greedy Optimization JOGO [8]Sequential Greedy Optimization SEGO [8]Local Search LS Section 3.3Rollout Algorithm RA [15]Greedy Rand. A<strong>da</strong>pt. Search Proc. GRASP [14, Ch. 8]Iterated Local Search ILS [14, Ch. 11]Simulated Annealing SA [14, Ch. 10]Ant Colony System ACS [16]Randomized SEGO rSEGO Section 3.4Randomized LS rLS Section 3.4Solver “Couenne” COU [17]Optimal Branch-and-Bound OPT [8]After Lagrange relaxation of constraints (3b) and (3c), respectively,one faces for each subcarrier c ∈ C an in<strong>de</strong>pen<strong>de</strong>nt, nonlinear(and non-convex), wi<strong>de</strong>-sense combinatorial [12, Sec. 4.4]pricing subproblem in the form of [8]minimize f c(r c ) = ¯f c (p c (r c ),ŵ + ν, ˘w + λ), (4){r c |p c (r)∈Q c }where ν, λ ∈ R+ U are the Lagrange multipliers associated withconstraints (3b) and (3c), respectively, cf. [8] for <strong>de</strong>tails. Note thatfor reasons of algorithm <strong>de</strong>sign we use the discrete vector r c as ourvariables instead of the uniquely coupled power allocation p c use<strong>da</strong>bove. In the following sections we study the solution of the subproblemin (4) and therefore omit subcarrier indices c for ease ofnotation. The search space we consi<strong>de</strong>r is × u∈U B, i.e., we onlysearch over discrete rate allocations and do not explicitly consi<strong>de</strong>rthe constraints in (4). An allocation r violating these constraintshas by <strong>de</strong>finition an objective f (r) = ∞ and our algorithms therebynever traverse infeasible allocations. As mentioned above, in or<strong>de</strong>rto evaluate the objective f (r) and to <strong>de</strong>termine feasibility weneed to solve a linear system of equations, cf. (1) and (4). Later wewill introduce the number of evaluations of p(r) as a reproduciblecomplexity measure to compare different algorithms.The optimal solution of the problem in (4) was shown to have polynomialcomplexity in [8]. However, obtaining optimal solutionsfor practical values of U was found intractable for conventionalbranch-and-bound schemes [8, 13]. Furthermore, the number ofthese per-subcarrier problems is in the or<strong>de</strong>r of thousands in thenewest generations of DSL technology. This altogether motivatesour work on fast heuristics in the following sections.3.2. Constructive Greedy Base HeuristicsIn the full paper we review the greedy base heuristic as well as thesequential greedy heuristic in [8] and provi<strong>de</strong> an analysis of various6-user VDSL scenarios, cf. Section 3.5 for simulation <strong>de</strong>tails.This analysis shows that the suboptimality of the base heuristic iszero for all collocated network scenarios while the highest suboptimalityappears in classical near-far type of scenarios. This insightwill gui<strong>de</strong> the parameter settings of randomized heuristics below.Basically two approaches will be taken in the following to improveupon purely greedy schemes, namely a) a randomization of greedy<strong>de</strong>cisions, and/or b) randomized local searches.3.3. Local SearchLocal search schemes aim at iteratively improving a given solutionr. Their key ingredient is the <strong>de</strong>finition of a neighborhoodN (r) ⊆ × u∈U B around r from which a next candi<strong>da</strong>te allocationis picked, cf. [14] for various examples of local search schemes.Here we restrict ourselves to two possible neighborhood <strong>de</strong>fini-Table 1: Heuristics compared on the problem in (4).tions: The first is a simple one-step neighborhoodN (1) (r) = {˜r ∈ × u∈U B | ˜r u = r u ± ∆,˜r i = r i ,∀i ∈ U \ {u},u ∈ U }, (5)which contains all allocations ˜r that can be reached by perturbinga single element of r by ∆. The second used neighborhood isN (2) (r) =N (1) (r) ∪ N ¯ (2) (r), (6)¯ N (2) (r) ={˜r ∈ × u∈U B | ˜r u = r u ± ∆, ˜rū = rū ± ∆,˜r i = r i ,∀i ∈ U \ {u,ū},u ≠ ū,u,ū ∈ U }, (7)which contains all allocations ˜r that can be reached by perturbingat most two different elements of r by ∆. Furthermore, twoneighborhood search strategies are consi<strong>de</strong>red, namely the “firstimproving”and the “best-improving” search strategy, cf. [14, Ch.8].3.4. Heuristics Inspired by Meta-HeuristicsIn the full paper we will present <strong>de</strong>tailed <strong>de</strong>scriptions of variousheuristics for the bit-loading problem in (4) which are partly inspiredby well-known meta-heuristics, cf. the overview of all studie<strong>da</strong>lgorithms in Table 1. Rollout algorithms and rSEGO/ant colonysystem algorithms are <strong>de</strong>terministic and randomized sequential <strong>de</strong>cisionmaking algorithms, respectively. GRASP is an extension ofthe greedy base heuristic using randomization, while iterated localsearch, randomized local search, as well as simulated annealingare randomized local search schemes.3.5. Methodology, Simulations and DiscussionIn or<strong>de</strong>r to be able to compare to optimal schemes as in [8] werestrict ourselves in this section to U = 6 users. We constructour network scenarios using a set of specified line lengths L ={200,400,600,800}m, consi<strong>de</strong>ring all U-combinations with repetitions.For example, for U = 6 this results in( )|L | +U − 1m == 84, (8)Ugenerated network scenarios. Note that this allows us to i<strong>de</strong>ntifyscenarios where the given algorithms perform badly. Such scenarioswere used to initially set the algorithmic parameters. Basedon these settings various parameter changes were selected and theimpact on the average performance studied by Monte-Carlo simulation.As in [8] we use equal Lagrange multipliers λ u ,ν u , for allu ∈ U . For setting the parameters of the heuristics we chose Lagrangemultipliers λ u = 1,ν u = 0 and weights ˘w u = 0,ŵ u = 1/U,which leads to a maximum sum-rate in our 6-user scenarios [8].ALIO-EURO <strong>2011</strong> – 255


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>10 1 # Power EvaluationsSuboptimality [%]10 010 110 210 310 410 3 10 4Dantzig-WolfeMaster ProblemHeuristicsPer-SubcarrierSubproblemsCombinationHeuristicSolutionFigure 2: Framework [8] for applying heuristics in DSL.Suboptimality [%]1.510.5rLSILSSAGRASPrSEGOACS(a)010 10 10 5 10 0(b)Figure 1: Average suboptimality of randomized heuristics in 6-user VDSL scenarios; a) Depen<strong>de</strong>ncy on the complexity budget;b) Depen<strong>de</strong>ncy on Lagrange multipliers λ u = λ,∀u ∈ U .We further make the practical assumption that there is a restrictionin simulation time for solving the subproblems in (4). However, inor<strong>de</strong>r to make our results reproducible we will use the number ofpower evaluations p(r) by solving a linear system of equations [9]as the stopping criterion of all consi<strong>de</strong>red heuristics. Note that thismetric also inclu<strong>de</strong>s evaluations of infeasible allocations, cf. thediscussion in Section 3.1. Simulation results provi<strong>de</strong>d in the fullpaper further motivate this complexity metric as it was found topreserve the comparability among different heuristics. While forsix users we were able to compute the optimum of the problem in(4), for a higher number of users we either compare to the greedybase heuristic JOGO due to its simplicity, or to a lower bound beingthe optimal objective of a discrete, convex problem relaxation,cf. [8, Alg. 5] for an analytic solution with O(U) complexity. Wefind that this lower bound gives a low gap to the optimal objectivewhen the Lagrange multipliers λ (and therefore the users’ rates)are low. Simulations were carried out using our DSL simulatoravailable in [18] and using common parameters as in [8].We investigated the solution quality of all presented heuristics forsolving the subproblems in (4) in a VDSL system with 1635 subcarriers,where for the comparisons in this section we only selecta subset of subcarriers C ˜ = 1,51,...,1601. As a benchmarkfor all our algorithms we use “Couenne” [17], a free branch-andboundbased solver for non-convex mixed-integer problems. As abase-line for our stochastic heuristics we ad<strong>de</strong>d a randomized localsearch (rLS) scheme where the LS algorithm is reinitialized at randomstarting points r uniformly drawn from × u∈U B. In the fullpaper we provi<strong>de</strong> the specific parameter settings and the intuitionsbehind these settings for all heuristics <strong>de</strong>scribed in Section 3.4.Figure 1(a) <strong>de</strong>picts the average suboptimality of all randomizedheuristics as a function of the complexity budget in various 6-userVDSL network scenarios. Intuitively, allowing the algorithms totest more solutions leads on average to a better performance. ACSperforms best in these test scenarios, where its curve stops at 10 3as it is optimal on the simulated points beyond that. Note that rLSeventually performs better than ILS and SA, which hints at insufficientdiversification capabilities of these two schemes. Figure1(b) similarly shows, for fixed complexity budget of 10 3 evaluations,the <strong>de</strong>pen<strong>de</strong>ncy of the heuristics’ average suboptimality onthe Lagrange multipliers λ u = λ,∀u ∈ U , and hence on the targetedtransmission rate as the average rate per user increases withthese multipliers. JOGO, which is used as an initial incumbent forall schemes, was found to have a monotonously increasing suboptimalitywith λ. Also, the optimal rates do not change in most scenariosabove λ = 10 −2 . Differently to JOGO, all heuristics showa peak suboptimality for a specific multiplier value, however, atdifferent values for different heuristics. Intuitively this can be explainedby the fact that with increasing λ what matters most is thetotal number of bits achieved by all users. Then it matters lesshow the bits are distributed among the users as this distributiononly influences the power consumption which has a comparablylow weight in the objective for high λ. In 30-user VDSL scenarioswith a complexity limit of 2 · 10 4 power evaluations per subcarrierproblem and using the same parameter settings for all algorithmsas above the picture is very different. The randomized heuristicsGRASP, rSEGO and ILS perform now best, with an improvementupon the objective values achieved by the greedy base heuristic byon average 9.9%, 9.8%, and 9.2%, respectively. Note howeverthat the simple <strong>de</strong>terministic extension of the greedy constructiveheuristic by a two-step local search improves the greedy heuristicalready by on average 8% while taking on average only 0.4 · 10 4power evaluations. Notably, the maximum improvement in sumobjectiveover all 33 tested subcarriers encountered in any testednetwork scenario was as high as 32%. Further insights in the performanceof all heuristics in Table 1 for 6 and 30-user VDSL scenarioswill be given in the full version of this paper.4. PERFORMANCE EVALUATION FOR DSLThe purpose of this section is to provi<strong>de</strong> evi<strong>de</strong>nce of the practicalusefulness of the proposed approach based on heuristics. As theseonly target the subproblems in (4) we exemplarily apply the heuristicrSEGO in conjunction with the complexity reduction techniquein [19] and the column generation framework in [8], cf. Figure 2.The algorithm consisting of these techniques is compared to stateof-the-artmulti-carrier power control algorithms [5, 20]. Whenusing the dual relaxation based, iterative spectrum balancing algorithm(ISB) in [5] we subsequently use the greedy central discretebit-loading algorithm (CDBL) in [20] to obtain a discrete feasiblesolution. As an example of the DSL performance we consi<strong>de</strong>r asum-rate maximization problem in a near-far downstream scenariowith 50 collocated users, where 40 lines connect to the central officeat a distance of 800m and 10 lines connect to a closer remotecabinet at 200m distance. Compared to CDBL we obtain a 2.1%sum-rate increase, or more importantly an 8.3% sum-rate increasefor the lines connected to the central office. Comparing to ISB ourresults show an 8.2% increase in total sum-rate and a 12.3% increasein sum-rate for the lines connected to the central office. Thefull paper will provi<strong>de</strong> simulation settings and extensive averageperformance and complexity comparisons.ALIO-EURO <strong>2011</strong> – 256


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>5. CONCLUSIONSWe studied the application of various heuristics to a combinatorial,non-convex power allocation problem in digital subscriber lines(DSL). Parameter setting for various 6−user DSL scenarios allowedto obtain near-optimal results using several of the proposedrandomized heuristics. Un<strong>de</strong>r various 30−user scenarios extendingthe greedy constructive heuristic by a proposed local searchscheme gave already substantial improvements at low complexity.Randomized heuristics still gave slight improvements beyondthat for mo<strong>de</strong>rate complexity limits. Summarizing, the proposedheuristics have shown an average gain in objective value comparedto the greedy constructive heuristic of up to 10%.6. ACKNOWLEDGEMENTSThis work has been supported in parts by the Austrian Governmentand the City of Vienna within the competence center programCOMET.7. REFERENCES[1] F. Vanier, “World broadband statistics: Short report Q32010,” Point Topic Ltd., Tech. Rep., December 2010.[2] C. Wong, R. Cheng, K. Letaief, and R. Murch, “MultiuserOFDM with a<strong>da</strong>ptive subcarrier, bit, and power allocation,”IEEE Journal on Selected Areas in Communications, vol. 17,no. 10, pp. 1747–1758, October 1999.[3] M. Wolkerstorfer, D. Statovci, and T. Nordström, “Dynamicspectrum management for energy-efficient transmissionin DSL,” in IEEE ICCS 2008, Guangzhou, China, 19–21November 2008.[4] M. Guenach, C. Nuzman, K. Hooghe, J. Maes, andM. Peeters, “Reduced dimensional power optimization usingclass AB and G line drivers in DSL,” in IEEE GLOBECOMWorkshops, Miami, USA, 6–10 December 2010, pp. 1443–1447.[5] W. Yu and R. Lui, “Dual methods for nonconvex spectrumoptimization of multicarrier systems,” IEEE Transactions onCommunications, vol. 54, no. 7, pp. 1310–1322, July 2006.[6] P. Tsiaflakis, M. Diehl, and M. Moonen, “Distributed spectrummanagement algorithms for multiuser DSL networks,”IEEE Transactions on Signal Processing, vol. 56, no. 10, pp.4825–4843, October 2008.[7] G. Dantzig, Linear programming and extensions. PrincetonUniversity Press, 1963.[8] M. Wolkerstorfer, J. Jaldén, and T. Nordström, “Complexityreduction strategies for the discrete multi-carrier powercontrol problem,” Submitted to IEEE Transactions on SignalProcessing, January <strong>2011</strong>.[9] P. Gol<strong>de</strong>n, H. Dedieu, and K. Jacobsen, Eds., Fun<strong>da</strong>mentalsof DSL technology. Auerbach Publications, 2006.[10] R. Yates, “A framework for uplink power control in cellularradio systems,” IEEE Journal on Selected Areas in Communications,vol. 13, no. 7, pp. 1341–1347, September 1995.[11] T. Morin and R. Marsten, “Branch-and-bound strategies fordynamic programming,” Operations Research, vol. 24, no. 4,pp. 611–627, July-August 1976.[12] K. Price, R. Storn, and J. Lampinen, Differential Evolution.Springer, 2005.[13] P. Tsiaflakis, J. Vangorp, M. Moonen, and J. Verlin<strong>de</strong>n, “Alow complexity optimal spectrum balancing algorithm fordigital subscriber lines,” Signal Processing, vol. 87, no. 7,pp. 1735–1753, July 2007.[14] F. Glover and G. Kochenberger, Eds., Handbook of Metaheuristics.Kluwer Aca<strong>de</strong>mic Publishers, 2003.[15] D. Bertsekas, “Rollout algorithms for discrete optimization:A survey,” in Handbook of Combinatorial Optimization,D. Zu and P. Par<strong>da</strong>los, Eds. Springer, August 2010, to appear.[16] M. Dorigo and L. Gambar<strong>de</strong>lla, “Ant colony system: A cooperativelearning approach to the traveling salesman problem,”IEEE Transactions on Evolutionary Computation, vol. 1,no. 1, pp. 53–66, April 1997.[17] P. Belotti, “Couenne: A user’s manual,” Lehigh University,Tech. Rep., 2009.[18] (2008, October) xDSL simulator v3.1. [Online]. Available:xdsl.ftw.at[19] M. Wolkerstorfer and T. Nordström, “Coverage optimizationin DSL networks by low-complexity discrete spectrum balancing,”submitted to IEEE GLOBECOM <strong>2011</strong>, Houston,Texas, USA, 5–9 December <strong>2011</strong>.[20] J. Lee, R. Sonalkar, and J. Cioffi, “Multi-user discrete bitloadingfor DMT-based DSL systems,” in IEEE GLOBE-COM 2002, vol. 2, Taipei, Taiwan, China, 17–21 November2002, pp. 1259–1263.ALIO-EURO <strong>2011</strong> – 257


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>A HYBRID META-HEURISTIC FOR THE NETWORK LOAD BALANCINGPROBLEMDorabella Santos ∗ Amaro <strong>de</strong> Sousa † Filipe Alvelos ‡∗ Instituto <strong>de</strong> Telecomunicações3810-193 Aveiro, Portugaldorabella@av.it.pt† Instituto <strong>de</strong> Telecomunicações, Universi<strong>da</strong><strong>de</strong> <strong>de</strong> Aveiro3810-193 Aveiro, Portugalasou@ua.pt‡ Centro Algoritmi / DPS, Universi<strong>da</strong><strong>de</strong> do Minho4710-057 Braga, Portugalfalvelos@dps.uminho.ptABSTRACTGiven a capacitated telecommunications network with single pathrouting and an estimated traffic <strong>de</strong>mand matrix, the network loadbalancing problem is the <strong>de</strong>termination of a routing path for eachtraffic commodity such that the network load balancing is optimized,i.e., the worst case link load is minimized, among all suchsolutions, the second worst case link load is minimized, and soon... We discuss a meta-heuristic which runs a GRASP with PathRelinking procedure on a restricted search space <strong>de</strong>fined by ColumnGeneration. We discuss some computational results showingthat, for the network load balancing problem, this approach is successfulin obtaining good quality solutions in short running times.Keywords: Load Balancing, GRASP with Path Relinking, ColumnGeneration, Hybrid Meta-Heuristics1. META-HEURISTIC PROPOSALGreedy Randomized A<strong>da</strong>ptive Search Procedure (GRASP) is ameta-heuristic first introduced for the set covering problem [1].It is a multi-start local search method where, at each start, a solutionis randomly generated (with some greediness) and localsearch is applied to it to find its closest local minimum solution(in a minimization problem). Path Relinking (PR), originally propose<strong>da</strong>s an intensification method applied to tabu search [2], isa method that tries to find better solutions by the combination oftwo initial solutions. The common combination of GRASP withPR (GRASP+PR) is to run PR at the end of each GRASP iterationbetween its local minimum solution and one solution randomly selectedfrom a given list of elite solutions (please see [3] for a surveyon many applications where GRASP with Path Relinking hasbeen applied). The key i<strong>de</strong>a of our approach is to run GRASP+PRon a restricted search space instead of running it on the completesolution space of the problem. The restricted search space aims tomake the search more efficient provi<strong>de</strong>d that it contains good qualitysolutions. To manage the restricted search space, we use ColumnGeneration (CG): the initial restricted search space is composedby the columns generated by CG solving the LP relaxationof the original problem; then, during the search, the restrictedsearch space is modified by including new columns and/or excludingexisting columns. New columns are generated by CGsolving a perturbed problem which is <strong>de</strong>fined based on the currentincumbent solution of the GRASP+PR and on the LP valueof the original problem. This meta-heuristic may be seen as animplementation of the general framework for combining CG andmeta-heuristics entitled SearchCol (Meta-heuristic search by columngeneration) [4].2. NETWORK LOAD BALANCING OPTIMIZATIONConsi<strong>de</strong>r a telecommunications network mo<strong>de</strong>led on a graph G(N,A)where N is the set of network no<strong>de</strong>s and A is the set of networklinks connecting no<strong>de</strong>s. The link between no<strong>de</strong>s i ∈ N and j ∈ Nis <strong>de</strong>noted by {i, j} and each link {i, j} ∈ A has a given capacityc {i j} . Consi<strong>de</strong>r a set of commodities K, where each commodityk ∈ K is to be routed through a single path on the network and ischaracterized by its origin no<strong>de</strong> o k ∈ N, its <strong>de</strong>stination no<strong>de</strong> d k ∈ Nand its <strong>de</strong>mand b k > 0.Let P k be the set of paths available on graph G between the endno<strong>de</strong>s of k ∈ K and let δ pk be a binary parameter that is 1 if link{i j}{i, j} ∈ A is in the path p ∈ P k . To mo<strong>de</strong>l the optimization problem,we consi<strong>de</strong>r the following <strong>de</strong>cision variables: the binary variablesϕ p k which are 1 if path p ∈ P k is chosen as the routing path ofcommodity k ∈ K; and the real variables µ {i j} accounting for theload on link {i, j} ∈ A. The following set of constraints <strong>de</strong>fines thecomplete solution space:∑∑lll ∑ ϕ p k = 1 ∀k ∈ K (1)p∈P kb k δ pk{i j} ϕ p k = c {i j}µ {i j} ∀{i, j} ∈ A (2)k∈K p∈P kφ p k ∈ {0,1}, µ {i j} ∈ [0,1] (3)Constraints (1) guarantee that exactly one path of P k is chosen foreach k ∈ K, constraints (2) account for the loads on each link, andconstraints (3) are the domain constraints. The load balancing optimizationproblem uses the concept of lexicographical optimization.Given two vectors a = (a 1 ,...,a m ) and b = (b 1 ,...,b m ), vectora is lexicographically is said to be smaller than vector b if eithera 1 < b 1 or there exists an in<strong>de</strong>x l ∈ {1,...,m − 1} such that a i = b ifor all i ∈ l and a l + 1 < b l + 1. Now consi<strong>de</strong>r the vector of linkloads µ = (µ {i j} : {i, j} ∈ A) and let [µ] be the vector obtainedfrom by rearranging its elements in a non-increasing or<strong>de</strong>r. Theload balancing optimization problem can be <strong>de</strong>fined in a non-linearALIO-EURO <strong>2011</strong> – 258


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>manner as:llllexmin [µ] (4)Subject to(1) − (3)where lexmin <strong>de</strong>notes the lexicographical minimization of [µ],i.e., finding a vector [µ ∗ ] which is lexicographically minimal amongall possible vectors [µ]. It is known that the solution of the loadbalancing optimization problem can be obtained by solving a sequenceof mixed integer linear programming mo<strong>de</strong>ls. One suchmethod is the conditional means approach (for <strong>de</strong>tails, please see[5-6]). The use of CG to solve the LP relaxation if this sequenceof mo<strong>de</strong>ls is straightforward since constraints (1) <strong>de</strong>fine a subproblemfor each commodity k ∈ K whose solutions are paths. Forthis reason, CG is usually named Path Generation and the columnsgenerated by CG for the subproblem associated to commodity krepresent paths between its end no<strong>de</strong>s o k and d k in graph G.3. EFFICIENCY OF THE PROPOSED META-HEURISTICThe network load balancing problem was addressed in [7], whereGRASP+PR was applied to the restricted search space given by thecolumns generated by CG solving the LP relaxation of the problem.In that approach, the restricted search space is not modifiedduring the search and PR is done only between the local minimumsolution of each GRASP iteration and the incumbent solution (i.e,the elite list is composed by a single solution). That approach wascompared to the equivalent GRASP+PR applied to the completesolution space of the load balancing problem. The computationalresults showed that the constrained search space gives much betterresults because it contains good quality solutions and, due to itssize, enables to find them in much shorter running times.In or<strong>de</strong>r to test our meta-heuristic, we have <strong>de</strong>fined a set of 24instances based on the well known network topology of the NSFnetwork with 26 no<strong>de</strong>s and 42 links. In all test instances, we haverandomly generated a <strong>de</strong>mand matrix with the aim of emulatingdifferent possible real scenarios.In terms of PR, the computational results show that: (i) the use ofan elite list does not provi<strong>de</strong> significant improvements when PR isdone only with one randomly selected elite solution; (ii) there is asignificant improvement if PR is done to each of all elite solutionsprovi<strong>de</strong>d that the list size is not too large and the elite solutions arequite different; (iii) forward and backward PR was the best strategyin terms of heuristic efficiency.Since the proposed method is a stochastic process, it gives differentsolutions in different runs. In or<strong>de</strong>r to generate useful <strong>da</strong>ta forcomparison analysis, we have adopted the following methodology.Whenever we aim to compare two algorithms, we run both algorithms10 times, giving the same runtime to each, and compareeach pair of solutions. Then, we sum the number of times the secon<strong>da</strong>lgorithm was better than the first algorithm. We average thesenumbers over the set of test instances of interest and calculate, inpercentage, how many runs produce better results with the secon<strong>da</strong>lgorithm when compared with the first one.Globally, the best algorithm exhibits an efficiency performance improvementof 94.1% when compared to running GRASP+PR withno restricted search space modifications, showing that our metaheuristicis far better than the simplest one exploited in [7].4. REFERENCES[1] T. Feo and M. Resen<strong>de</strong>,“A probabilistic heuristic for a computationallydifficult set covering problem", Operations ResearchLetters, 8, pp. 67-71, 1989[2] F. Glover, “Tabu search and a<strong>da</strong>ptive memory programming- Advances, applications and challenges", in Interfaces inComputer Science and Operations Research, R.S. Barr, R.V.Helgason, and J.L. Kennington, (Eds.), Kluwer, pp. 1-75,1996[3] M. Resen<strong>de</strong> and C. Ribeiro, “GRASP with path relinking: recentadvances and applications", in Metaheuristics: Progressas Real Problem Solvers, T. Ibaraki, K. Nonobe and M. Yagiura,(Eds.), Springer, pp. 29-63, 2005[4] F. Alvelos, A. <strong>de</strong> Sousa and D. Santos, “SearchCol: metaheuristicsearch by column generation", in Hybrid Metaheuristics,M. Blesa, C. Blum, G. Raidl, A. Roli, M. Sampels(Eds.), Lecture Notes in Computer Science, Volume 6373,pp. 190-205, 2010[5] W. Ogryczak and T. Śliwiński, “On solving linear programswith the or<strong>de</strong>red weighted averaging objective", EuropeanJournal of Operational Research, 148, pp. 80-91, 2003[6] W. Ogryczak, M. Pióro and A. Tomaszewski, “Telecommunicationsnetwork <strong>de</strong>sign and max-min optimization problem",Journal of Telecommunications and Information Technology,3, 2005[7] D. Santos, A. <strong>de</strong> Sousa, F. Alvelos and M. Pióro, “Linkload balancing optimization of telecommunication networks:a column generation based heuristic approach", in Proc. ofNETWORKS Conference, IEEE Xplore, 2010ALIO-EURO <strong>2011</strong> – 259


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Mo<strong>de</strong>ling the collision avoi<strong>da</strong>nce for the ATM by a mixed 0–1 nonlinear approachAntonio Alonso Ayuso ∗ Laureano F. Escu<strong>de</strong>ro ∗ Francisco Javier Martín Campo ∗∗ Department of Statistics and Operational ResearchC/ Tulipán s/n, 28933, Móstoles, Madrid (Spain){antonio.alonso,laureano.escu<strong>de</strong>ro,javier.martin.campo}@urjc.esABSTRACTA 0–1 nonlinear mo<strong>de</strong>l for the Collision Avoi<strong>da</strong>nce in Air TrafficManagement (ATM) problem is presented. The aim of this problemis <strong>de</strong>ciding the best strategy for an arbitrary aircraft configurationssuch that all conflicts in the airspace are avoi<strong>de</strong>d where aconflict is the loss of the minimum safety distance that two aircrafthave to keep in their flight plans. A mixed 0–1 nonlinear optimizationmo<strong>de</strong>l based on geometric constructions is <strong>de</strong>veloped knowingthe initial flight plan (coordinates, angles and velocities in eachtime period) and minimizing the acceleration variations where aircraftare forced to return to the original flight plan when no aircraftare in conflict. A linear approximation by using iteratively Taylorpolynomials is <strong>de</strong>veloped to solve the problem in linear terms, aswell as a metaheuristic based on Variable Neigbourhood Search(VNS) in or<strong>de</strong>r to reduce the resolution time.Keywords: Air Traffic Management (ATM), Collision avoi<strong>da</strong>nce,Mixed 0-1 nonlinear optimization1. INTRODUCTIONAircraft conflict <strong>de</strong>tection and resolution is currently attracting theinterest of many air transportation service provi<strong>de</strong>rs and is concernedwith the following question: Given a set of airborne aircraftand their inten<strong>de</strong>d trajectories, what control strategy should be followedby the pilots and the air traffic service provi<strong>de</strong>r to preventthe aircraft from coming too close to each other?Several approaches can be found in the literature, where differentworks tackle the problem from different points of view. In Kucharand Yang (2000) [1] can be found several approaches until 2000.However, the <strong>de</strong>velopments from thereafter are very interesting.Some of the most important works can be found in Martín-Campo(2010) [2].The organization of the remain<strong>de</strong>r of the note is as follows. First,Section 2 presents the problem <strong>de</strong>scription. Section 3 introducesthe notation of the elements of the problem. Section 4 presentsthe mixed 0–1 nonlinear mo<strong>de</strong>l. Section 5 gives the main i<strong>de</strong>asof the iterative procedure for problem solving as well the VNDSmetaheuristic scheme. Section 6 shows the main computationalresults and, finally, section 7 conclu<strong>de</strong>s.2. PROBLEM DESCRIPTIONA 0–1 nonlinear constrained mo<strong>de</strong>l is <strong>de</strong>veloped by using the geometricand theoretical i<strong>de</strong>as from the Velocity Changes problem(VC) presented in Pallottino et al. (2002) [3] and the Velocity andAltitu<strong>de</strong> Changes problem (VAC) presented in Alonso-Ayuso et al.(2010) [4] and Martín-Campo (2010) [2]. The VC and VAC mo<strong>de</strong>lsassume instantaneous changes in velocity to avoid a conflict. Inthe new mo<strong>de</strong>l, so-called VCTP (Velocity Changes through TimePeriods), continuous velocity changes are proposed by using theproperties of a rectilinear movement and uniformly acceleratingmovement. The VCTP mo<strong>de</strong>l can also assume nonlinear trajectoriesby consi<strong>de</strong>ring the polygonal (in each time period) of thetrajectory.The mo<strong>de</strong>l suppose that the preliminary trajectories of F aircraftare known and it can be extracted the aircraft configurations inT fixed time points. In these points the velocity and the position(abscissa and ordinate) of each aircraft in each point and the motionangles between two points are known. With these <strong>da</strong>ta weconstruct a new mo<strong>de</strong>l for obtaining the optimal configuration bychanging the aircraft accelerations and avoiding all conflicts betweenthe aircraft.3. NOTATIONLet the following notation for the formulation of the mo<strong>de</strong>l:SetsF = {1,...,F}, set of aircraft in the airspace.T = {0,...,T }, set of time periods.Parameterss, safety distance between aircraft, usually, 2.5 nautical miles.e, distance bound to consi<strong>de</strong>r as a conflicting pair of aircraft.w 1 ,w 2 , weight (between 0 and 1) for each objective function term.div, integer parameter greater than 1 to be consi<strong>de</strong>red for the boundsof some variables.For all t ∈ T :I t , length of the time period between times instants t − 1 and t.For all f ∈ F and t ∈ T :x ∗tf ,y∗t f , initial configuration of position, abscissa and ordinate, foraircraft f in time period t, respectively.d ∗tf , covered distance for aircraft f during time period t in the initialconfiguration.v ∗tf , initial velocity configuration for aircraft f in time period t.a ∗tf , initial acceleration configuration for aircraft f in time periodt.r t f , safety radius for each aircraft f in time period t, usually 2.5nautical miles (nm).v t f ,vt f , minimum and maximum velocities allowed for aircraft fin time period t, respectively.a t f ,at f , minimum and accelerations allowed for aircraft f in timeperiod t, respectively.ALIO-EURO <strong>2011</strong> – 260


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>m ∗tf , direction of motion in (−π,π] for aircraft f in time period t.ˆx t f ,ŷt f , position parameters to be up<strong>da</strong>ted in the Taylor approximationfor aircraft f in time period t.dˆt f , distance parameter to be up<strong>da</strong>ted in the Taylor approximationfor aircraft f in time period t.ˆv t f , velocity parameter to be up<strong>da</strong>ted in the Taylor approximationfor aircraft f in time period t.c a+ft ,ca− ft , costs for positive and negative acceleration changes foraircraft f in time period t, respectively.c v ft , costs for the difference between the initial and optimal velocityconfiguration for aircraft f in time period t.c d ft , costs for the difference between the initial and optimal covereddistance for aircraft f in time period t.For all f ∈ F :x ∗td ff,y ∗td ff ,x ∗tr ff ,y ∗tr ff , <strong>de</strong>parture and arrival positions (abscissa andordinate) for aircraft f .d totf , total length of the polygonal of the trajectory for aircraft f .t d f ,tr f , scheduled <strong>de</strong>parture and arrival times for flight f .Data preprocessingFor all f ∈ F and t ∈ T :t f , covered distance for aircraft f during time period t, for f ∈ Fand t ∈ T .γ 1 ft ,γ2 ft ,γ3 ft ,γ4 ft ,γt f auxiliary 0–1 variables to mo<strong>de</strong>l the case ofearly or <strong>de</strong>lay for aircraft f in time period t, for f ∈ Fand t ∈ T .For all i, j ∈ F : i < j, for all t ∈ T :t = { max{t d i ,td j } + 1,...,min{tr i ,tr j } − 1} and n = 1,...,8:δi njt , auxiliary 0–1 variables for mo<strong>de</strong>ling the or-constraints.4. MIXED 0–1 NONLINEAR MODELNow, the full formulation for the VCTP mo<strong>de</strong>l is presented below,including all the aspects that have been studied above.minw 1 ∑ ∑f ∈F t∈T( c a +ft a t+fa t f − at f+ ca− ft a t−fa t f − at fsubject to ∀ f ∈ F ,∀t ∈ T : t = {t d f + 1,...,tr f }v t f ∀ f ∈ F ,∀t ∈ T : t = {t d f + 1,...,tr f })+ w 2 ∑f ∈F∑c d ftβ t f (1)t∈Tv t−1f + a t f I t v t f (2a)a t f a t f a t f (2b)d t f = v t−1fI t + 1 2 (at+ f− a t−f )I 2 t (3)x t f ,yt f ,dt f , upper bounds for variables x, y and d, respectively.x t f ,yt , lower bounds for variables x and y, respectively.fFor all i, j ∈ F : i < j, for all t ∈ T :t = { max{t d i ,td j } + 1,...,min{tr i ,tr j } − 1} :f c t i j , 0–1 parameter that <strong>de</strong>tects if there is a “false conflict” betweenaircraft i and j in time period t.p t i j , 0–1 parameter that will be 1 if the pair of aircraft i and jwill not be taken into account in time period t for conflictresolution. This parameter <strong>de</strong>pends on the criterion <strong>de</strong>ci<strong>de</strong>dby the ATC. Notice that this parameter will be 1 if f c t i j = 1.ip i jt , intersection point between the straight line trajectories of thecorresponding polygonal segment for time period t for aircrafti and j, if the trajectories are not parallel o coinci<strong>de</strong>nt.d 1 i jt , distance between the i aircraft position and ip i jt in time periodt.di 2 jt , distance between points ( x t i + cos(m∗t i ),y t i + sin(m∗t i ) ) andip i jt for aircraft i and j in time period t.VariablesFor all f ∈ F and for all t ∈ T :x t f ,yt f , the position (abscissa and ordinate) of aircraft f in timeperiod t, for f ∈ F and t ∈ T , respectively.a t f , acceleration variation for aircraft f in time period t, for f ∈ Fand t ∈ T . This variable is real and can be divi<strong>de</strong>d in twopositive variables, say, a t+f and a t−f , such that a t f = at+ f −a t−f as a traditional way in LP.a t+f , positive acceleration variation for aircraft f in time period t,for f ∈ F and t ∈ T .a t−f , negative acceleration variation for aircraft f in time period t,for f ∈ F and t ∈ T .dv t f , velocity for aircraft f in time period t, for f ∈ F and t ∈ T .∀i, j ∈ F : i < j ∧ p i j = 0,∀t ∈ T : t = { max{t d i ,td j } + 1,...,min{tr i ,tr j } − 1}v t (i cos(m∗ti )(1 − pc t i j) − sin(m ∗ti )pc t i j)−v t j− v t (i hti (1 − pc t i j) + h ′ tv t (i cos(m∗t(cos(m∗tj )(1 − pc t i j) − sin(m ∗tj )pc t i j)+ vtj(htj (1 − pc t i j) + h ′ tj pc t i j)) (vti + v t j)(1 − δ 1i jt)i pc t i j( (v ti |h t i| + v t j|h t j| ) (1 − pc t i j) + ( v t i|h ′ ti | + v t j|h ′ tj | ) pc t i ji )(1 − pc t i j) − sin(m ∗ti )pc t i j)−v t jv t (i kti (1 − pc t i j) + k ′ t− v t (i cos(m∗tv t (i hti (1 − pc t i j) + h ′ t− v t (i cos(m∗t(cos(m∗tj )(1 − pc t i j) − sin(m ∗tj )pc t i j)− vtj(ktj (1 − pc t i j) + k ′ tj pc t i j))(1 − δ 1i jt)) (vti + v t j)(1 − δ 2i jt)i pc t i j( (v ti |k t i| + v t j|k t j| ) (1 − pc t i j) + ( v t i|k ′ ti | + v t j|k ′ tj | ) pc t i ji )(1 − pc t i j) − sin(m ∗ti )pc t i j)+v t (j cos(m∗tj )(1 − pc t i j) − sin(m ∗tj )pc t i j)− vtj(htj (1 − pc t i j) + h ′ tj pc t i j))(1 − δ 2i jt)) (vti + v t j)(1 − δ 3i jt)i pc t i j( (v ti |h t i| + v t j|h t j| ) (1 − pc t i j) + ( v t i|h ′ ti | + v t j|h ′ tj | ) pc t i ji )(1 − pc t i j) − sin(m ∗ti )pc t i j)+v t (j cos(m∗tj )(1 − pc t i j) − sin(m ∗tj )pc t i j− v t (i kti (1 − pc t i j) + k ′ t)+ vtj(ktj (1 − pc t i j) + k ′ tj pc t i j))(1 − δ 3i jt)) (vti + v t j)(1 − δ 4i jt)i pc t i j( (v ti |k t i| + v t j|k t j| ) (1 − pc t i j) + ( v t i|k ′ ti | + v t j|k ′ tj | ) pc t i j∀ f ∈ F ,∀t ∈ T : t = {t d f + 1,...,tr f }δ 1i jt + δ 2i jt + δ 3i jt + δ 4i jt = 1)(1 − δ 4i jt)(4a)(4b)(4c)(4d)(4e)(4f)(4g)(4h)(4i)ALIO-EURO <strong>2011</strong> – 261


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>next terms are nonlinear ones:(x t f − x ∗ttf −∑l=1(x t f − x ∗ttf −∑l=1(y t f − y ∗ttf − ∑ d l f −l=1(y t f − y ∗ttf − ∑ d l f −l=1(x t f − x ∗t−1tf −∑l=1(x t f − x ∗t−1tf −∑l=1(y t f − y ∗t−1tf −∑l=1(y t f − y ∗t−1tf −∑l=1t∑ d l tf − ∑l=1 l=1∑l=1d ∗lf d∗t fdiv γt ft∑ d l tf − ∑ d ∗lf − ε (− d∗t fl=1 l=1div − ε)(1 − γt f )d l t )f − ∑ d ∗lf cos(m t f ) (x t f − x ∗tf + d∗t fl=1div )(1 − γt f )d l t )f − ∑ d ∗lf cos(m ∗tf ) (x t f − x ∗tf − d∗t fl=1div )(1 − γt f )t )d ∗lf sin(m ∗tf ) (y t f − y ∗tf + d∗t ft∑d l t−1f − ∑l=1d l t−1f − ∑l=1d l t−1f − ∑l=1d l t−1f − ∑l=1l=1)d ∗lf)d ∗lf sin(m ∗tf ) (y t − f y∗t)d ∗lf)d ∗lf)d ∗lfcos(m ∗t−1fcos(m ∗t−1fsin(m ∗t−1fsin(m ∗t−1f) (x t f − x ∗t−1f) (x t f − x ∗t−1f) (y t f − y ∗t−1f) (y t − f y∗t−1 fdiv )(1 − γt f )f − d∗t fdiv )(1 − γt f )+ d ∗tf )γ t f− d ∗tf )γ t f+ d ∗tf )γ t f− d ∗tf )γ t f(5a)(5b)(5c)(5d)(5e)(5f)(5g)(5h)(5i)(5j)h t i =h t j =k t i =k t j =(x t i − xt j )s + (yt i − yt j√(x ) t i − xt j )2 + (y t i − yt j )2 − s 2√cos(m ∗t(x t i − xt j ) (x t i − xt j )2 + (y t i − yt j )2 − s 2 − (y t i − yt j )s i ) − sin(m ∗ti )(x t i − xt j )s + (yt i − yt j√(x ) t i − xt j )2 + (y t i − yt j )2 − s 2√cos(m ∗t(x t i − xt j ) (x t i − xt j )2 + (y t i − yt j )2 − s 2 − (y t i − yt j )s j ) − sin(m ∗tj )(9a)(9b)−(x t i − xt j )s + (yt i − yt j√(x ) t i − xt j )2 + (y t i − yt j )2 − s 2√cos(m ∗t(x t i − xt j ) (x t i − xt j )2 + (y t i − yt j )2 − s 2 + (y t i − yt j )s i ) − sin(m ∗ti ) (9c)−(x t i − xt j )s + (yt i − yt j√(x ) t i − xt j )2 + (y t i − yt j )2 − s 2√cos(m ∗t(x t i − xt j ) (x t i − xt j )2 + (y t i − yt j )2 − s 2 + (y t i − yt j )s j ) − sin(m ∗tj ) (9d)Constraints (5) up<strong>da</strong>te the positions in each time period t ∈ T , accordingto the previous changes in velocity ma<strong>de</strong> until the currenttime period. Constraints (6) transform the second term of the objectivefunction in a linear function, since it minimizes an absolutevalue between the covered distances in the initial flight plan andthe covered distances in the resolution. Constraints (7) force somevariables to be into fixed bounds, in or<strong>de</strong>r to calculate the big M inthe position projection constraints. Finally, constraints (8) give thevariables’ type. See [2] and the full version of the paper [5] for a<strong>de</strong>tailed explanation.∀ f ∈ F ,∀t ∈ T : t = {t d f + 1,...,tr f − 1}d t f = d ∗tft∑ d l tf − ∑ d ∗lf β t f (6a)l=1 l=1t∑ d ∗ltf − ∑ d l f β t fl=1 l=1− d∗t fdiv dt f d∗t f+ d∗t fdiv = dt fx t f = x ∗tf − d∗t fdiv cos(m∗t−1 f ) x t f x∗t f + d∗t fdiv cos(m∗t f ) = x t fy t = y ∗tf f − d∗t fdiv sin(m∗t−1 f ) y t f y∗t f + d∗t fdiv sin(m∗t f ) = y t f(6b)(7a)(7b)(7c)5. ALGORITHMIC APPROACHFor solving iteratively the linearized mo<strong>de</strong>l, the algorithmic approach<strong>de</strong>scribed in Almiñana et al. (2007) [6] in a different contextinspires the work presented in this paper. It is based on aiterative optimization by starting with the initial configuration andup<strong>da</strong>ting the input parameters where the <strong>de</strong>rivatives are centereduntil a stop criterion is satisfied.First, the nonlinear constraints have to be linearized by using Taylorpolynomials, so the new mathematical expression for each inequality(n = 1,...,4) can be expressed as follows,{() } ( v i and j − ˆv t i and j∂CnCn +∂(v t i and j ,xt i and j ,yt i and j ) x i and j − ˆx t|( ˆv t i and j , ˆxt i and j ,ŷt i and j ) i and jy i and j − ŷ t i and j)∀ f ∈ F ,∀t ∈ T : t = {t d f ,...,tr f } x t f ,y t f ,a t f ∈ R (8a)≤ M(1 − δ n i jt ).v t f ,a t+f ,a t−f ,d t f ,β t f ∈ R + (8b)∀i, j ∈ F : i < j ∧ p i j = 0,∀t ∈ T : t = { max{t d i ,td j } + 1,...,min{tr i ,tr j } − 1}δ 1i jt,δ 2i jt,δ 3i jt,δ 4i jt ∈ {0,1}∀ f ∈ F ,∀t ∈ T : t = {t d f ,...,tr f } : γ t f ∈ {0,1} (8d)The objective function (1) gives the optimization criterion for themo<strong>de</strong>l. It has two terms, one for minimizing the sum of the absolutevalues of the accelerations and the other one for forcing aircraftto return to the initial configuration, where the values of thew 1 and w 2 emphasizes one term over the other. If the second termof the objective function is contemplated, it must be accompaniedby constraints (6). Constraints (2) avoid the velocity and the accelerationto be bigger or smaller than the upper or lower bound,respectively. Constraints (3) up<strong>da</strong>te the covered distance by an aircraftafter the changes in its configuration in time period t ∈ T .Constraints (4) <strong>de</strong>tect and solve the conflicts in the airspace. The(8c)The algorithm for the resolution is presented below:Step 1. Computing the values of the nonlinear constraints ∀i, j ∈F : i < j and ∀ f ∈ F , such that v t i ,vt j ,xt i ,xt j ,yt i ,yt j are replacedwith ˆv t i , ˆvt j , ˆxt i , ˆxt j ,ŷt i ,ŷt j , respectively. In the first iteration,ˆv t i = v∗t i , ˆv t j = v∗t j , ˆxt i = x∗t i , ˆx t j = x∗t j ,ŷt i = y∗t i ,ŷ t j =y ∗tj , dˆt f .Step 2. Solving the mixed zero-one mo<strong>de</strong>l, where the nonlinearconstraints are substituted by its linear approximation. Letd t f be the optimal values of the respective variables.Step 3. Optimality test. If the following condition is satisfied thenstop, the quasi-optimal solution has been obtained. Otherwise,go to Step 4.∑∑f ∈F t∈Twhere ε is a positive tolerance.(d t f − ˆ d t f )2 ≤ εStep 4. Up<strong>da</strong>ting the covered distance, the acceleration, the velocityand the positions and go to Step 1.See [2] and [5] for a <strong>de</strong>tailed explanation.ALIO-EURO <strong>2011</strong> – 262


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Now, the linearized mo<strong>de</strong>l can be solved by using the optimizationengine of choice for mixed 0–1 linear mo<strong>de</strong>ls. Unfortunately,the resolution time for large-scale instances could be very big dueto the exponential complexity of the Branch and Bound (B&B)schemes in MILP.In view of the situation, one alternative consists of implementing ametaheuristic scheme able to get a good solution in an affor<strong>da</strong>bletime. The characteristics of this problem make the optimality tobe asi<strong>de</strong>, being the feasibility of the solution the first goal. Thescheme of choice has been published by Lazić et al. (2010) [7], socalled “Variable Neighbourhood Decomposition Search” (VNDS).6. COMPUTATIONAL RESULTSFor comparison purposes of the VNDS solution quality, Table 1shows the objective function values by plain use of the engineCPLEX [8] for optimizing the VCTP mo<strong>de</strong>l according to the schemepresented in section 5 and, in<strong>de</strong>pen<strong>de</strong>ntly, by using the VNDSscheme for a large testbed, where the headings are as follows:• Case: Case configuration: C-AAA <strong>de</strong>notes number of aircraft(AAA) in conflict.• z ip : Optimal solution by using the exact algorithm.• z V NDS : Best solution obtained by using VNDS.• GAP V NDS : z V NDS−z ipz V NDS%• t ip : Time (secs.) to obtain the z ip value.• t V NDS : Best time (secs.) to obtain the best solution by usingVNDS.Table 1 shows the small GAP between the optimal solution obtainedby the above exact algorithm and the best solution obtainedby the VNDS scheme. The resolution times are notably shorter byusing the metaheuristic scheme instead of the exact one. See [2],[5] and [9] for an extensive computational experience.7. CONCLUSIONSA mixed 0–1 nonlinear optimization mo<strong>de</strong>l has been presented inor<strong>de</strong>r to solve the collision avoi<strong>da</strong>nce for the ATM problem. Dueto four hard nonlinear constraints, the mo<strong>de</strong>l has been linearized byusing iteratively Taylor polynomials approximations since no optimizationengine to solve mixed integer nonlinear mo<strong>de</strong>ls couldsolve the problem, as we know. Solving the mo<strong>de</strong>l by successivemixed 0–1 linear approximations could require a non affor<strong>da</strong>bletime. However, a metaheuristic based on VNS has been implemented,getting good solutions (GAP less than 1%) in short time.8. ACKNOWLEDGEMENTSThis work is partially supported by i-Math Ingenio Mathematica.This work has been carried out within the Framework ofATLANTIDA project, partially fun<strong>de</strong>d by the Spanish CDTI, inwhich the Rey Juan Carlos University is collaborating with GMVAerospace and Defence S.A. This research has been also partiallyCase z ip z VNDS GAP VNDS (%) t ip t VNDSC-025 0.0227 0.0228 0.44 19.95 2.02C-035 0.0315 0.0318 0.94 54.54 8.73C-045 0.0457 0.0459 0.44 116.55 18.90Table 1: Results comparisonsupported by the project PLANIN MTM2009-14087-C04-01 fromMinistry of Science and Innovation, Spain.The authors would like to thank N. Mla<strong>de</strong>nović and J. Lazić, whosemetaheuristic VNDS scheme has helped to obtain the good computationalresults that have been reported.9. REFERENCES[1] J. K. Kuchar and L. C. Yang, “A review of conflict <strong>de</strong>tectionand resolution mo<strong>de</strong>ling methods,” IEEE Transactionson Intelligent Transportation Systems, vol. 1, pp. 179–189,2000.[2] F. J. Martín-Campo, “The collision avoi<strong>da</strong>nce problem:Methods and algorithms,” Ph.D. dissertation, Department ofStatistics and Operational Research, Rey Juan Carlos University,2010.[3] L. Pallottino, E. Feron, and A. Bicchi, “Conflict resolutionproblems for air traffic management systems solved withmixed integer programming,” IEEE Transactions on IntelligentTransportation Systems, vol. 3, no. 1, pp. 3–11, 2002.[4] A. Alonso-Ayuso, L. F. Escu<strong>de</strong>ro, and F. J. Martín-Campo, “Collision avoi<strong>da</strong>nce in the air traffic management:A mixed integer linear optimization approach,” IEEETransactions on Intelligent Transportation Systems, DOI:10.1109/TITS.2010.2061971, 2010.[5] A. Alonso-Ayuso, L. F. Escu<strong>de</strong>ro, and F. J. Martín-Campo,“A mixed 0-1 nonlinear approach for the collision avoi<strong>da</strong>ncein atm: Velocity changes through a time horizon,” To be submitted,<strong>2011</strong>.[6] M. Alminana, L. F. Escu<strong>de</strong>ro, M. Lan<strong>de</strong>te, J. F. Monge,A. Rabasa, and J. Sánchez-Soriano, “On a mixed 0-1 separablenonlinear approach for water irrigation scheduling,”IIE Transactions, vol. 44, no. 4, pp. 398–405, 2008.[7] J. Lazić, S. Hanafi, N. Mla<strong>de</strong>nović, and D. Urošević, “Variableneighbourhood <strong>de</strong>composition search for 0-1 mixed integerprograms,” Computers & Operations Research, vol. 37,pp. 1055–1067, 2010.[8] IBM ILOG, CPLEX v12.1. User’s Manual for CPLEX, 2009.[9] A. Alonso-Ayuso, L. F. Escu<strong>de</strong>ro, and F. J. Martín-Campo, “Variable neighbourhood <strong>de</strong>composition search forthe mixed 0-1 nonlinear collision avoi<strong>da</strong>nce problem,” To besubmitted, <strong>2011</strong>.ALIO-EURO <strong>2011</strong> – 263


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Low Energy Scheduling with Power Heterogeneous Multiprocessor SystemsRichard Dobson ∗Kathleen Steinhöfel ∗∗ King’s College London,Department of Informatics{richard.dobson, kathleen.steinhofel}@kcl.ac.ukABSTRACTIn this paper we consi<strong>de</strong>r low energy scheduling for power heterogeneousmultiprocessor systems. This is a fast <strong>de</strong>veloping area thatis of great importance and is currently being researched by both industryand aca<strong>de</strong>mia. This problem is of great importance becausereal life multiprocessor computer systems are often heterogeneousat run time. We have <strong>de</strong>veloped an algorithm which transformsany multiprocessor system into a Virtual Single Processor (VSP).Using our VSP platform, existing techniques can be explored forlow energy scheduling for heterogeneous multiprocessor scheduling.In this study we focus on applying algorithms which minimise∑Flow + Energy in conjunction with our VSP approach. ∑Flow +Energy have been shown to be very useful in real life situations.Keywords: Virtual Single Processor, Dynamic Speed Scaling, Energy,Heterogeneous Multiprocessor Systems, Low Energy Scheduling1. INTRODUCTIONEnergy consumption is an important consi<strong>de</strong>ration when <strong>de</strong>signingcomputer systems; especially when for use as a mobile <strong>de</strong>vice suchas a smart phone, which needs to provi<strong>de</strong> both high performanceand good battery life. One of the largest drains of energy in acomputer system is the processor(s). In most mo<strong>de</strong>rn processorsenergy consumption and processing speed are intrinsically linked,this is normally through the relationship Power = Speed a wherea is a constant which differs between processors but is typicallybetween 2 and 3. A good way of reducing the amount of energya processor uses is to lower the operational speed, for this we useDynamic Speed Scaling (DSS).DSS allows the operating frequency of a processor to be modifie<strong>da</strong>t runtime. This means that we do not have to run the processor atmaximum speed and waste energy when <strong>de</strong>mand is low. DSS is atool which enables the speed of the processor(s) to be lowered butwe need to combine this with an objective function which controlshow much the processors should be slowed down by. This is byno means a trivial task as we need to ensure that energy is notneedlessly wasted processing jobs too quickly; but we also needto ensure that jobs are completed by at time such that they are notcompromising the operation of the computer system or the userexperience.Multiprocessor computer systems have become increasingly commonin the past few years with the most common of these beinghomogeneous multiprocessor systems; where all processors areequal. The general consensus between aca<strong>de</strong>mics and industryis that the way forward is heterogeneous multiprocessor systems;where processors are not all equal. Heterogeneous multiprocessorsystems are often favourable over homogeneous options due toResearch partially supported by EPSRC Grant No. EP/G501483/1 andNokia Ltd.their flexibility. They can have many low powered processors forlow priority tasks or if the <strong>de</strong>mand is low and a collection of highpowered processors to <strong>de</strong>al with high priority jobs or to relive pressureif there is a large amount of jobs. This structure has the potentialto provi<strong>de</strong> much greater performance for the user and consumeconsi<strong>de</strong>rably less energy than an equivalent homogeneoussystem. We also often find that a homogeneous multiprocessorsystem which implements DSS is heterogeneous at run time.We address the problem of trying to combine DSS with a heterogeneousmultiprocessor system to provi<strong>de</strong> a high level of performanceand energy efficiency.1.1. Related WorkYao et. al. [9] presented a solution which assumed all jobs have ahard <strong>de</strong>adline, the processor is then ran at a speed such that all jobsare processed before their <strong>de</strong>adlines. Job <strong>de</strong>adlines do not alwayssolicit a feasible schedule unless we allow the maximum speed ofthe processor to be infinite or we restrict the set of incoming jobs.Some years later Albers and Fujiwara [1] presented an alternativesolution which does not rely on <strong>de</strong>adlines but attempts to balancethe quality of service against power consumption. The authorsattempt to minimise ∑Flow + Energy, this balances the two conflictingvalues of Energy against ∑Flow where the Flow of eachjob is the time between it’s release and completion. This meansthat there is always a feasible solution without the need to haveinfinite speed processors or restricted job sets. The best algorithmfor this problem has been <strong>de</strong>veloped by Andrew et al [2] and obtainsa competitive ratio of (2 + ε). Andrew et al also show thatthere exists some tra<strong>de</strong> off functions for which no algorithm canbe better than (2)Competitive.Lam et al exten<strong>de</strong>d existing knowledge of single processor F+Ealgorithms to the homogeneous multiprocessor situation in theirpaper [7]. The authors looked at the online problem and suggeste<strong>da</strong>n algorithm Classified Round Robin (CRR) where jobs aredistributed fairly evenly between all processors using a weightedround robin; individual processors are then left to manage theirown speed using a single processor F+E minimisation algorithm.Less than a year later Lam et al released [8] which presents aslightly improved version of the algorithm.Most recently Gupta et al [5] <strong>de</strong>veloped an algorithm for heterogeneousmultiprocessor systems. They approached the problem(which was formalised by Bower et al [4]) in a similar way to Lamet al has approached the homogeneous multiprocessor problem.Instead of distributing jobs as evenly as possible they aim to distributejobs to the processor which will provi<strong>de</strong> the least increasein ∑weighted flow. The algorithm is outlined below.1. Job Selection (which job should be ran on each processor):Highest Density First2. Speed Scaling (what speed should each processor run at):The speed is set so the power is the fractional weight of theALIO-EURO <strong>2011</strong> – 264


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>unfinished jobs3. Assignment (which processor should each job be assignedto): A new job is assigned to the processor that results inthe least increase in the projected future weighted flow, assumingthe adopted speed scaling and job selection policies,and ignoring the possibility of jobs arriving in the futureIn the paper the authors prove this algorithm to be "scalable forscheduling jobs on a heterogeneous multiprocessor with arbitrarypower functions to minimize the objective function of weightedflow plus energy".2. LOW ENERGY SCHEDULING FORHETEROGENEOUS MULTIPROCESSOR SYSTEMSIn [5] the authors present a solution for power heterogeneous multiprocessorsystems using weighted Flow + Energy as their objectivefunction. This solution has been shown to be theoreticallysound for the mo<strong>de</strong>l which they use. If we consi<strong>de</strong>r a real lifemultiprocessor computer system we often find that there are constraintswhich are applied to the processors. For example manymulti-core processors require the cores to always run at the samespeed. The Gupta et. al approach is not compatible with this situationas the processor speed is linked to the number of jobs.The Gupta et al algorithm also requires a large amount of runtimecomputation. Each time a job needs to be assigned to a processor itmust find out which processor will provi<strong>de</strong> the smallest increase inthe weighted flow. This requires a small amount of processing forjust one job but over time this could potentially add up to a largeamount of computation. We present a solution that has the abilityto overcome both of these issues.Our solution suggests that we should form a ’Virtual Single Processor’(VSP). The VSP is essentially a collection of processorswhich have been combined together in an efficient way to formwhat appears to be a single processor. We then present the VSP (asa single processor) to a DSS algorithm which controls the speed ofthe overall VSP and specifies which job should be processed first.The VSP in turn translates the overall VSP speed into speeds foreach processor which when combined equal the VSP speed.3. THE VIRTUAL SINGLE PROCESSORWe begin by <strong>de</strong>fining the term ’system speed’, that is the combinedprocessing power of all processors. We wish to use the leastamount of energy possible for any system speed. To achieve thiswe control the processor speed at system level and allow the processorsto request a new job when they have completed their currentjob.We consi<strong>de</strong>r an example of a 4 processor system (P0, P1, P2, P3)where each processor has a finite set of speeds and a simple powerfunction in the form of P = S a , the attributes of which are outlinedbelow.• P0 (0, 200,300,400, 500) a=2.3• P1 (0, 600, 700,800,900) a=2.35• P2 (0, 100, 300, 500, 700, 900) a=2.5• P3 (0, 1200) a=2.2We <strong>de</strong>fine a combination to be a set of processing speeds (one foreach processor); each combination has a system speed which is thesum of the individual speeds. For example {200,600,100,1200} isa valid combination with a system speed of 2100. This means itcan process 2100 cycles worth of work per second. In the simpleFigure 1: A graph showing the relationship between system spee<strong>da</strong>nd power consumption for the best case and worst case processorcombinations.1.6E+091.4E+091.2E+091E+0980000000060000000040000000020000000000 2000 4000 6000 8000 10000 12000 14000Figure 2: A graph showing the relationship between system spee<strong>da</strong>nd power consumption including energy spikes and optimal systemspeeds marked by dotssystem outlined above there are 300 unique combinations many ofthese provi<strong>de</strong> the same system speed. For example there are 14different combinations which make up the overall system speed of1400 alone.If we consi<strong>de</strong>r the worst and best ways of achieving the systemspeed of 1400 (with regards to energy) we find that the worst caseuses 487% of the energy consumed by the most efficient combination.If we look at Figure 1 we can see the difference between themost and least energy is largest in mid range speeds with the graphconverging at either end of the system speed range. This simpleexample highlights how crucial it is to make the correct processorspeed selections and this is where the VSP method stems from.The VSP method pre-computes the optimal processor combinationsuch that no other combination of processors provi<strong>de</strong>s the samesystem speed but uses less energy. To find out which combinationsare best we first compute all processor combinations, theseare then or<strong>de</strong>red by system speed. We then compare all of the combinationswith the same system speed and discard all apart from thecombination with the lowest energy usage. Now we have the listof optimal combinations for each system speed although this is notalways good enough.Using a technique from [3] we can ’simulate’ any speed between 0and S max by alternating between two speeds in different ratios. Forexample if we wanted to simulate the processor speed of 4 with asingle processor which can only operate at either speed 3 or speed5 we could run the processor for half the time at 3 and half the timeat 5. This would average out at speed 4. We can use this methodto ensure that we are always using the optimal amount of energy atall times. Figure 2 shows how we can use this technique to lowerthe overall energy consumption of the system for some speeds.The VSP essentially provi<strong>de</strong>s a level of abstraction between thesingle processor algorithm and the multiprocessor system. Thisabstraction allows us to hi<strong>de</strong> the complexity of the multiprocessorsystem behind the VSP front. We can hi<strong>de</strong> complex requirementsby computing a sub VSP where all processor combinations adhereLowHighOptALIO-EURO <strong>2011</strong> – 265


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>to the requirements of the system. For example if two cores mustalways operate at i<strong>de</strong>ntical speed then we only accept combinationswhere this is the case. We then feed this sub VSP into theoverall VSP as if it were a processor.By computing the VSP we can also remove the bur<strong>de</strong>n of calculatingwhich processor is best for each job as in [5]. This is ma<strong>de</strong>possible as the VSP hi<strong>de</strong>s the fact that more processors exist andonly assigns jobs when a processor has a speed greater than 0 andno job. This means that there is no need for the algorithm to performcostly calculations at run time.3.1. Formal DescriptionWe first <strong>de</strong>fine U to be a set of processing units where each processorU x has two properties U x S which is the set of valid processingspeeds (which may or may not contain simulated speeds) andU x P(s) which is a function which given an input of a valid speed sreturns e the energy used per second by this speed.To transform individual processors into a VSP we first combine theprocessors to form a set of all possible combinations C = U 0 S ×U 1 S.... ×U n−1 S where n is the total number of processing units.We <strong>de</strong>fine each combination C i to have 2 functions C i E which isthe energy consumption of all processors used in the combinationand C i S which is the overall speed of all processors in the combination.We then remove inefficient combinations to leave a streamlinedcollection of efficient processor combinations.We <strong>de</strong>fine C ′ to be a new set which contains only the combinationswhich adhere to the following rules.∀aC a ∈ C ′ iff∀bC a E ≤ C b E where C a S = C b Sand¬∃C c ∈ C ′ where C c S = C a STo form the VSP (V 0 ) we need to extract the available speeds andthe power function from C ′ .V 0 S = {C ′ 0 S,...,C′ nS}∀V 0 P(V 0 S a ) = C ′ aEThere is one final step we must perform to ensure optimal VSPperformance; we must remove speeds which are not efficient.⎛ (( ) ) ⎞V0 S x −V 0 S x−1V 0 S x+1 −V 0 S x−1×V 0 P(V 0 S x+1 )∀ 0


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>140.00%6. CONCLUSION120.00%100.00%80.00%60.00%40.00%20.00%In this paper we present the Virtual Single Processor approachto low energy scheduling for power heterogeneous multiprocessorcomputer systems. We show that the VSP approach is theoreticallysound and can be used with existing technologies soliciting strongresults. In future papers we hope to expand this area by lookingat variations of the VSP system that are split into sub VSPs somethat are <strong>de</strong>signed to handle large / sequential jobs and others forsmaller parallel jobs.0.00%BasicRemoveOutliersBasicRemoveOutliersBasicRemoveOutliersBasicRemoveOutliers7. REFERENCESFlow Energy ∑Flow + Energy Finish TimeFigure 3: A graph showing the average results from the simulationsand the average results with outliers removedprocessors, 2x medium powered processors and 4x high poweredprocessors.We use the processor configuration from [6] where x = 1. 1 ×High speed processor: s = {0,1000,2000}a = 2.8, 2 × mediumspeed processors: s = {0,250,500,750,1000}a = 2.55, 4 × lowspeed processor: s = {0,50,100,150,200,250}a = 2.25. We splitthe test <strong>da</strong>ta into 3 different categories all of which contain jobswith random weights and sizes:• Immediate; all jobs are all released at time 0• Uniformly random; jobs are released randomly over timeup to time x• Peaks and troughs; jobs are released in surges which aresimilar to the action of a computer system up to time xCovering these categories gives us a good indication of overall systemperformance.After running our simulations we found some very interesting results.We report the values below in terms of VSP performancein comparison to the Gupta et. al. algorithm. Upon extractingthe <strong>da</strong>ta from the simulations we found that there were some jobswhich were severely throwing the average finish time for both theVSP and Gupta algorithms. The few jobs which have been removedhave both a very low weight and a very large size. Afterremoving these we recalculate the averages and have reported thisin Figure 3.The results prove to be very interesting. They show us that the VSPplatform is very promising; this has allowed us to combine theGupta et. al. algorithm with the VSP platform and achieve verycompetitive results. Although the energy consumption is larger,the ∑Flow + Energy is lower. This is the aim of the algorithm, tominimise the ∑Flow + Energy so this has been very successful.[1] Albers, S. and Fujiwara, H.: Energy-Efficient Algorithms forFlow Time Minimization. ACM Transactions on Algorithm,3(4), 49 (2007)[2] Andrew, L. and Wierman, A. and Tang, A.: Optimal speedscaling un<strong>de</strong>r arbitrary power functions. ACM SIGMET-RICS Performance Evaluation Review 37(2), 39–41 (2009)[3] Bansal, N. and Chan, H.L. and Pruhs, K.: Speed Scalingwith an Arbitrary Power Function. <strong>Proceedings</strong> of the NineteenthAnnual ACM-SIAM Symposium on Discrete Algorithms,693–701 (2009)[4] Bower, F.A. and Sorin, D.J. and Cox, L.P.: The Impact ofDynamically Heterogeneous Multicore Processors on ThreadScheduling. IEEE Micro 28(3), 17–25 (2008)[5] Gupta, A. and Krishnaswamy, R. and Pruhs, K.: ScalablyScheduling Power-Heterogeneous Processors. InternationalConference on Green Computing, 165-173, InternationalConference on Green Computing, 2010[6] Gupta, A. and Krishnaswamy, R. and Pruhs, K.: NonclairvoyantlyScheduling Power-Heterogeneous Processors. <strong>Proceedings</strong>of the 37th International Colloquium of Automata,Languages and Programming, 312–323 (2010)[7] Lam, Tak-Wah and Lee, Lap-Kei and To, Isaac K. K.and Wong, Pru<strong>de</strong>nce W. H.: Competitive Non-migratoryScheduling for Flow Time and Energy. <strong>Proceedings</strong> of thetwentieth annual symposium on Parallelism in algorithmsand architectures, 256–264 (2008)[8] Lam, Tak-Wah and Lee, Lap-Kei and To, Isaac K. K. andWong, Pru<strong>de</strong>nce W. H.: Improved multi-processor schedulingfor flow time and energy. Journal of Scheduling, November,Online First (2009)[9] Yao, F. and Demers, A. and Shenker, S.: A schedulingmo<strong>de</strong>l for reduced CPU energy. <strong>Proceedings</strong> of the 36thAnnual Symposium on Foun<strong>da</strong>tions of Computer Science(FOCS’95) 374–382 (1995)ALIO-EURO <strong>2011</strong> – 267


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>A linear programming approach for a<strong>da</strong>ptive synchronization of traffic signalsPablo Coll ∗ Pablo Factorovich ∗ Irene Loiseau ∗∗ Departamento <strong>de</strong> ComputaciónFacultad <strong>de</strong> Ciencias Exactas y Naturales,Universi<strong>da</strong>d <strong>de</strong> Buenos Aires{pecoll,pfactoro,irene}@dc.uba.arABSTRACTAs traffic congestion during rush hours is a growing problem formost cities, there is an increasing need for more effective managingtraffic signal control and traffic assignment systems. Wepresent here a new a<strong>da</strong>ptive system based on a linear programmingmo<strong>de</strong>l for the signal control problem, having as objectiveto minimize the total length of the queues of cars waiting at eachcorner. The mo<strong>de</strong>l is inten<strong>de</strong>d to be fed with traffic informationprovi<strong>de</strong>d by real-time sensors installed at each intersection. In or<strong>de</strong>rto compare the performance of our program with that of thecurrent scheduling <strong>de</strong>signed by the transit office of Buenos Airescity, we used a traffic simulation system and real traffic flow <strong>da</strong>taof a pilot area of the city. Preliminary results are very promising.Keywords: Urban traffic control, A<strong>da</strong>ptive signal control, Signaltiming optimization, Linear programming1. INTRODUCTIONTraffic signal control are systems for synchronizing the timing ofany number of traffic signals in a certain area.Despite the researchdone in the field of urban traffic control systems since around five<strong>de</strong>ca<strong>de</strong>s, there is still an increasing need for more effective managingof traffic control systems. None of the proposals we were ableto found in the literature solves the complete optimization problemfor big urban areas. As this is a very difficult problem authorshandle only part of the problem, or propose hierarchical mo<strong>de</strong>lsthat divi<strong>de</strong> the problem in parts that are addressed in<strong>de</strong>pen<strong>de</strong>ntly.Some studies focus on small regions or in a single intersection.Main <strong>de</strong>cisions in signal control strategies at urban areas inclu<strong>de</strong><strong>de</strong>termining the duration of the complete cycle and the duration ofgreen lights at each direction of every intersection. Several mathematicaland computational approaches have been proposed, mostof them based on heuristics. Existing exact mo<strong>de</strong>ls for traffic signalcontrol are very limited in scope but they are useful for providinginsight into the problem and examining the performance ofheuristics.Most of the currently implemented traffic control systems may begrouped into three principal categories:• Fixed time strategies, that are <strong>de</strong>rived off-line by use ofco<strong>de</strong>s based on historical <strong>da</strong>ta.• Traffic-responsive strategies make use of real-time measurementsto calculate in real time the suitable signal settings.• Predictive strategies are based on off-line and on-line information.It is no possible to survey here all the work done in this vast researcharea, so we will mention only a few selected references.A review on traffic control strategies can be found at [1]. AlsoCheng et al.[2], Dotoli et al. [3] and Wey[4] present an overviewof available traffic control methods.Some of the approaches are already implemented in real life whileother reflect work still at the research and <strong>de</strong>velopment stage.Among the commercial systems we can mention TRANSYT [5]which was first <strong>de</strong>veloped by Robertson [5] and was substantiallyimproved later. It uses historical information and computes signalcontrol schemes off-line. SCOOT [6, 7] inclu<strong>de</strong>s a networkmo<strong>de</strong>l that is fed with real <strong>da</strong>ta and is run repeatedly to investigatethe effect of incremental changes of splits, offsets and cycle times.Changes are implemented if they show to be beneficial. RODHES[8], PRODYN [9] and OPAC [10] <strong>de</strong>veloped more rigorous mo<strong>de</strong>lbasedtraffic responsive strategies. RODHES and PRODYN solvein real time optimization problems employing dynamic programmingand OPAC employs exhaustive enumeration. So the three ofthem are real-time feasible for only one intersection and they endwith a <strong>de</strong>centralized optimal strategies coordinated heuristically bya superior layer. In TUC [11, 12] a store-and-forward strategy isimplemented. The main i<strong>de</strong>a is to simplify the mo<strong>de</strong>l in or<strong>de</strong>r tobe able to <strong>de</strong>scribe the traffic flow process without using discretevariables. The optimization part of the system requires to solve aquadratic programming problem.Wey [4] presents an integer linear mo<strong>de</strong>l for the network wi<strong>de</strong> signaloptimization problem and a modification of the network simplexalgorithm to solve it. The proposal is tested in a five intersectionsarea and compared with exact solution to the MIP. Lo[13] mo<strong>de</strong>ls the traffic flow conditions using a cell transmissionmo<strong>de</strong>l (CTM) based in hydrodynamic concepts. The resultingmo<strong>de</strong>l for the dynamic signal-control problem is a mixed-integerlinear programming program. A two intersections network is usedto <strong>de</strong>monstrate applicability of the formulation. Lin and Wang[14] propose an enhanced 0-1 formulation based also on the CTMmo<strong>de</strong>l. Also He et al. [15] propose heuristics based on the linearrelaxation of the mo<strong>de</strong>l for solving CTM based MIP formulations.They tested the approach on examples of one or two intersections.Barisone et al. [16] propose an elaborated real-time nonlinear optimizationmo<strong>de</strong>l. They report having successfully tested it in aurban area of Genova consisting of 18 links. Dotoli et al. [3] modifythis mo<strong>de</strong>l to take into account the presence of pe<strong>de</strong>strians, differentlevels of traffic consgetion, vehicle classification, etc. Theircase study is an area of two consecutive intersections with heavytraffic. Cheng et al.[2] presents a parallel algorithm for the problemof finding optimal coordinate signal timing plans for a largearea based on game theory. They test their algorithm in a real areaof 75 intersections, and they claim that using a thousand CPUŽsthey found a signal planning in less than 10 minutes. Aboudulaset al.[17] propose a methodology based on store-and-forwardtraffic mo<strong>de</strong>l, mathematical optimization and optimal control forreal-time signal control in congested large-scale urban traffic networks.In [18] authors <strong>de</strong>scribe a genetic algorithm applied to the coordinationof signals in an urban network based on real-time trafficinformation. They evaluate they approach on a 12 intersectionsarea, using the CORSIM traffic simulation software, and reportthat they reduce average <strong>de</strong>lay times in a 15%. There are sev-ALIO-EURO <strong>2011</strong> – 268


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>eral other heuristics approaches <strong>de</strong>veloping algorithms based ongenetic algorithms, ant colony optimization, Markov processes orneural networks.Systems already implemented in real life can be evaluated throughthe improvement of traffic conditions in cities or areas were theyare installed. Other approaches are difficult to compare betweenthem, as they show results in different small real or generated adhoccase studies.Our goal was to <strong>de</strong>velop an automatic system that <strong>de</strong>termines inreal-time the cycles of the traffic signals in a region of the city,based on information provi<strong>de</strong>d by sensors installed at each upstreamcorner. The system is continuously fed with informationprovi<strong>de</strong>d by these sensors. Based on the results of the optimization,the planning is automatically modified.Optimization is done by means of a linear programming mo<strong>de</strong>lhaving as objective to minimize the total length of the queues ofcars waiting at each corner. Part of our mo<strong>de</strong>l was <strong>de</strong>rived fromsome of the equations of the pioneer Robertson mo<strong>de</strong>l for the platoondispersion ([5], see also [4]). This mo<strong>de</strong>l represents vehiclesmoving in platoons that flow from one corner to the other till theyare out of the area . We a<strong>da</strong>pted the equations <strong>de</strong>scribing the numberof vehicles waiting at the queues, but equations relating neighborintersections are not inclu<strong>de</strong>d in our mo<strong>de</strong>l. We also ad<strong>de</strong>dconstraints that bind the red and green lights at each intersection.Bounds that prevent sud<strong>de</strong>n changes on the length of the cyclesand the green lights, are also <strong>de</strong>fined.Several cycles of the traffic signals are consi<strong>de</strong>red at each run ofthe optimization program.2. MODEL FORMULATIONIn this section we <strong>de</strong>fine the variables, parameters and constraintsand we <strong>de</strong>scribe the LP mo<strong>de</strong>l in <strong>de</strong>tail. We consi<strong>de</strong>r an area thatconsists of a set N = {1...N} of intersections. At each intersectionsthere is a variable number of J(n) directions from whichtraffic flow arrives, so there are J(n) signals that have to be regulated.A cycle of the traffic signal is the sum of the lasting times of thegreen and "not green" phases. The temporal horizon of the mo<strong>de</strong>lis a pre<strong>de</strong>termined number of complete cycles of the signals H ={1...H}. The optimization program is run with the information ofthe last H cycles. When new <strong>da</strong>ta is received from the sensors theol<strong>de</strong>st cycle is discar<strong>de</strong>d. Each phase (green and "not green") ofthe cycle is partitioned in i<strong>de</strong>ntical sets I = {1...I} of time intervals.The <strong>de</strong>cision variables are the duration of each cycle at eachcorner and the duration of the green lights at each direction. Themo<strong>de</strong>l implicitly calculates an offset that represents the beginningof each cycle related to an arbitrary initial time.We need to distinguish which are the cycles starting in green andwhich are those starting in red, so we <strong>de</strong>fineRS = {(n, j)/cycle at (n,j) starts in red}.As there are several types of intersections in a city, we need toprovi<strong>de</strong> the following <strong>de</strong>finitions. Let JV R(n, j) be the set of directionsrelated to the intersection n, for which its greens lightsdo not overlap and jointly <strong>de</strong>fine the time when light of direction(n, j) is red. The simplest and more typical situation is an intersectionwith two crossing directions j and k. In this case we haveJV R(n, j) = {k} and JV R(n,k) = { j} and if (n, j) ∈ RS, we willhave to force R h n, j = V n,k h ; otherwise, Rh+1n, j = Vn,k h . Analogously,let JVV (n, j) be the set of directions related to the intersection nin which green light times do not overlap and jointly <strong>de</strong>fine thetime when light (n, j) is green. Here a typical situation is relatedto the two opposite directions in a two way street, which are usuallyin green simultaneously. Let us note that JVV (n, j) can be anempty set for some j, but j have to belong to some JV R(n,k) orJVV (n,k) for another direction k. Finally, let JRR(n, j) be a setof directions related to the intersection n, in which red light timesare non overlapped and jointly <strong>de</strong>fine the time when light of (n, j)direction is red.2.1. VariablesVariables related to time are:• Vn, h j : length of the green at intersection n ∈ N , directionj ∈ J(N) and cycle h ∈ H• R h n, j : length of “not green” at intersection n ∈ N , directionj ∈ J(N) and cycle h ∈ H (n, j)Then Cn h = R h n, j + V n, h j for any direction ( j,n) ∈ RS, is the totallength of cycle h ∈ H at intersection n ∈ N . If (n, j) ∈ RS, variableR h+1n, j appears instead of R 1 n, j at the mo<strong>de</strong>l. We <strong>de</strong>fine for each(n, j) the set of in<strong>de</strong>xes h, H (n, j), to represent these situations.Variables related to number of vehicles are:• LV h,in, j : queue length during green light at intersection n ∈N , direction j ∈ J(n), cycle h ∈ H and interval i ∈ I• LR h,in, j : queue length during red ("not green") light at intersectionn ∈ N , direction j ∈ J(n), cycle h ∈ H (n, j) andinterval i ∈ I• Q h,in, j : outcoming flow of vehicles at intersection n ∈ N ,direction j ∈ J(n), cycle h ∈ H and interval i ∈ I .All variables of the mo<strong>de</strong>l are allowed to take non integer values.2.2. Coefficients• As interval lasting R h,in, j is a result of the optimization, ÊRh,in, jis an estimation of the information that should be given bysensors at direction j intersection n, cycle h, interval i duringred light. This estimation is based on values obtainedfrom the sensors after executing the planning obtained fromthe last run of the mo<strong>de</strong>l.h,i• ÊVn, j is the same estimation for green lights.• CI n, j represents the length of the queue at the beginning ofthe period to be optimized.• S n, j is the flow capacity of a street section (in number ofcars).• α is a positive parameter of the objective function.2.3. The mo<strong>de</strong>lThe linear programming formulation we propose is the following:N J(n) H Imin ∑ ∑ ∑ ∑ LR h,i h,in, j + LV n, j − αQh,i n, jn=1 j=1 h=1 i=1subject to the following constraints ∀n ∈ N , j ∈ J(n):ALIO-EURO <strong>2011</strong> – 269


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>̂LR h,1 h−1,I ER h,1n, jn, j = LV n, j + R h n, j ∀h ∈ H (n, j),h ≥ 2 (1)ILV h,1n, j = LR h,In, jh,1ÊVn, j+ÎLR h,in, j = ER h,iLRh,i−1 n, jn, j + R h n, jIV h n, j − Qh,1 n, j ∀h ∈ H ∩ H (n, j) (2)∀h ∈ H (n, j),i ∈ {2...I}̂LV h,i h,i−1 EV h,in, jn, j = LV n, j + Vn, h jIn, j ∀h ∈ H ,i ∈ {2...I} (4)̂LV 1,1 EV 1,1n, jn, j = CI n, j + Vn, 1 jIn, j ∀ (n, j) /∈ RS (5)̂LR 1,1n, j = CI ER 1,1n, jn, j + R 1 n, jI∀ (n, j) ∈ RS (6)(3)Q h,in, j ≤ S n, jVn, h j (7)IConstraint (7) sets a bound, based on street capacity, for the outgoingflow from direction j at intersection n. Constraints (8) and(9) impose smoothness in the changes of light schedules, which isvery important, not only for safety, but also to guarantee the qualityof estimators ÊRh,i h,in, j and ÊVn, j . (10) and (11) are bounds on lightduration and constraints (12) to (15) establish the necessary coordinationamong traffic lights in different directions at each corner.Depending on each intersection structure, some of the equations(12) to (15) can be redun<strong>da</strong>nt. For the sake of clarity, we choseto write the mo<strong>de</strong>l as above instead of using a more complicatednotation that could have avoid redun<strong>da</strong>ncy in all cases.3. EXPERIMENTAL RESULTSSolution was implemented as a C library that takes care of communicationbetween hub, sensors and signals and calls the LP solver.The hub sends sensors information to the library and receives nextplaning for every semaphore.As the sensors have not been installed yet, we tested our mo<strong>de</strong>l bymeans of a microscopic traffic simulation software package TSIS-CORSIM [19](Figure 1).− δ V ≤ Vn, h h−1j −Vn, j ≤ δ V (8)− δ R ≤ R h n, j − Rh−1 n, j ≤ δ R (9)Figure 1: A<strong>da</strong>ptive System Scheme∑V Min ≤ V h n, j ≤ V Max (10)R Min ≤ R h n, j ≤ R Max (11)R h n, j = Vn,k h ∀h ∈ H , (n, j) ∈ RS (12)k∈JV R(n, j)∑R h n, j = Vn,k h−1 ∀h ∈ H (n, j), (n, j) /∈ RS (13)k∈JV R(n, j)∑Vn, h j = Vn,k h ∀h ∈ H (14)k∈JVV (n, j)∑R h n, j = R h n,k ∀h ∈ H (n, j) (15)k∈JRR(n, j)In or<strong>de</strong>r to compare the performance of our program with the currentscheduling <strong>de</strong>signed by experts from the transit office of BuenosAires city, we used real traffic flow <strong>da</strong>ta of a pilot area of the city(7 intersections, some one direction streets, some avenues).(Figure2) We have tested our mo<strong>de</strong>l in eight different scenarios obtainedVn, h j , Rh n, j , LRh,i h,in, j , LV n, j , Qh,i n, j ≥ 0 (16)The objective is to minimize the sum of the queue lengths and tomaximize the car flow in the area. α is a parameter of the mo<strong>de</strong>linclu<strong>de</strong>d to force the car outgoing flow at each corner, in or<strong>de</strong>r tomake equation (7) as tight as possible.Constraints (1) to (6) <strong>de</strong>fine the new queue lengths for every intervalby adding to the previous length the car arrivals and subtractingthe outgoing flow of vehicles. These equations are inspired in partof the Robertson mo<strong>de</strong>l. However, instead of using information ofneighbor intersections, arrivals are computed multiplying estimationsof number of arriving cars per second by the length of theinterval. Constraints (1) and (2) <strong>de</strong>fine the length of the queues forintervals i = 1, which is the first time interval for the current lightcolor. Constraints (3) and (4) compute the length of the queue incontiguous intervals corresponding to the same light color. Finally,(5) and (6) set initial values of the queues.Figure 2: Pilot areavarying green lights lower bounds and traffic flow rates.Preliminary results show that our optimization tool outperformsthe current system reducing the average queue length in 4,17% atrush hours. Also on tests with 50% and 25% of the current trafficflow, an improvement of 16,49% and 32.72 o % respectively, wasobtained. Besi<strong>de</strong> this we have tried with the following alternateflow: 150% of rush hour flow in N and E direction and 66% in theS and W. Results of experiments with green lights lower bounds of30 sec. can be seen at (Figure 3).Parameters used to perform the tests were: α = 0.033, H = 2, I =5, CI n, j taking values from simulation using current lights plan andS n, j = 2 for every (n, j) except for those corresponding to VernetAvenue, in which case the value is 4 since the other streets has 2lanes and Vernet has 4.ALIO-EURO <strong>2011</strong> – 270


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Figure 3: Preliminary results4. CONCLUSIONSWe <strong>de</strong>veloped an a<strong>da</strong>ptive system based on a linear programmingmo<strong>de</strong>l for the synchronization of traffic signals at an urban area.The system gets real-time traffic information from sensors installe<strong>da</strong>t each intersection. According to the results of the optimizationthe system will automatically coordinate traffic lights duration inan area of the city. Preliminary results show that this proposal canbe integrated on an efficient tool for traffic congestion management.As the optimization is done by means of an LP program (nointeger variables), computational times are affor<strong>da</strong>ble in the frameof a real time system. This is the main advantage of our proposal.5. REFERENCES[1] M. Papageorgiou, C. Diakaki, V. Dinopoulou, A. Kotsialos,and Y. Wang, “Review of road traffic control strategies,” <strong>Proceedings</strong>of the IEEE, vol. 91, no. 12, pp. 2043–2067, 2003.[2] S.F. Cheng, M.A. Epelman, and R.L. Smith, “Cosign: a parallelalgorithm for coordinated traffic signal control,” IEEETransactions on Intelligent Transportation Systems, vol. 7,no. 4, pp. 551–564, 2006.[3] M.Dotoli, M. Fanti, and C. Meloni, “A signal timing planformulation for urban traffic control,” Control EngeneeringPractice, vol. 14, pp. 190–192, 2006.[4] W.M. Wey, “Mo<strong>de</strong>l formulation and solution algorithm oftraffic signal control in an urban network,” Computers, Environmentand Urban Systems, vol. 2, pp. 355–377, 2000.[5] D.I. Robertson, “Transyt method for area traffic control,”Traffic Engeneering and Control, vol. 10, pp. 276–281, 1969.[6] P.B. Hunt, D.I. Robertson, R.D. Bretherson, and M.C. Royle,“The scoot on-line traffic signal optimization technique,”Traffic Engeneering and Control, vol. 23, pp. 190–192, 1982.[7] D. Bretherton, M. Bodger, and N. Baber, “Scoot-the future,”in <strong>Proceedings</strong> of the 12th IEE International Conferenceon Road Transport Information and Control, London, UK,2004, pp. 301–306.[8] P. Mirchan<strong>da</strong>ni and L. Head, “Rodhes: a real-time trafficsignal control system:architecture, algorithms and analysis,”Transportation Research Part C, vol. 9, no. 6, pp. 415–432,2004.[9] J.LFarges, J. Henry, and J. Tufal, “The prodyn real-timetraffic algorithm,” in <strong>Proceedings</strong> 4th IFAC Symposium onTransportation Systems, 1983, pp. 307–312.[10] N.H. Gartner, “Opac: a <strong>de</strong>mand -responsive strategy for trafficsignal control system:architecture, algorithms and analysis,”Transportation Research Record, vol. 906, pp. 75–84,1983.[11] C. Diakaki, M. Papageorgiou, and K. Aboudolas, “A multivariateregularor approach to traffic-responsive networkwi<strong>de</strong>signal control,” Contro engeneering Parctice, vol. 10,pp. 183–195, 2002.[12] V. Dinopoulou, C. Diakaki, and M. Papageorgiou, “Applicatonsof the urban traffic control strategy tuc,” EuropeanJournal of Operational Research, vol. 175, pp. 1652–1665,2006.[13] H. Lo, “A cell-based traffic control formulation: strategiesand benefits of dynamic timing plans,” Transportation Science,vol. 35, no. 2, pp. 148–164, 2001.[14] W.H. Lin and C. Wang, “An enhanced 0-1 mixed-integer lpformulation for traffic signal control,” IEEE Transactions onIntelligent Transportation Systems, vol. 5, no. 4, pp. 238–245, 2004.[15] Q. He, W. Lin, H. Liu, and L. Head, “Heuristic algorithmsto solve 0-1 mixed integer lp formulations for traffic signalcontrol problems,” in <strong>Proceedings</strong> 2010 IEEE InternationalConference on Service Operations and Logistics and Informatics(SOLI 2010), Qing<strong>da</strong>o, China, 2010, pp. 118–124.[16] A. Barisone, D. Giglio, R. Minciardi, and R. Poggi, “Amacroscopic traffic mo<strong>de</strong>l for real-time optimization of signalizedurban areas,” in <strong>Proceedings</strong> of the 41st IEEE Conferenceon Desion and Control, Las Vegas, USA, 2002, pp.900–903.[17] K. Aboudolas, M. Papageorgiou, and E. Kosmatopoulos,“Store and forward based methods for the signal controlproblem in large-scale congested urban road networks,”Transportation Research Part C, vol. 17, pp. 163–174, 2009.[18] L. Zang and L. Jia, “Mo<strong>de</strong>lling and simulation of trafficsignal control for urban network,” in <strong>Proceedings</strong> of theIEEE Third Internatonal Symposium on Intelligent InformationTechnoloty Application, China, 2009, pp. 253–256.[19] http://mctrans.ce.ufl.edu/featured/tsis.ALIO-EURO <strong>2011</strong> – 271


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>ALIO–EURO <strong>2011</strong> – 272


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>List of AuthorsAgra, Agostinho, 51, 142, 221Alma<strong>da</strong>-Lobo, Bernardo, 177Álvarez, A<strong>da</strong>, 181Álvarez-Valdés, Ramón, 67, 118Alvelos, Filipe, 251Alves, Cláudio, 64Amorim, Pedro, 177Anghinolfi, Davi<strong>de</strong>, 79Ayuso, Antonio Alonso, 253Baier, Horst, 87Baños, Raul, 17, 109Barbosa-Póvoa, Ana Paula, 199, 203Bianchi-Aguiar, Teresa, 130Bing, Xiaoyun, 205Birgin, Ernesto G., 112Bloemhof, Jacqueline, 205Bolufé Röhler, Antonio José, 71Borschbach, Markus, 13Boschetti, Marco A., 71Brandão, Filipe, 115Brás, Pedro, 64Brito, José, 29, 97Brito, Luciana, 97Caceres, Jose, 126Cantão, Luiza Amalia Pinto, 224Captivo, M. Eugénia, 174Cardona-Valdés, Yajaira, 181Cardoso, Sónia R., 203Carlsson, John Gunnar, 91Carravilla, Maria Antónia, 58, 130, 215Carvalho, Margari<strong>da</strong>, 146Cer<strong>de</strong>ira, Jorge Orestes, 221Chaudron, M.R.V., 21Christiansen, Marielle, 142Coelho, Igor Machado, 45, 138Coelho, Vitor Nazario, 45Coll, Pablo, 261Constantino, Miguel, 4, 228Correia, Paulo <strong>de</strong> Barros, 8Costa, Dulce, 1Dabia, Said, 134De Causmaecker, Patrick, 105<strong>de</strong> Giovanni, Luigi, 37De Kok, Ton, 134Delgado, Alexandrino, 142Dell’Amico, Mauro, 75Desrosiers, Jacques, 164Dias, Joana M., 166Dobson, Richard, 257Doostmohammadi, Mahdi, 51Dubois-Lacoste, Jérémie, 232El-Sourani, Nail, 13Emmerich, M.T.M., 21Escu<strong>de</strong>ro, Laureano F., 156, 160, 253Etemaadi, R., 21Factorovich, Pablo, 261Falavigna, Simone, 75Faulin, Javier, 126Fernán<strong>de</strong>z, Antonio, 17, 109Ferreira, Brígi<strong>da</strong> C., 166Ferreira, Eduar<strong>da</strong> Pinto, 60Florentino, Helenice <strong>de</strong> Oliveira, 95Gamboa, Dorabela, 49Garín, M. Araceli, 156, 160Gatica, Gustavo, 101Gesteira, Claudio Martagão, 195Gil, Consolacion, 17, 109Gomes, A. Miguel, 60Gomes, Maria Isabel, 199Grasman, Scott, 126Günther, Hans-Otto, 177Had<strong>da</strong>d, Matheus Nohra, 45Henggeler Antunes, C., 1Iori, Manuel, 75Juan, Angel A., 122, 126Junqueira, Leonardo, 58Kosuch, Stefanie, 83Lan<strong>da</strong>-Silva, Dario, 217Leitner, Markus, 243Li, R., 21Lobato, Rafael D., 112Lodi, Andrea, 5Loiseau, Irene, 261Lopes, Isabel Cristina, 41Lopes, Maria do Carmo, 166López-Ibánez, Manuel, 232Luna, Henrique Pacca Loureiro, 195Maciel, Renan S., 152Maculan, Nelson, 97Maniezzo, Vittorio, 71Markenzon, Lilian, 25Marques, Inês, 174Márquez, Antonio L., 17, 109Martín Campo, Francisco Javier, 253Martínez Sykora, Antonio, 67Martins, A. Gomes, 1Martins, Isabel, 228Massi, Gionata, 37Merino, María, 156Mesquita, Marta, 211Metrane, Ab<strong>de</strong>lmoutalib, 164Miran<strong>da</strong>, Vladimiro, 152Montenegro, Flávio, 97Möring, Rolf, 1Montoya, M.G., 109Morabito, Reinaldo, 58, 112Moz, Margari<strong>da</strong>, 211Munhoz, Pablo Luiz Araújo, 45Neto, Teresa, 228


Proc. of the VII ALIO–EURO – Workshop on Applied Combinatorial Optimization, Porto, Portugal, May 4–6, <strong>2011</strong>Nogueira, Teresa, 150Nordström, Tomas, 247Obit, Joe Henry, 217Ochi, Luiz Satoru, 29, 45, 97, 138Oliveira, José Fernando, 58, 130, 215Oliveira, Marisa, 60Pacheco, Joaquín, 181Padilha-Feltrin, Antonio, 152Paias, Ana, 211Paolucci, Massimo, 79Paquete, Luís, 236Para<strong>da</strong>, Víctor, 101Parra, M., 109Parragh, Sophie N., 170Pato, Margari<strong>da</strong> Vaz, 95, 174, 211Pedroso, João Pedro, 4, 115, 146, 185, 228Perales, Rosa Colomé, 208Pereira, Paulo R.C., 25Pérez, Gloria, 156, 160Peruyero, Esteban, 122Pezzella, Ferdinando, 37Pfetsch, Marc E., 37Pinho <strong>de</strong> Sousa, Jorge, 239Poss, Michael, 191Raack, Christian, 191Raidl, Günther R., 243Ramos, Tania Rodrigues Pereira, 199Raymond, Vincent, 164Rego, César, 49Relvas, Susana, 203Requejo, Cristina, 33, 221Ribas, Sabir, 138Riera, Daniel, 122, 126Riezebos, Jan, 188Rinaldi, Giovanni, 37Rocha, Humberto, 166Rocha, Marta, 215Roffilli, Matteo, 71Romeijn, H. Edwin, 54Ronconi, Debora P., 2Romero Morales, Dolores, 54Røpke, Stefan, 134Rosa, Mauro <strong>de</strong>, 152Ruthmair, Mario, 243Santos, Dorabella, 251Santos, Eulália, 33Santos, José Luís, 236Santos, Nicolau, 185Saraiva, João, 146Schmid, Verena, 170Semaan, Gustavo Silva, 29Sepúlve<strong>da</strong>, Mauricio, 101Silva, Marcos <strong>de</strong> Melo <strong>da</strong>, 45Silva, Ricardo Coelho, 224Soto, Juan Pablo, 208Soumis, Francois, 164Sousa, Amaro <strong>de</strong>, 251Souza, Marcone Jamilson Freitas, 45, 138Steinhöfel, Kathleen, 257Stützle, Thomas, 232Subramanian, Anand, 138Tamarit, Jose Manuel, 67, 118Thiell, Marcus, 208Torralba Fernan<strong>de</strong>s, Jessica Pillon, 8Towhidi, Mehdi, 164Unzueta, A., 160Valério <strong>de</strong> Carvalho, José, 41, 64van <strong>de</strong>n Heuvel, Wilco, 54van <strong>de</strong>r Vorst, Jack, 205Van Woensel, Tom, 134Van<strong>de</strong>n Berghe, Greet, 105Vaz, Daniel, 236Ventura, Paolo, 37Verstichel, Jannes, 105Viana, Ana, 4, 239Villa, Fulgencia, 118Waga, Christina F.E.M., 25Wagelmans, Albert P.M., 54Wolkerstorfer, Martin, 247Xavier, Adilson Elias, 195Yamakami, Akebo, 224Yevseyeva, Iryna, 239Zepe<strong>da</strong>, J. Alejandro, 101Zhang, Yang, 87

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!