201

Border Crossing Modeling and Analysis: A Non-Stationary Dynamic Reallocation Methodology For Terminating Queueing Systems

Moya, Hiram August 2012
The United States international land boundary is a volatile, security-intensive area. In 2010, combined trade among North American nations was $918 billion, with 80% transported by commercial trucks. Over 50 million commercial vehicles cross the Texas/Mexico border every year, not including private vehicles and pedestrian traffic, between Brownsville and El Paso, Texas, through one of over 25 major border crossings called "ports of entry" (POE). Recently, securing the southwest border against terrorist intervention, undocumented immigration, and the illegal flow of drugs and guns has taken precedence over the need to process people, goods and traffic efficiently and effectively. Increasing security and inspection requirements are seriously affecting transit times. Each POE is configured as a multi-commodity, prioritized queueing network which rarely, if ever, operates in steady state. The problem is therefore to balance a reduction of wait time and its variance against POE operating costs and the sustainment of a security level. The contribution of the dissertation is three-fold. The first contribution applies queueing theory to the border crossing process to develop a methodology that decreases border wait times without increasing costs or affecting security procedures. The outcome is the Dynamic Reallocation Methodology (DRM). Currently at the POE, inspection stations are fixed and can inspect only one truck type, either FAST or Non-FAST program participants. The methodology proposes movable servers that, once a threshold is met, can be switched to serve the other type of truck. Particular emphasis is given to inspection (service) times under time-varying arrivals (demands). The second contribution is an analytical model of the POE used to analyze the effects of the DRM. DRM benefits are first evaluated assuming Markovian service times. However, field data and other research suggest a general distribution for service time, so a Coxian k-phase approximation is implemented. The DRM is analyzed under this new baseline using the expected number in the system and cycle times. A variance reduction procedure is also proposed and evaluated under DRM. Results show that queue length and wait time are reduced by 10 to 33% depending on load, while FAST wait time increases by less than three minutes.
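The threshold idea behind the DRM can be illustrated with a minimal sketch. The abstract does not say how the threshold is defined, so the rule below (switch one movable booth when the gap between the two queue lengths reaches a threshold) is an assumption made for illustration only; the names Booths, reallocate and min_per_type are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Booths:
    fast: int      # inspection booths currently serving FAST trucks
    non_fast: int  # inspection booths currently serving Non-FAST trucks


def reallocate(booths, fast_queue, non_fast_queue, threshold, min_per_type=1):
    """Move one booth toward the longer queue when the gap reaches the threshold.

    Assumed rule (not taken from the dissertation): compare queue lengths and
    never leave a truck type with fewer than `min_per_type` booths.
    """
    gap = non_fast_queue - fast_queue
    if gap >= threshold and booths.fast > min_per_type:
        return Booths(booths.fast - 1, booths.non_fast + 1)
    if -gap >= threshold and booths.non_fast > min_per_type:
        return Booths(booths.fast + 1, booths.non_fast - 1)
    return booths  # threshold not met, keep the current allocation
```

For example, reallocate(Booths(2, 4), 1, 9, threshold=5) shifts one booth from FAST to Non-FAST service, while a gap below the threshold leaves the allocation unchanged.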
202

Abordagens de modelos de filas com abandono para análise de congestão em Call Centers / Queueing model approaches with abandonment for congestion analysis in call centers

Ferrari, Sidney Carlos 22 June 2016
No funding was received / This work analyzes queueing systems for call centers in which a customer may abandon the system before being served because of impatience. Call centers are service organizations that predominantly serve customers by telephone and are considered a particular kind of contact center, which serves customers through phone, fax, e-mail, chat, mobile devices and other communication channels. From a business point of view, the main concern in managing a call center is to offer quality service at minimum cost. Customers express their perception of service quality through, for example, abandonment of the queue, which is considered one of the most important operational measures for evaluating call center performance. This work aims to present and apply analytical queueing models with abandonment, represented by generic probability distributions (particularly mixed distributions), as an effective approach to analyzing congestion in call center systems and to supporting sizing and operating decisions for these systems. Two studies were conducted with data extracted from the call center of a company located in the interior of São Paulo State, Brazil, and from another located abroad, in Israel. The parameters (e.g., arrival rate, service rate, abandonment rate) and some performance measures (e.g., average waiting time, waiting probability, abandonment probability and traffic intensity) were determined from these data. These sample measures were compared with the same measures obtained from the analytical queueing models M/M/c+G, M/Gc/1+G and M/G/c+G considered in this research, using the empirically obtained parameters and mixed, Exponential, Fatigue Life, Normal and Lognormal distributions to represent the abandonment (patience) of users. In some cases, depending on the performance measure considered, the queueing models with mixed abandonment distributions gave better results (smaller deviations from the real data) than their counterparts with non-mixed distributions. It was also observed that, independently of the performance measures analyzed, the call centers may be represented by an analytical queueing model with a mixed distribution for abandonment times, which behaves better than the others. There was no evidence that mixed distributions for abandonment times were the best on every performance measure analyzed, but they were always competitive. An experimental discrete simulation model that properly represents the call center was also used to check the results of the analytical models and to explore alternative scenarios. The same scenarios were considered with the theoretical model, and the performance measures obtained by simulation and by the theoretical models were compared, showing the potential of approaches based on analytical models with abandonment for call center analysis.
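As an illustration of how abandonment enters such a model, the following is a minimal sketch of the simplest special case, M/M/c+M (often called Erlang-A), in which patience is exponential. This is a simplification chosen for brevity: the thesis works with general and mixed patience distributions (the "+G" models), which this sketch does not cover. The function name and the truncation level max_n are assumptions.

```python
def erlang_a(lam, mu, theta, c, max_n=2000):
    """Stationary measures of an M/M/c+M queue via its birth-death chain.

    lam: arrival rate, mu: service rate per agent,
    theta: abandonment rate per waiting customer, c: number of agents.
    The state space is truncated at max_n customers (assumed large enough).
    """
    weights = [1.0]  # unnormalised stationary weights, state 0 first
    for n in range(1, max_n + 1):
        # departures from state n: busy agents finishing plus waiting customers leaving
        death = min(n, c) * mu + max(n - c, 0) * theta
        weights.append(weights[-1] * lam / death)
    total = sum(weights)
    probs = [w / total for w in weights]

    mean_queue = sum((n - c) * p for n, p in enumerate(probs) if n > c)
    p_wait = sum(p for n, p in enumerate(probs) if n >= c)
    # abandonment rate (theta * mean queue length) as a fraction of the arrival rate
    p_abandon = theta * mean_queue / lam
    return {"p_wait": p_wait, "p_abandon": p_abandon, "mean_queue": mean_queue}
```

Comparing such model outputs with the waiting and abandonment probabilities measured in the call center data is the kind of check the abstract describes, here restricted to one patience distribution.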
203

A importância da rapidez de atendimento nos caixas de supermercados : um estudo de caso utilizando um modelo analítico de filas com trocas. / The importance of quick responsiveness in supermarket checkouts: a case study using an analytical queueing model with jockeying.

Rinaldi, José Gilberto Spasiani 29 June 2007
Self-service companies have been concerned with improving their performance in the face of competition, whether to win new markets or simply to stay in business. It is essential for them to offer good-quality products and/or services. One aim of this study was therefore to show that waiting time in supermarket checkout lines is very important for service speed and, consequently, for the level of service perceived by customers. Other aims were to identify the reasons why customers regularly shop at a specific supermarket and the relative importance of each reason compared with the others. This made it possible to contextualize the setting in which customers judge the importance of queues and waiting time. The customers' level of schooling was also associated with the factors that lead them to shop at the store regularly, an interesting contribution because in Brazil income and schooling are related. In addition, queueing models that may satisfactorily represent supermarket checkouts were studied, and the one that best represents the system was identified. For the queueing model with jockeying, it was verified that customers change lines when they notice another line with fewer customers, and the size of this difference was assessed. This model best represented the situation commonly found in supermarkets, and it can provide input for service improvement. To carry out the research, several companies in the city of São Carlos/SP were contacted, and one of those that allowed data collection was selected. This is therefore a case study, with the inferential limitations that implies.
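The jockeying behaviour described above (a customer changes lines once another line is shorter by some amount) can be sketched as a simple rebalancing rule. The abstract does not report the size of the difference that triggers a switch, so the threshold parameter here is a placeholder, and the function name jockey is hypothetical.

```python
def jockey(queue_lengths, threshold):
    """Rebalance checkout lines under a simple jockeying rule.

    While the longest line exceeds the shortest by at least `threshold`
    customers, the last customer of the longest line moves to the shortest.
    A threshold below 2 would make customers swap back and forth forever,
    so it is rejected.
    """
    if threshold < 2:
        raise ValueError("threshold must be at least 2")
    lengths = list(queue_lengths)
    while True:
        longest = max(range(len(lengths)), key=lengths.__getitem__)
        shortest = min(range(len(lengths)), key=lengths.__getitem__)
        if lengths[longest] - lengths[shortest] < threshold:
            return lengths
        lengths[longest] -= 1
        lengths[shortest] += 1
```

With three checkouts holding 5, 1 and 2 customers, a threshold of 2 settles at lines of 3, 3 and 2, whereas a threshold of 3 stops earlier at 4, 2 and 2.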
204

Avaliação do desempenho e cenários alternativos em um samu utilizando o modelo hipercubo estacionário e não-estacionário / Performance and alternative scenarios evaluation on a samu using the stationary and nonstationary hypercube model

Beojone, Caio Vítor [UNESP] 09 October 2017
Funded by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) / Many emergency service systems face daily variations in demand and ambulance availability. In these systems, performance may fluctuate throughout the day because of, for example, changes in the number of servers and in arrival rates, which creates the need to consider such variations explicitly in an extension of the hypercube model not yet explored in the literature. In some of these systems, the best-equipped ambulances are reserved exclusively for life-threatening requests. The dispatch policy can therefore be differentiated in order to reserve the service of some ambulances entirely for more severe requests. These issues add to the random nature of such systems, with uncertainty about ambulance availability, the arrival of a new request and its location. In this context, this study aims to: (i) extend the hypercube queueing model to capture the complete capacity reservation of advanced ambulances, depending on the request classification; (ii) extend the hypercube model to make it more computationally efficient, without loss of precision during modeling and solution; and (iii) propose an approach based on the nonstationary hypercube queueing model to organize the operation of ambulances at any time of the day. To verify the feasibility and applicability of these approaches, a case study is carried out on the SAMU of the city of Bauru (SAMU-Bauru), which, in addition to reserving its advanced ambulances for life-threatening requests, is affected by daily variations in demand and ambulance availability. In addition to the original configuration of SAMU-Bauru, studied in two steps, a total of four alternative scenarios were analyzed that address important questions: the impact of an increase in demand during the most congested period; mitigation of this impact by adding a new ambulance; changing the schedule of daily breaks; and the impact of demand increases at specific hours of the day. Important performance measures were calculated for each scenario, such as workload, mean waiting times and mean response times. Results show that the proposed extensions to the hypercube model can satisfactorily analyze systems such as SAMU-Bauru and make it possible to create and measure improvement proposals at the tactical and operational levels.
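For readers unfamiliar with the hypercube model, the sketch below builds the basic stationary version with no waiting line (calls are lost when all ambulances are busy) and solves it numerically. It is not the extended model of the dissertation: capacity reservation, queues and nonstationary rates are all omitted, and equal service rates are assumed. The function names, the preference-list input format and the power-iteration solver are illustrative choices.

```python
def hypercube_loss(lam, pref, mu, n_servers, iters=5000):
    """Stationary distribution of a basic zero-queue hypercube model.

    lam[j]: call rate of region (atom) j; pref[j]: servers in dispatch order
    for region j; mu: service rate, assumed equal for all servers.
    A state is a bitmask whose bit i is 1 when server i is busy.
    """
    n_states = 1 << n_servers
    rates = [dict() for _ in range(n_states)]  # rates[s][t] = rate from s to t
    for s in range(n_states):
        for j, rate in enumerate(lam):
            # dispatch the first free server in the preference list, if any
            free = next((i for i in pref[j] if ((s >> i) & 1) == 0), None)
            if free is not None:
                t = s | (1 << free)
                rates[s][t] = rates[s].get(t, 0.0) + rate
        for i in range(n_servers):
            if (s >> i) & 1:  # busy server i finishes service
                t = s & ~(1 << i)
                rates[s][t] = rates[s].get(t, 0.0) + mu

    big = sum(lam) + n_servers * mu  # uniformisation constant
    p = [1.0 / n_states] * n_states
    for _ in range(iters):  # power iteration on the uniformised chain
        q = [0.0] * n_states
        for s in range(n_states):
            out = 0.0
            for t, r in rates[s].items():
                q[t] += p[s] * r / big
                out += r
            q[s] += p[s] * (1.0 - out / big)
        p = q
    return p


def workload(p, server):
    """Fraction of time the given server is busy."""
    return sum(prob for s, prob in enumerate(p) if (s >> server) & 1)
```

With two regions of rate 0.5 each, mirrored preference lists and mu = 1, the probability that both ambulances are busy is 0.2, which matches the Erlang loss formula for two servers offered one Erlang.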
205

Pernambuco’s health sector: analysis of queueing problems and an economic growth model

ROCHA, Tamires Taís Bezerra 04 April 2013
An overview of the Brazilian health care system is presented, with an emphasis on the case of Pernambuco state. One central issue in health systems management manifests itself in the general form of long waiting lines, which are analyzed here in this context, including some approaches that have been proposed and implemented in Pernambuco to tackle the problem. An optimal economic growth model highlighting the health sector and then, operating jointly, the health and education sectors is proposed. The results of the Pontryagin Maximum Principle applied to this model show the mutual benefits for both sectors and their effects on community welfare. A case study of queueing systems in Hospital da Restauração (an emergency hospital) in Recife, Pernambuco, is presented.
206

Optimization of Just-in-Time Sequencing Problems and Supply Chain Logistics

Thapa, Gyan January 2015
This dissertation presents a comprehensive and comparative account of progress in sequencing approaches for the mixed-model just-in-time (JIT) sequencing problem together with the discrete apportionment problem (DAP). The goal of the JIT sequencing problem (JITSP) is to keep the rate of usage of parts as constant as possible along the assembly lines, and the goal of the DAP is to divide a given integer number of delegates proportionally among states or parties according to their population or votes. The supply chain logistics problem is also reported here with some real-life applications. The single-level JITSP, known as the product rate variation problem (PRVP), is pseudo-polynomially solvable. The total PRVP minimizes the sum of deviations and the bottleneck PRVP minimizes the maximum deviation between actual production and ideal production. The assignment approach solves the total PRVP, whereas perfect matching solves the bottleneck PRVP in pseudo-polynomial time. The multi-level JITSP, known as the output rate variation problem (ORVP), is NP-hard in most cases. However, some sequencing heuristics and dynamic programming are devised for near-optimal solutions, and the pegging assumption reduces the ORVP to a weighted case of the PRVP. In this dissertation, the total PRVP with squared and absolute deviations is considered, and mean-based divisor methods are devised for the equitably efficient solution. Treating the PRVP and the DAP simultaneously establishes the link between the production sequencing problem and the integer seat allocation problem. The new upper bottlenecks are investigated and the problems are solved comparatively. The bottleneck PRVP instances for small deviations and cyclic sequences for the total PRVP are shown to be optimal. Bicriterion sequencing is discussed with Pareto optimal solutions. The production sequencing problem is treated together with supply chain logistics to balance the overall supply chain system. The cross-docking supply chain logistics problem is formulated with a proposition to be solved. Real-world applications of the JITSP and supply chains are listed, and some open problems are pointed out to close the dissertation.
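The two PRVP objectives mentioned above can be made concrete with a short sketch that evaluates a given production sequence. It only scores a sequence; it does not implement the assignment, matching or divisor methods of the dissertation. The function name and the use of squared deviations for the total objective are choices made for this illustration.

```python
from fractions import Fraction


def prv_deviations(sequence, demands):
    """Score a production sequence against the ideal constant-rate schedule.

    demands[p] is the number of copies of product p to be produced; after k
    units, the ideal cumulative output of p is k * demands[p] / total demand.
    Returns (sum of squared deviations, maximum absolute deviation), i.e. the
    total and bottleneck objectives, as exact fractions.
    """
    total = sum(demands.values())
    produced = {p: 0 for p in demands}
    sum_sq = Fraction(0)
    bottleneck = Fraction(0)
    for k, item in enumerate(sequence, start=1):
        produced[item] += 1
        for p, d in demands.items():
            dev = produced[p] - Fraction(k * d, total)
            sum_sq += dev * dev
            bottleneck = max(bottleneck, abs(dev))
    return sum_sq, bottleneck
```

For demands of two units of A and one of B, the sequence ABA scores (4/9, 1/3) while AAB scores (10/9, 2/3), so the interleaved sequence is smoother under both objectives.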
207

Efektivní využití konvergované sítě / Effective utilization of converged network

Nesveda, Marek January 2011
The Master's thesis deals with converged networks and their effective utilization, focusing specifically on telematic networks. Vehicle telematics was chosen as the field for modelling a specific application. The thesis consists of two series of simulations performed in the Opnet Modeler network simulator. The first theoretical part gives an overview of the protocols used in converged networks, with a more detailed description of routing protocols, and also covers the mechanisms for assuring quality of service (QoS) in IP. The subsequent practical part consists of creating a simulation of a converged telematic network in Opnet Modeler, configuring scenarios for assuring QoS using RSVP and DSCP, and running the simulation itself. The results for both scenarios were compared and presented graphically. The second theoretical part describes queueing disciplines as well as congestion control and congestion avoidance mechanisms. The corresponding practical part compares the FIFO, PQ and WFQ queueing algorithms within the same simulated converged telematic network, using the DSCP scenario, which gave much better results in the first series of simulations. Again, the simulation results were compared and presented graphically.
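To make the difference between the three queueing disciplines concrete, here is a minimal sketch of the order in which each would transmit the same backlog. It is unrelated to the Opnet Modeler configuration used in the thesis. It assumes all packets are already queued at time zero, which lets the weighted fair queueing (WFQ) finish tags be computed without tracking virtual time; packet fields and flow names are hypothetical.

```python
import heapq


def fifo(packets):
    """First in, first out: transmit in arrival order."""
    return [p["id"] for p in packets]


def strict_priority(packets):
    """Priority queueing (PQ): lower `prio` value first, arrival order within a class."""
    heap = [(p["prio"], idx, p["id"]) for idx, p in enumerate(packets)]
    heapq.heapify(heap)
    return [heapq.heappop(heap)[2] for _ in range(len(heap))]


def wfq(packets, weights):
    """WFQ for a backlog present at time zero.

    Each packet gets a finish tag equal to the previous tag of its flow plus
    size / weight; packets are sent in increasing tag order, ties by arrival.
    """
    last_finish = {}
    tagged = []
    for idx, p in enumerate(packets):
        finish = last_finish.get(p["flow"], 0.0) + p["size"] / weights[p["flow"]]
        last_finish[p["flow"]] = finish
        tagged.append((finish, idx, p["id"]))
    return [pid for _, _, pid in sorted(tagged)]


packets = [
    {"id": "d1", "flow": "data", "prio": 1, "size": 300},
    {"id": "v1", "flow": "voice", "prio": 0, "size": 100},
    {"id": "d2", "flow": "data", "prio": 1, "size": 300},
    {"id": "v2", "flow": "voice", "prio": 0, "size": 100},
]
```

On this backlog FIFO keeps the arrival order, PQ sends both voice packets first, and WFQ with weights of 2 for data and 1 for voice interleaves the flows as v1, d1, v2, d2.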
208

Resource management in computer clusters : algorithm design and performance analysis / Gestion des ressources dans les grappes d’ordinateurs : conception d'algorithmes et analyse de performance

Comte, Céline 24 September 2019
The growing demand for cloud-based services encourages operators to maximize resource efficiency within computer clusters. This motivates the development of new technologies that make resource management more flexible. However, exploiting this flexibility to reduce the number of computers also requires efficient resource-management algorithms that have a predictable performance under stochastic demand. In this thesis, we design and analyze such algorithms using the framework of queueing theory. Our abstraction of the problem is a multi-server queue with several customer classes. Servers have heterogeneous capacities and the customers of each class enter the queue according to an independent Poisson process. Each customer can be processed in parallel by several servers, depending on compatibility constraints described by a bipartite graph between classes and servers, and each server applies the first-come-first-served policy to its compatible customers. We first prove that, if the service requirements are independent and exponentially distributed with unit mean, this simple policy yields the same average performance as balanced fairness, an extension of processor-sharing known to be insensitive to the distribution of the service requirements. A more general form of this result, relating order-independent queues to Whittle networks, is also proved. Lastly, we derive new formulas to compute performance metrics. These theoretical results are then put into practice. We first propose a scheduling algorithm that extends the principle of round-robin to a cluster where each incoming job is assigned to a pool of computers by which it can subsequently be processed in parallel. Our second proposal is a load-balancing algorithm based on tokens for clusters where jobs have assignment constraints. Both algorithms are approximately insensitive to the job size distribution and adapt dynamically to demand. Their performance can be predicted by applying the formulas derived for the multi-server queue.
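The service discipline of the multi-server queue described above can be sketched in a few lines: every server gives its whole capacity to the oldest customer it is compatible with, so one customer may be served by several servers at once. The function name, the dictionary-based inputs and the class and server labels are illustrative assumptions; the sketch computes instantaneous service rates only and does not reproduce the thesis's performance formulas.

```python
def service_rates(queue, compatible, capacity):
    """Service rate received by each customer under per-server FCFS.

    queue: customer classes in arrival order (oldest first);
    compatible[cls]: set of servers able to process class cls;
    capacity[server]: service capacity of that server.
    """
    rates = [0.0] * len(queue)
    for server, cap in capacity.items():
        for position, cls in enumerate(queue):
            if server in compatible[cls]:
                rates[position] += cap  # server works on its oldest compatible customer
                break
    return rates
```

With a queue of classes A, B, A, where A is compatible with server s1 only and B with both s1 and s2, capacities of 1.0 and 2.0 give rates of 1.0, 2.0 and 0.0: s1 serves the first A, s2 skips it and serves B, and the last A waits.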
209

Resource management in computer clusters : algorithm design and performance analysis / Gestion des ressources dans les grappes d’ordinateurs : conception d'algorithmes et analyse de performance

Comte, Céline 24 September 2019 (has links)
The growing demand for cloud-based services encourages operators to maximize resource efficiency within computer clusters. This motivates the development of new technologies that make resource management more flexible. However, exploiting this flexibility to reduce the number of computers also requires efficient resource-management algorithms whose performance is predictable under stochastic demand. In this thesis, we design and analyze such algorithms using the framework of queueing theory. Our abstraction of the problem is a multi-server queue with several customer classes. Servers have heterogeneous capacities, and the customers of each class enter the queue according to an independent Poisson process. Each customer can be processed in parallel by several servers, depending on compatibility constraints described by a bipartite graph between classes and servers, and each server applies the first-come-first-served policy to its compatible customers. We first prove that, if the service requirements are independent and exponentially distributed with unit mean, this simple policy yields the same average performance as balanced fairness, an extension of processor-sharing known to be insensitive to the distribution of the service requirements. A more general form of this result, relating order-independent queues to Whittle networks, is also proved. Lastly, we derive new formulas to compute performance metrics. These theoretical results are then put into practice. We first propose a scheduling algorithm that extends the principle of round-robin to a cluster where each incoming job is assigned to a pool of computers by which it can subsequently be processed in parallel. Our second proposal is a load-balancing algorithm based on tokens for clusters where jobs have assignment constraints. Both algorithms are approximately insensitive to the job size distribution and adapt dynamically to demand. Their performance can be predicted by applying the formulas derived for the multi-server queue.
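
The multi-server queue described in this abstract can be illustrated with a minimal sketch. It assumes that each server works on the oldest customer whose class it is compatible with, so a customer receives the combined capacity of its compatible servers that no older customer has claimed. The function name `fcfs_service_rates`, the class and server labels, and the capacities are hypothetical and do not come from the thesis.

```python
from typing import Dict, List, Set


def fcfs_service_rates(
    queue: List[str],
    compatible: Dict[str, Set[str]],
    capacity: Dict[str, float],
) -> List[float]:
    """Return the total service rate received by each customer in `queue`.

    `queue` lists customer classes in arrival order (oldest first).
    `compatible` maps each class to the servers allowed to process it
    (the bipartite compatibility graph), and `capacity` maps each server
    to its service capacity.
    """
    rates = [0.0] * len(queue)
    free = set(capacity)  # servers not yet claimed by an older customer
    for position, cls in enumerate(queue):
        # Servers compatible with this customer and still unclaimed
        claimed = compatible[cls] & free
        rates[position] = sum(capacity[s] for s in claimed)
        free -= claimed
    return rates


# Hypothetical example: two classes, three servers with unequal capacities.
compatible = {"A": {"s1", "s2"}, "B": {"s2", "s3"}}
capacity = {"s1": 1.0, "s2": 2.0, "s3": 1.5}
rates = fcfs_service_rates(["B", "A", "A"], compatible, capacity)
print(rates)
```

In this example the oldest customer (class B) is served in parallel by `s2` and `s3`, the next customer (class A) is left with `s1` only, and the last customer waits with rate zero. The sketch covers only the rate allocation for a given queue state, not the arrival process or the performance formulas mentioned in the abstract.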
