MARC보기
LDR00000nam u2200205 4500
001000000434744
00520200227102534
008200131s2019 ||||||||||||||||| ||eng d
020 ▼a 9781687927200
035 ▼a (MiAaPQ)AAI27536154
035 ▼a (MiAaPQ)umichrackham002164
040 ▼a MiAaPQ ▼c MiAaPQ ▼d 247004
0820 ▼a 658
1001 ▼a Chen, Weidong.
24510 ▼a Online Learning Algorithms for Stochastic Inventory and Queueing Systems.
260 ▼a [S.l.]: ▼b University of Michigan., ▼c 2019.
260 1 ▼a Ann Arbor: ▼b ProQuest Dissertations & Theses, ▼c 2019.
300 ▼a 160 p.
500 ▼a Source: Dissertations Abstracts International, Volume: 81-04, Section: B.
500 ▼a Advisor: Duenyas, Izak
5021 ▼a Thesis (Ph.D.)--University of Michigan, 2019.
506 ▼a This item must not be sold to any third party vendors.
506 ▼a This item must not be added to any third party search indexes.
520 ▼a The management of inventory and queueing systems lies in the heart of operations research and plays a vital role in many business enterprises. To this date, the majority of work in the literature has been done under complete distributional information about the uncertainties inherent in the system. However, in practice, the decision maker may not know the exact distributions of these uncertainties (such as demand, capacity, lead time) at the beginning of the planning horizon, but can only rely on realized observations collected over time. This thesis focuses on the interplay between learning and optimization of three canonical inventory and queueing systems and proposes a series of first online learning algorithms.The first system studied in Chapter II is the periodic-review multiproduct inventory system with a warehouse-capacity constraint. The second system studied in Chapter III is the periodic-review inventory system with random capacities. The third system studied in Chapter IV is the continuous-review make-to-stock M/G/1 queueing system. We take a nonparametric approach that directly works with data and needs not to specify any (parametric) form of the uncertainties. The proposed online learning algorithms are stochastic gradient descent type, leveraging the (sometimes non-obvious) convexity properties in the objective functions. The performance measure used is the notion of cumulative regret or simply regret, which is defined as the cost difference between the proposed learning algorithm and the clairvoyant optimal algorithm (had all the distributional information about uncertainties been given). Our main theoretical results are to establish the square-root regret rate for each proposed algorithm, which is known to be tight. Our numerical results also confirm the efficacy of the proposed learning algorithms.The major challenges in designing effective learning algorithms for such systems and analyzing them are as follows. First, in most retail settings, customers typically walk away in the face of stock-out, and therefore the system is unable to keep track of these lost-sales. Thus, the observable demand data is, in fact, the sales data, which is also known as the censored demand data. Second, the inventory decisions may impact the cost function over extended periods, due to complex state transitions in the underlying stochastic inventory system. Third, the stochastic inventory system has hard physical constraints, e.g., positive inventory carry-over, warehouse capacity constraint, ordering/production capacity constraint, and these constraints limit the search space in a dynamic way.We believe this line of research is well aligned with the important opportunity that now exists to advance data-driven algorithmic decision-making under uncertainty. Moreover, it adds an important dimension to the general theory of online learning and reinforcement learning, since firms often face a realistic stochastic supply chain system where system dynamics are complex, constraints are abundant, and information about uncertainties in the system is typically censored. It is, therefore, important to analyze the structure of the underlying system more closely and devise an efficient and effective learning algorithm that can generate better data, which is then feedback to the algorithm to make better decisions. This forms a virtuous cycle.
590 ▼a School code: 0127.
650 4 ▼a Operations research.
650 4 ▼a Industrial engineering.
690 ▼a 0796
690 ▼a 0546
71020 ▼a University of Michigan. ▼b Industrial & Operations Engineering.
7730 ▼t Dissertations Abstracts International ▼g 81-04B.
773 ▼t Dissertation Abstract International
790 ▼a 0127
791 ▼a Ph.D.
792 ▼a 2019
793 ▼a English
85640 ▼u http://www.riss.kr/pdu/ddodLink.do?id=T15494210 ▼n KERIS ▼z 이 자료의 원문은 한국교육학술정보원에서 제공합니다.
980 ▼a 202002 ▼f 2020
990 ▼a ***1008102
991 ▼a E-BOOK