Global ETD Search

1	Online stochastic algorithms / Algorithmes stochastiques en ligne
Li, Le
27 November 2018

This thesis addresses three main topics. The first is online clustering, for which we introduce a new adaptive stochastic algorithm to cluster online datasets. It relies on a quasi-Bayesian approach, with a dynamic (i.e., time-dependent) estimation of the unknown and changing number of clusters. We prove that this algorithm has a regret bound of the order of and is asymptotically minimax under the constraint on the number of clusters. A reversible jump MCMC (RJMCMC) implementation is also proposed.

The second topic is the sequential learning of principal curves, which seeks to summarize a sequence of data by a continuous polygonal curve. To this end, we introduce a procedure based on the maximum a posteriori (MAP) of the Gibbs quasi-posterior, which yields polygonal lines whose number of segments is chosen automatically. We show that the regret bounds of this procedure and of its adaptive version have remainder terms that are sublinear in the time horizon T. In addition, we present a greedy local search implementation that incorporates both sleeping experts and multi-armed bandit ingredients.

The third topic covers practical tasks carried out at iAdvize, the company that supports this thesis: sentiment analysis of text messages, using classical methods from text mining and statistics, and the implementation of a chatbot based on natural language processing and artificial neural networks.

Keywords: Online learning; Online clustering; Quasi-Bayesian; Minimax regret bound; Reversible Jump Markov Chain Monte Carlo; Sequential principal curve

510
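The quasi-Bayesian idea in the first topic can be illustrated with a much simpler setting than the one in the thesis. The sketch below is not the thesis's algorithm: it assumes a small, fixed, finite list of candidate codebooks (sets of cluster centers, possibly of different sizes) and a uniform prior, and it replaces the RJMCMC sampler with exact sampling from the Gibbs quasi-posterior, whose weights are proportional to prior times exp(-lam * cumulative loss). The function names, the parameter `lam`, and the squared-distance loss are illustrative assumptions.

```python
import math
import random


def sq_dist_to_nearest(x, centers):
    """Squared Euclidean distance from point x to its nearest center."""
    return min(sum((a - b) ** 2 for a, b in zip(x, c)) for c in centers)


def gibbs_weights(cum_losses, lam, prior=None):
    """Gibbs quasi-posterior over a finite candidate set.

    Weight of candidate i is prior[i] * exp(-lam * cum_losses[i]), normalised.
    """
    n = len(cum_losses)
    prior = prior or [1.0 / n] * n
    shift = min(cum_losses)  # subtract the minimum for numerical stability
    raw = [p * math.exp(-lam * (loss - shift)) for p, loss in zip(prior, cum_losses)]
    total = sum(raw)
    return [r / total for r in raw]


def online_clustering(stream, candidates, lam=1.0, seed=0):
    """Process points one at a time; return (cumulative loss, final weights)."""
    rng = random.Random(seed)
    cum_losses = [0.0] * len(candidates)
    incurred = 0.0
    for x in stream:
        # Choose a codebook from the current quasi-posterior before seeing x.
        weights = gibbs_weights(cum_losses, lam)
        chosen = rng.choices(range(len(candidates)), weights=weights)[0]
        incurred += sq_dist_to_nearest(x, candidates[chosen])
        # Once x is revealed, update the cumulative loss of every candidate.
        for i, centers in enumerate(candidates):
            cum_losses[i] += sq_dist_to_nearest(x, centers)
    return incurred, gibbs_weights(cum_losses, lam)
```

Because the candidates may contain different numbers of centers, the weights also express a time-dependent preference over the number of clusters, which is the role the dynamic estimate plays in the abstract.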
2	利用混合模型估計風險值的探討 (A study of Value at Risk estimation using mixture models)
阮建豐
Unknown Date

Value at Risk (VaR) is usually calculated under the assumption that the underlying asset returns are normally distributed, but this assumption is inconsistent with the actual distribution of asset returns. Many researchers have found that actual return distributions are fat-tailed, meaning that extreme events occur far more often than the normal assumption implies. VaR computed under the normal assumption is therefore a poor measure of actual losses.

To address this problem, this thesis models the return distribution with three methods: historical simulation, the variance-covariance method, and a mixture normal model. VaR is then estimated from each at a given confidence level. The parameters of the mixture normal model are estimated with both quasi-Bayesian maximum likelihood estimation (MLE) and the EM algorithm. Finally, three VaR evaluation methods (back test, forward test, and binomial test) are used to compare the three estimation methods.

The empirical results are as follows:
1. At a left-tail critical probability of 1%, the return distribution shows a clearly fat tail.
2. The mixture normal distribution captures the fat tail at the 1% left-tail critical probability more accurately than the other two methods.
3. The kurtosis of the mixture normal model is close to the kurtosis of the actual return distribution, which confirms that the mixture normal model can capture leptokurtosis.

Keywords: Value at Risk (VaR); Fat tail; Historical simulation method; Variance-covariance method; Mixture normal distribution; Quasi-Bayesian MLE; EM algorithm; Back test; Forward test; Leptokurtosis
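The three estimation methods compared in this abstract can be sketched for a single asset as follows. This is a minimal illustration under stated assumptions, not the thesis's implementation: the mixture has two components, it is fitted with plain EM plus a variance floor (the quasi-Bayesian MLE variant is not reproduced), the data come from a hypothetical simulator `simulate_fat_tailed_returns`, and the binomial check is a one-sided exact test on the number of VaR breaches. All function names and default values are illustrative.

```python
import math
import random
from statistics import NormalDist, mean, pstdev


def simulate_fat_tailed_returns(n=2000, seed=0):
    """Synthetic returns: calm regime (sd 1%) with occasional turbulence (sd 5%)."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 0.05 if rng.random() < 0.1 else 0.01) for _ in range(n)]


def var_historical(returns, alpha=0.01):
    """Historical simulation: negated empirical alpha-quantile of the returns."""
    ordered = sorted(returns)
    return -ordered[max(0, math.ceil(alpha * len(ordered)) - 1)]


def var_normal(returns, alpha=0.01):
    """Variance-covariance method for one asset: negated normal alpha-quantile."""
    return -NormalDist(mean(returns), pstdev(returns)).inv_cdf(alpha)


def fit_mixture_em(returns, iters=100, var_floor=1e-10):
    """Plain EM for a two-component normal mixture; returns (weights, means, sds)."""
    m, s = mean(returns), pstdev(returns)
    w, mu, sd = [0.5, 0.5], [m, m], [0.5 * s, 2.0 * s]  # narrow and wide start
    for _ in range(iters):
        comps = [NormalDist(mu[k], sd[k]) for k in range(2)]
        # E-step: responsibility of each component for each observation.
        resp = []
        for x in returns:
            p = [w[k] * comps[k].pdf(x) for k in range(2)]
            total = sum(p) or 1e-300
            resp.append([pk / total for pk in p])
        # M-step: re-estimate weights, means and standard deviations.
        for k in range(2):
            nk = max(sum(r[k] for r in resp), 1e-12)
            w[k] = nk / len(returns)
            mu[k] = sum(r[k] * x for r, x in zip(resp, returns)) / nk
            var = sum(r[k] * (x - mu[k]) ** 2 for r, x in zip(resp, returns)) / nk
            sd[k] = math.sqrt(max(var, var_floor))
    return w, mu, sd


def var_mixture(w, mu, sd, alpha=0.01):
    """Negated alpha-quantile of the fitted mixture, found by bisection on its CDF."""
    comps = [NormalDist(m, s) for m, s in zip(mu, sd)]
    lo, hi = min(mu) - 10 * max(sd), max(mu) + 10 * max(sd)
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if sum(wk * c.cdf(mid) for wk, c in zip(w, comps)) < alpha:
            lo = mid
        else:
            hi = mid
    return -0.5 * (lo + hi)


def binomial_pvalue(n, k, alpha=0.01):
    """P(X >= k) for X ~ Binomial(n, alpha): are k breaches in n days too many?"""
    def log_pmf(i):
        return (math.lgamma(n + 1) - math.lgamma(i + 1) - math.lgamma(n - i + 1)
                + i * math.log(alpha) + (n - i) * math.log(1 - alpha))
    return sum(math.exp(log_pmf(i)) for i in range(k, n + 1))
```

To use it, fit on an estimation window, compute the three VaR figures, count the days in a test window whose return falls below the negated VaR, and pass that count to `binomial_pvalue`. A small p-value suggests the method understates tail risk.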
