Global ETD Search

141	類典型相關分析及其在免試入學上採計成績之研究 / A canonical correlation analysis type approach to model a criterion for enrolling high school students 卓惠敏, Cho, Hui Min Unknown Date (has links) The twelve-year compulsory education program aims to promote students' balanced development across the five domains of education while attending to both learning quality and everyday conduct in junior high school. Because grading standards and methods differ among schools, counting in-school grades fairly has become an important issue. Dai (2011) used the correlation between a composite in-school subject score and the total BCTEST score to determine the weight of each in-school subject. This study extends that concept and method by also taking the BCTEST scale score of each test subject into account: the subject weights are chosen so that the composite in-school score and the composite BCTEST scale score are as closely related as possible. We hope to find a better model for representing students' overall learning performance and achievement over their three years in school, as a reference for counting in-school grades in admission to senior high school without entrance examinations. 
The research method applies the theory of canonical correlation analysis. However, because the constraints on the weights differ from the requirements of conventional canonical correlation analysis, we call it the canonical correlation analysis type approach. Within this approach, we prove that the optimal weights for the in-school subject scores and the BCTEST subject scale scores can be obtained by first finding the canonical correlation vectors through canonical correlation analysis, then amending them with the Rao-Ghangurad method if necessary, and finally normalizing the resulting nonnegative canonical correlation vectors. This is an extremely convenient way to obtain the optimal weight vectors. In the case study, we found an interesting phenomenon: the optimal weight vectors for the in-school subject scores and the BCTEST subject scale scores are very close, that is, a school subject and a test subject of the same name have almost the same weight. After comparing several composite in-school scores with different weight allocations, we also found that the equal-weighting model commonly used in schools performs quite well. 在校學科分數 國中基測量尺分數 主成份分析 類主成份分析 典型相關分析 類典型相關分析 scores of in-school academic performance principal component analysis canonical correlation analysis
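As a rough illustration of the final normalization step described in entry 141, the sketch below rescales a nonnegative weight vector so that its entries sum to one and checks that this rescaling leaves the correlation between the two composite scores unchanged. The function names, the sample scores and the weight vectors are all hypothetical and are not taken from the thesis; the sketch does not implement canonical correlation analysis itself or the Rao-Ghangurad correction.

```python
import math


def normalize_weights(weights):
    """Rescale a nonnegative weight vector so that its entries sum to 1."""
    if any(w < 0 for w in weights):
        raise ValueError("weights must be nonnegative")
    total = sum(weights)
    if total == 0:
        raise ValueError("at least one weight must be positive")
    return [w / total for w in weights]


def composite(scores, weights):
    """Weighted composite score for each student (one row per student)."""
    return [sum(w * s for w, s in zip(weights, row)) for row in scores]


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical data: three in-school subject scores and three test scale
# scores for four students (invented for illustration only).
school = [[80, 70, 90], [60, 65, 70], [90, 85, 95], [75, 60, 80]]
test = [[55, 50, 58], [40, 45, 47], [59, 57, 60], [50, 42, 52]]

raw_a = [2.0, 1.0, 1.0]  # hypothetical nonnegative vector for school subjects
raw_b = [1.5, 0.5, 2.0]  # hypothetical nonnegative vector for test subjects

a = normalize_weights(raw_a)
b = normalize_weights(raw_b)

# Correlation is invariant to positive rescaling of either composite,
# so normalizing the weights does not change the criterion value.
r_raw = pearson(composite(school, raw_a), composite(test, raw_b))
r_norm = pearson(composite(school, a), composite(test, b))
```

Because the criterion is scale-invariant, the normalization only fixes the reporting convention (weights that sum to one); it does not alter which weight direction is optimal.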
142	Canonical correlation analysis of aggravated robbery and poverty in Limpopo Province Rwizi, Tandanai 05 1900 (has links) The study aimed to explore the relationship between poverty and aggravated robbery in Limpopo Province. Sampled secondary data on aggravated robbery offenders, obtained from the South African Police Service (SAPS), Polokwane, were used in the analysis. Empirical research on poverty and crime suggests that poverty increases vulnerability to crime. The poverty set was categorised by gender, employment status, marital status, race, age and educational attainment. The variables for aggravated robbery were house robbery, bank robbery, street/common robbery, carjacking, truck hijacking, cash-in-transit and business robbery. Canonical correlation analysis was used to draw inferences about the relationship between these two sets. The results revealed a significant positive correlation of 0.219 (p-value = 0.025) between poverty and aggravated robbery at the five per cent significance level. Of the thirteen variables entered into the poverty-aggravated robbery model, five emerged as statistically significant: gender, marital status, employment status, common robbery and business robbery. / Mathematical Sciences / M. Sc. (Statistics) Canonical correlation analysis Poverty Aggravated robbery Limpopo Province Poverty-aggravated robbery model 519.537 Canonical correlation (Statistics)
143	多反應變量相關模式於不動產擔保估價之應用 / Application of multiple-response correlated models to real estate collateral valuation 陳俊宏 Unknown Date (has links) Based on Article 19, Item 7 and Article 20 of the Real Estate Appraisal Regulation, this thesis applies econometric models, including the seemingly unrelated regression model, the multivariate regression model and canonical correlation analysis, to verify, predict and control collateral valuations made by financial institutions. A collateral valuation produces two values: the assessed market value and the assessed collateral value. Most people believe that a ratio relationship exists between the two. In the traditional regression valuation model, one set of pricing factors affects a single real estate price. Is it possible for the same set of pricing factors to affect two real estate prices? The empirical results show that, at the 95% confidence level, two real estate prices are affected by the same set of pricing factors. Given this result, is there a more efficient econometric valuation model? Canonical correlation analysis builds an econometric model from the correlation between two sets of variables. Besides verifying again that the same set of pricing factors affects two real estate prices, it can, like factor analysis or principal component analysis, reduce the variables in each of the two sets so that only the essential ones are retained. 擔保品估價 特徵價格法 相似無關迴歸 多變量迴歸 典型相關分析 Collateral valuation Hedonic price method Seemingly unrelated regression Multivariate regression Canonical correlation analysis
144	企業資訊科技能力指標之研究 / A Study of Information Technology Capability Indicators 林志弘, Lin, Jyh Horng Unknown Date (has links) In a fiercely competitive global market, information technology (IT) has become a strategic asset for strengthening a firm's competitive advantage. Most previous studies that evaluate IT capability, or explore its relationship with firm performance, rely on perceptual data collected with behavioral questionnaires; very few use phenomenon data collected with factual questionnaires. Drawing on the Resource-Based View (RBV), this study builds an IT capability evaluation model from a factual questionnaire whose items cover the adoption status, application approach and usage experience of IT, such as hardware, networks, and the level and scope of information system use. It also explores the relationship between IT capability and firm performance. The IT capability data are calculated from an earlier government-commissioned survey, and the firm performance data are financial indicators collected or calculated from the public data of companies listed on the Taiwan Stock Exchange and the over-the-counter market. Canonical correlation analysis shows a significant positive relationship between IT capability and firm performance, especially for the operating capability reflected in accounting-based financial indicators. 
Before the industry comparison, grey entropy is used to estimate the weights of the three sub-constructs, and the overall IT capability of each sample is re-calculated as the weighted combination of the sub-construct capabilities. A one-way ANOVA then shows significant differences across industries in the overall IT capability of the firm and in the IT capabilities of the sub-constructs. The proposed IT capability evaluation model, together with the analysis of the relationship between IT capability and firm performance, can be used by government or industry observation institutions to monitor industry IT capability and its relationship with firm performance on a continuing basis. Observations for the whole country and across industries can serve as a reference for making appropriate IT investments for strategic advantage. 資訊科技能力 企業績效 灰色熵 典型相關分析 Information Technology Capability Firm Performance Grey Entropy Canonical Correlation Analysis
145	Závislost postavení týmů v žebříčku FIFA na dosažených výsledcích na vrcholných turnajích / Dependence of the position of teams in the FIFA rankings on the achievements in top tournaments Kotrba, Lukáš January 2015 (has links) Title: Dependence of the position of teams in the FIFA rankings on the achievements in top tournaments Objectives: The aim of this work is to determine how teams' positions in the FIFA rankings depend on their results at top tournaments, namely the World Cups of 1998, 2002, 2006, 2010 and 2014. Methods: The thesis uses regression and correlation analysis, the correlation coefficient and the regression line. Results: All results are presented in the analytical part of the work. A dependence was found in the observed data, with an increasing trend. The dependence was strongest for the 2014 World Cup, where it reached a high level, and weakest for the 1998 and 2002 World Cups, where it was below average. Keywords: Football, FIFA, FIFA World rankings, FIFA World Cup, correlation analysis, regression analysis, correlation coefficient, regression straight line
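The regression line and correlation coefficient named in entry 145 can be sketched as follows. The ranking positions and tournament placings are invented purely for illustration and are not data from the thesis; `fit_line` and `correlation` are hypothetical helper names.

```python
import math


def fit_line(x, y):
    """Least-squares regression line y = intercept + slope * x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    slope = sxy / sxx
    intercept = my - slope * mx
    return intercept, slope


def correlation(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical example: ranking position before a tournament (x) and
# final placing at the tournament (y) for five teams.
ranking = [1, 2, 3, 4, 5]
placing = [2, 1, 4, 3, 5]

intercept, slope = fit_line(ranking, placing)
r = correlation(ranking, placing)
```

A positive slope together with a correlation coefficient close to 1 would indicate that better-ranked teams tend to finish higher; values near 0 would indicate little dependence.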
146	Towards on-line domain-independent big data learning : novel theories and applications Malik, Zeeshan January 2015 (has links) Feature extraction is an extremely important pre-processing step in pattern recognition and machine learning problems. This thesis highlights how one can best extract features from data in an exhaustively online and purely adaptive manner. The solution to this problem is given for both labeled and unlabeled datasets by presenting a number of novel on-line learning approaches. Specifically, the differential equation method for solving the generalized eigenvalue problem is used to derive a number of novel machine learning and feature extraction algorithms. The incremental eigen-solution method is used to derive a novel incremental extension of linear discriminant analysis (LDA). Further, the proposed incremental version is combined with the extreme learning machine (ELM), in which the ELM is used as a preprocessor before learning. In this first key contribution, the dynamic random expansion characteristic of ELM is combined with the proposed incremental LDA technique and shown to offer a significant improvement in maximizing the discrimination between points in two different classes while minimizing the distance within each class, in comparison with other standard state-of-the-art incremental and batch techniques. In the second contribution, the differential equation method for solving the generalized eigenvalue problem is used to derive a novel, purely incremental version of the slow feature analysis (SFA) algorithm, termed the generalized eigenvalue based slow feature analysis (GENEIGSFA) technique. Further, the time series expansions of the echo state network (ESN) and radial basis functions (RBF) are used as a pre-processor before learning. In addition, higher order derivatives are used as a smoothing constraint on the output signal. 
Finally, an online extension of the generalized eigenvalue problem, derived from James Stone’s criterion, is tested, evaluated and compared with the standard batch version of the slow feature analysis technique to demonstrate its comparative effectiveness. In the third contribution, light-weight extensions of the statistical technique known as canonical correlation analysis (CCA), for both twinned and multiple data streams, are derived using the same method of solving the generalized eigenvalue problem. Further, the proposed method is enhanced by maximizing the covariance between data streams while simultaneously maximizing the rate of change of variances within each data stream. The recurrent set of connections used by the ESN serves as a pre-processor between the inputs and the canonical projections in order to capture shared temporal information in two or more data streams. A solution to the problem of identifying a low dimensional manifold in a high dimensional data space is then presented in an incremental and adaptive manner. Finally, an online, locally optimized extension of Laplacian Eigenmaps is derived, termed the generalized incremental Laplacian eigenmaps technique (GENILE). Apart from exploiting the incremental nature of the proposed manifold based dimensionality reduction technique, the projections produced by this method are shown, most of the time, to give better classification accuracy than standard batch versions of these techniques on both artificial and real datasets. 006.3
147	Hodnotenie environmentálneho statku - východoslovenská priehrada Ružín / Evaluation of the environmental good - The Eastern Slovak dam Ružín Kožariková, Veronika January 2010 (has links) The main purpose of the diploma thesis is to determine how much people visiting the Eastern Slovak dam Ružín are willing to pay for improved water quality, that is, for the environmental good. Willingness to pay is determined by a questionnaire survey using the contingent valuation method. The dam is a public good with no owner: everyone uses it, but no one takes care of it. This use has a negative rather than a positive effect, in the form of pollution and clogging with toxic sediments. The theoretical part is devoted to the construction of the dam and the need for it, as well as to the environmental problems that occur at the dam. These are related to the problem of public goods and "the tragedy of the commons." The contingent valuation method and the development of the questionnaire are described at the end of the theoretical part. The practical part consists of the evaluation of the questionnaire survey and a linear regression model, including the identification of its variables and the point estimates of its parameters. Finally, a statistical analysis of the impact of the variables on the amount people are willing to pay is carried out.
148	Avaliação da adaptação marginal e interna, da resistência à fratura após ciclagem termomecânica e das tensões nos implantes por correlação de imagens digitais em próteses parciais fixas sobre implantes com pilares e copings em zircônia com diferentes sistemas CAD/CAM / Evaluation of the marginal and internal fit, resistance to fracture after thermomechanical cycling and tensions in the implants by correlation of digital images in fixed partial dentures on implants with abutments and copings in zirconia with different CAD/CAM systems Mendes, Francielle Alves 02 June 2015 (has links) Considering growing aesthetic demands, the development of zirconia and advances in CAD/CAM technology, the aim of this study was to evaluate the marginal and internal fit, the stresses in the implants and the fracture resistance, after porcelain pressing and thermomechanical cycling, of implant-supported fixed partial dentures with zirconia abutments and frameworks made with two CAD/CAM systems (Neodent digital - Neodent and Lava - 3M ESPE), compared with the conventional method (n = 10). Marginal and internal fit were analyzed with a computerized microtomograph (microCT). Each prosthesis was scanned and the files were processed using the NRecon and CTAN software; the Dataview program was used to take the measurements. For thermomechanical cycling, the prostheses were placed in a chewing-simulation mechanical fatigue machine and a 120 N load was applied, with a tip simulating the antagonist occlusion, for 2,000,000 cycles. During the test, the prostheses were kept in distilled water and thermocycled between 5 °C and 55 °C. Digital image correlation analysis was used to assess the stresses generated around the implants by the fixed partial dentures. Five models from each of the CAD/CAM systems and one antagonist were selected, and a load of 250 N was applied at 0.1 mm/min in a universal testing machine. Fracture resistance was evaluated by applying a force perpendicular to the long axis of the prosthesis, at the pontic, until fracture left no further resistance. After this test, the relationship between the prosthesis components was evaluated in a scanning electron microscope (SEM). 
The statistical analysis showed a significant difference in the abutment-implant fit of the molars between the Lava and ZirNeo groups and between the Lava and Control groups (p=.008). Vertical and horizontal misfit did not differ significantly before and after pressing and cycling (p>.005). Axial internal misfit differed significantly before and after for the molars of the Lava and ZirNeo groups (p<.001). Occlusal internal misfit differed significantly for the PM of the TiNeo and Control groups and for the molars of the Lava and ZirNeo groups (p<.005). There was a significant difference in stress in the cervical region of the molars between the ZirNeo and Lava groups (p=.015), with higher values for the Lava group. The TiNeo group had higher fracture resistance than the others (p=.022). The relationship between the prosthesis components remained favorable in all groups. The results show that porcelain pressing and thermomechanical cycling did not influence marginal misfit and improved internal fit. The zirconia group milled with the Neodent digital system had a higher stress concentration in the cervical region and may therefore show greater bone loss in that region. The TiNeo group was the most resistant to fracture. Between milling zirconia with the Neodent digital system or with Lava, the Lava system distributes stress better along the implant but showed greater internal misfit values. Between milling in titanium and fabricating the prosthesis by the conventional method, milling is the better option. Adaptação marginal CAD/CAM CAD/CAM Ciclagem termomecânica Digital image correlation analysis Fracture resistance Marginal fit MEV Resistência a fratura SEM thermomechanical cycling Zirconia Zircônia
149	Otimização das colunas de absorção da recuperação de acetona na produção de Filter Tow por meio de estudos fenomenológicos e análise estatística. / Optimization of the absorption columns in the acetone recovery at the Filter Tow production by phenomenological studies and statistical analysis. Nasser Junior, Roberto 13 November 2009 (has links) Absorption is the key step of acetone recovery in Filter Tow production, because it reduces acetone emissions and improves the economics. For this reason it is the subject of this study, which covers a review of the phenomenological concepts, including the choice of the best vapor-liquid equilibrium model, the execution of coherent mass balances and the establishment of the "Photography" (snapshot) of the original situation, and reports a complex case of converting sieve trays to structured packing. However, the operation of the absorption columns is also influenced by other variables of unknown effect, which act as noise relative to the phenomenology; evaluating them is the justification for this study. With the general objective of optimizing the absorption, a statistical analysis was performed on collected operating data, using all variables, both phenomenological and noise, with the specific objective of obtaining empirical models that complement the phenomenological simulations and broaden their scope. For the statistical analysis, sets of historical data were collected and validated by the coherent mass balances and the "Photography", which made it possible to proceed from variable selection to the regression models. These models yield a new control mode that stabilizes the operation and so allows optimization. In environmental terms, the use of these models reduces acetone losses to the environment by up to 15% and also reduces energy consumption, with savings of approximately 1 million reais per year at no additional cost. 
Absorção Absorption Acetato de celulose Análise de regressão e de correlação Análise estatística de dados Cellulose acetate Data statistical analysis Equilíbrio líquido-vapor (modelos) Filer tow Filter Tow Regression and correlation analysis Vaporliquid equilibrium (models)
150	Développement de méthodes d'analyse de données en ligne / Development of methods to analyze data streams Bar, Romain 29 November 2013 (has links) High-dimensional data vectors arriving on-line are assumed to be independent observations of a random vector. In the second chapter, this random vector, denoted by Z, is partitioned into two vectors R and S, and the observations are assumed to be identically distributed. A recursive method is then defined for the sequential estimation of the first r factors of the projected PCA of R with respect to S. Next, some particular cases are investigated: canonical correlation analysis, factorial discriminant analysis and correspondence analysis. In each case, several processes specific to the analysis are defined. In the third chapter, the data are observations of a random vector Zn whose expectation En varies with time. Let Rn = Zn - En, and suppose that the vectors Rn form an independent and identically distributed sample of a random vector R. Several stochastic approximation processes are defined to estimate on-line the direction vectors of the principal axes of a partial principal components analysis (PCA) of R. This result is then applied to the particular case of a partial generalized canonical correlation analysis (gCCA), after defining a stochastic approximation process of the Robbins-Monro type to estimate recursively the inverse of a covariance matrix. In the fourth chapter, the case where both the expectation and the covariance matrix of Zn vary with time n is considered. Finally, simulation results are given in chapter 5. Big Data Flux de données Analyse en composantes principales (ACP) ACP projetée Analyse canonique généralisée (ACG) Approximation stochastique Big data Data streams Principal components analysis (PCA) Projected PCA Stochastic approximation 519.5
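A minimal sketch of a Robbins-Monro type recursion, in the simplest setting where the expectation is constant: the estimate is moved towards each new observation with step size a_n = 1/n, which reproduces the running sample mean without storing past data. This only illustrates the general form of a stochastic approximation process. It is not one of the processes defined in the thesis for projected PCA or for the inverse of a covariance matrix, and the function name and data are hypothetical.

```python
def robbins_monro_mean(stream):
    """Yield m_n = m_{n-1} + a_n * (z_n - m_{n-1}) with step size a_n = 1/n."""
    estimate = None
    for n, z in enumerate(stream, start=1):
        if estimate is None:
            # Start from the zero vector; with a_1 = 1 the first update
            # replaces it by the first observation.
            estimate = [0.0] * len(z)
        step = 1.0 / n
        estimate = [m + step * (zi - m) for m, zi in zip(estimate, z)]
        yield list(estimate)


# Hypothetical stream of two-dimensional observations.
observations = [[1.0, 4.0], [3.0, 0.0], [2.0, 2.0], [6.0, 2.0]]

estimates = list(robbins_monro_mean(observations))
final = estimates[-1]
```

Each observation is used once and then discarded, which is the property that makes this kind of recursion attractive for data arriving on-line. Other step-size sequences could be substituted when the target drifts over time, at the cost of no longer matching the sample mean exactly.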
