151 |
Use and Misuse of X-Ray Photoelectron Spectroscopy (XPS): Reproducibility, Gross Errors, Data Reporting, and Peak Fitting
Major, George Hobbs 18 April 2023
X-ray photoelectron spectroscopy (XPS) is the most widely used surface analysis technique for chemically probing surfaces. Its popularity stems from the large amount of information that can be gathered about the electronic states of the atoms it probes, including core shell information and valence electron information. Simple qualitative analysis (peak identification) can often be performed, but quantitative analysis is a much more complicated process. Although XPS usage has increased dramatically, so has the amount of erroneous analysis observed in the literature. In my thesis, I first present a perspective on how to improve the quality of surface and material data analysis. This chapter focuses on responsible groups, using population biology models and the Prisoner's Dilemma to describe the situation and the potential changes that must be made to counteract error propagation. I quantify errors in XPS data analysis to provide perspective on the gravity of the situation. Over 400 publications in three journals were analyzed. Additionally, another 900 journals were surveyed to determine the quantity of information in the analysis. The parameters include experimental parameters, e.g., the pass energy, peak fitting parameters, the spot size, X-ray source, and the type of spectrometer. I found that over 40% of the publications had significant errors that could potentially change the conclusions of the publication. About 35% of all papers neglected to note the type of spectrometer used, and 85% did not mention the type of software used for analysis. The latter half of this work focuses on XPS peak fitting. I present a broad overview of peak fitting, including how to determine the appropriate background and peak shapes to use, how to quantify XPS data, and how to account for other phenomena associated with photoemission. The line shape chosen for peak fitting is critical, as it is the synthetic shape that is used to model observed physical phenomena. 
A detailed review of typical line shapes, including the Voigt and pseudo-Voigt functions, is presented, along with how to apply them in peak fitting. How and why asymmetric peak shapes are required is also discussed, including which effects cause asymmetry and whether it is inherent to the material or to the method of analysis. Finally, a discussion on using constraints to properly model known effects is presented. These efforts were guided by the findings in the former half of this work. The trends presented here are not unique to XPS. Other fields and techniques have similar reproducibility problems. This work discusses possible solutions and what efforts the community needs to make to remedy the reproducibility crisis. Additionally, this work includes guides containing original research to improve approaches to XPS analysis, including peak fitting, constraint parameters, and the appropriate use of line shapes.
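As an illustration of the line shapes named in this abstract, the following is a minimal sketch of a sum-form pseudo-Voigt: a weighted sum of a Gaussian and a Lorentzian that share one full width at half maximum (FWHM). The function names, the unit-height normalization, and the mixing parameter `eta` are assumptions chosen for this example; the thesis may define its line shapes differently.

```python
import math


def gaussian(x, center, fwhm):
    """Unit-height Gaussian with the given full width at half maximum."""
    return math.exp(-4.0 * math.log(2.0) * ((x - center) / fwhm) ** 2)


def lorentzian(x, center, fwhm):
    """Unit-height Lorentzian with the given full width at half maximum."""
    return 1.0 / (1.0 + 4.0 * ((x - center) / fwhm) ** 2)


def pseudo_voigt(x, center, fwhm, eta):
    """Sum-form pseudo-Voigt: eta * Lorentzian + (1 - eta) * Gaussian.

    eta = 0 gives a pure Gaussian and eta = 1 a pure Lorentzian.
    Both components share the same FWHM (an assumption of this sketch).
    """
    if not 0.0 <= eta <= 1.0:
        raise ValueError("eta must lie in [0, 1]")
    return eta * lorentzian(x, center, fwhm) + (1.0 - eta) * gaussian(x, center, fwhm)


if __name__ == "__main__":
    # Hypothetical peak: centered at 284.8 eV, FWHM 1.2 eV, 30% Lorentzian.
    for energy in (283.6, 284.2, 284.8, 285.4, 286.0):
        print(energy, round(pseudo_voigt(energy, 284.8, 1.2, 0.3), 4))
```

Because both components have unit height and the same FWHM, the mixed shape also has unit height at the center and falls to one half at a distance of FWHM/2, whatever the value of `eta`.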
|
152 |
Accounting for variance and hyperparameter optimization in machine learning benchmarks
Bouthillier, Xavier 06 1900
La récente révolution de l'apprentissage automatique s'est fortement appuyée sur l'utilisation de bancs de test standardisés. Ces derniers sont au centre de la méthodologie scientifique en apprentissage automatique, fournissant des cibles et mesures indéniables des améliorations des algorithmes d'apprentissage. Ils ne garantissent cependant pas la validité des résultats ce qui implique que certaines conclusions scientifiques sur les avancées en intelligence artificielle peuvent s'avérer erronées.
Nous abordons cette question dans cette thèse en soulevant d'abord la problématique (Chapitre 5), que nous étudions ensuite plus en profondeur pour apporter des solutions (Chapitre 6) et finalement developpons un nouvel outil afin d'amélioration la méthodologie des chercheurs (Chapitre 7).
Dans le premier article, chapitre 5, nous démontrons la problématique de la reproductibilité pour des bancs de test stables et consensuels, impliquant que ces problèmes sont endémiques aussi à de grands ensembles d'applications en apprentissage automatique possiblement moins stable et moins consensuels. Dans cet article, nous mettons en évidence l'impact important de la stochasticité des bancs de test, et ce même pour les plus stables tels que la classification d'images. Nous soutenons d'après ces résultats que les solutions doivent tenir compte de cette stochasticité pour améliorer la reproductibilité des bancs de test.
Dans le deuxième article, chapitre 6, nous étudions les différentes sources de variation typiques aux bancs de test en apprentissage automatique, mesurons l'effet de ces variations sur les méthodes de comparaison d'algorithmes et fournissons des recommandations sur la base de nos résultats. Une contribution importante de ce travail est la mesure de la fiabilité d'estimateurs peu coûteux à calculer mais biaisés servant à estimer la performance moyenne des algorithmes. Tel qu'expliqué dans l'article, un estimateur idéal implique plusieurs exécution d'optimisation
d'hyperparamètres ce qui le rend trop coûteux à calculer. La plupart des chercheurs doivent donc recourir à l'alternative biaisée, mais nous ne savions pas jusqu'à présent la magnitude de la dégradation de cet estimateur. Sur la base de nos résultats, nous fournissons des recommandations pour la comparison d'algorithmes sur des bancs de test avec des budgets de calculs limités. Premièrement, les sources de variations devraient être randomisé autant que possible. Deuxièmement, la randomization devrait inclure le partitionnement aléatoire des données pour les ensembles d'entraînement, de validation et de test, qui s'avère être la plus importante des sources de variance. Troisièmement, des tests statistiques tel que la version du Mann-Withney U-test présenté dans notre article devrait être utilisé plutôt que des comparisons sur la simple base de moyennes afin de prendre en considération l'incertitude des mesures de performance.
Dans le chapitre 7, nous présentons un cadriciel d'optimisation d'hyperparamètres développé avec principal objectif de favoriser les bonnes pratiques d'optimisation des hyperparamètres. Le cadriciel est conçu de façon à privilégier une interface simple et intuitive adaptée aux habitudes de travail des chercheurs en apprentissage automatique. Il inclut un nouveau système de versionnage d'expériences afin d'aider les chercheurs à organiser leurs itérations expérimentales et tirer profit des résultats antérieurs pour augmenter l'efficacité de l'optimisation des hyperparamètres. L'optimisation des hyperparamètres joue un rôle important dans les bancs de test, les hyperparamètres étant un facteur confondant significatif. Fournir aux chercheurs un instrument afin de bien contrôler ces facteurs confondants est complémentaire aux recommandations pour tenir compte des sources de variation dans le chapitre 6.
Nos recommendations et l'outil pour l'optimisation d'hyperparametre offre une base solide pour une méthodologie robuste et fiable. / The recent revolution in machine learning has been strongly based on the use of standardized benchmarks. Providing clear target metrics and undeniable measures of improvements of learning algorithms, they are at the center of the scientific methodology in machine learning. They do not ensure validity of results however, therefore some scientific conclusions based on flawed methodology may prove to be wrong.
In this thesis we address this question by first raising the issue (Chapter 5), then we study it to find solutions and recommendations (Chapter 6) and build tools to help improve the methodology of researchers (Chapter 7).
In the first article, Chapter 5, we demonstrate the issue of reproducibility in stable and consensual benchmarks, implying that these issues are endemic to a large ensemble of machine learning applications that are possibly less stable or less consensual. We raise awareness of the important impact of stochasticity even in stable image classification tasks and contend that solutions for reproducible benchmarks should account for this stochasticity.
In the second article, Chapter 6, we study the different sources of variation that are typical in machine learning benchmarks, measure their effect on the comparison methods used to benchmark algorithms, and provide recommendations based on our results. One important contribution of this work is that we measure the reliability of a cheaper but biased estimator for the average performance of algorithms. As explained in the article, an ideal estimator involving multiple rounds of hyperparameter optimization is too computationally expensive. Most researchers must resort to using the biased alternative, but it has been unknown until now how serious a degradation of the quality of estimation this leads to. Our investigations provide guidelines for benchmarks on practical budgets. First, as many sources of variation as possible should be randomized. Second, the partitioning of data into training, validation and test sets should be randomized as well, since this is the most important source of
variation. Finally, statistical tests should be used instead of ad-hoc average comparisons so that the uncertainty of performance estimation can be accounted for when comparing machine learning algorithms.
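The recommendation to replace average comparisons with a statistical test can be sketched as follows. This is a plain Mann-Whitney U statistic written for illustration, not necessarily the specific version of the test presented in the thesis; the accuracy values are hypothetical and stand for runs in which seeds and data splits were randomized.

```python
def mann_whitney_u(a, b):
    """U statistic for sample a versus sample b.

    Counts the pairs (x, y) with x from a and y from b where x > y;
    ties count as one half.
    """
    u = 0.0
    for x in a:
        for y in b:
            if x > y:
                u += 1.0
            elif x == y:
                u += 0.5
    return u


def prob_outperform(a, b):
    """Estimate of P(A > B): the fraction of run pairs in which A beats B."""
    return mann_whitney_u(a, b) / (len(a) * len(b))


if __name__ == "__main__":
    # Hypothetical test accuracies from five randomized runs per algorithm.
    acc_a = [0.912, 0.905, 0.921, 0.899, 0.915]
    acc_b = [0.901, 0.908, 0.897, 0.903, 0.910]
    print("U =", mann_whitney_u(acc_a, acc_b))
    print("P(A > B) ~", prob_outperform(acc_a, acc_b))
```

Unlike a difference of means, the pairwise statistic reflects how consistently one algorithm beats the other across randomized runs. For a p-value, an existing routine such as `scipy.stats.mannwhitneyu` can be used on the same two samples.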
In Chapter 7, we present a framework for hyperparameter optimization that has been developed with the main goal of encouraging best practices for hyperparameter optimization. The framework is designed to favor a simple and intuitive interface adapted to the workflow of machine learning researchers. It includes a new version control system for experiments to help researchers organize their rounds of experimentations and leverage prior results for more efficient hyperparameter optimization. Hyperparameter optimization plays an important role in benchmarking, with the effect of hyperparameters being a serious confounding factor. Providing an instrument for researchers to properly control this confounding factor is complementary to our
guidelines to account for sources of variation in Chapter 6.
Our recommendations, together with our tool for hyperparameter optimization, provide a solid basis for a reliable methodology in machine learning benchmarks.
|
153 |
EMPIRICAL EVALUATION OF CROSS-SITE REPRODUCIBILITY AND DISCRIMINABILITY OF RADIOMIC FEATURES FOR CHARACTERIZING TUMOR APPEARANCE ON PROSTATE MRI
Chirra, Prathyush V., Chirra 31 August 2018
No description available.
|
154 |
How reproducible is the acoustical characterization of porous media?
Pompoli, F., Bonfiglio, P., Horoshenkov, K.V., Khan, Amir, Jaouen, L., Bécot, F-X., Sgard, F., Asdrubali, F., D'Alessandro, F., Hübelt, J., Atalla, N., Amédin, C.K., Lauriks, W., Boeckx, L. 04 February 2020
Yes / There is a considerable number of research publications on the characterization of porous media that is carried out in accordance with ISO 10534-2 (International Standards Organization, Geneva, Switzerland, 2001) and/or ISO 9053 (International Standards Organization, Geneva, Switzerland, 1991). According to the Web of Science(TM) (last accessed 22 September 2016) there were 339 publications in the Journal of the Acoustical Society of America alone which deal with the acoustics of porous media. However, the reproducibility of these characterization procedures is not well understood. This paper deals with the reproducibility of some standard characterization procedures for acoustic porous materials. The paper is an extension of the work published by Horoshenkov, Khan, Bécot, Jaouen, Sgard, Renault, Amirouche, Pompoli, Prodi, Bonfiglio, Pispola, Asdrubali, Hübelt, Atalla, Amédin, Lauriks, and Boeckx [J. Acoust. Soc. Am. 122(1), 345-353 (2007)]. In this paper, independent laboratory measurements were performed on the same material specimens so that the naturally occurring inhomogeneity in materials was controlled. It also presented the reproducibility data for the characteristic impedance, complex wavenumber, and for some related pore structure properties. This work can be helpful to better understand the tolerances of these material characterization procedures so improvements can be developed to reduce experimental errors and improve the reproducibility between laboratories.
|
155 |
Test-retest reproducibility of accommodation measurements gathered in an unselected sample of UK primary school children
Adler, P., Scally, Andy J., Barrett, Brendan T. January 2012
No / Purpose To determine the test-retest reproducibility of accommodation measurements gathered in an unselected sample of primary school children.
Methods Monocular and binocular amplitudes of accommodation (AA) were collected by five different Testers using the push-up method in an unselected sample of school children (n=137, age: 8.1±2.1 years). Testing was conducted on three occasions (average testing interval: 8 days) in 91.2% of the children.
Results The median AA was 19.1D, the variation due to the identity of the Tester was 3.1D (p<0.001) and the within-subject variation (which takes the variation due to Tester identity into account) was 5.2D. Around 75-79% of children exhibited monocular AAs ≥12D when tested on the first occasion, but more than 90% exhibited an AA ≥12D when subsequently tested. Around 74-80% of those with an AA <12D on the first occasion had values ≥12D on subsequent testing even though no treatment had been undertaken. Poorer initial AA measurements were less likely to improve on repeat testing.
Conclusions Our results reveal substantial intra-individual variation in AA measurements, raising questions about the usefulness of this test in children aged 4-12 years. We suggest that AA assessment may prove most useful in children in this age range as a pass/fail check for substantially reduced AA, for example, where the AA is <12D. Our sample would suggest that the prevalence of persistently reduced AA may be around 3.2% when tested under binocular conditions and 4-6.4% when tested monocularly.
|
156 |
Avaliação da repetibilidade e da reprodutibilidade do ensaio de duplo puncionamento. / Evaluation of the repeatability and reproducibility of the double punch test.
Nogueira, André Baltazar 02 May 2019
Os avanços na utilização do concreto reforçado com fibras (CRF) para fins estruturais trazem cada vez mais a necessidade de se desenvolver ensaios práticos que possam controlar o material, analisando a capacidade resistente residual do compósito para diferentes níveis de abertura de fissura. O ensaio de duplo puncionamento (EDP) é uma promissora alternativa nesta direção, uma vez que haja uma correlação já comprovada com outros métodos de ensaio bem estabelecidos. Este trabalho objetivou investigar a repetibilidade e reprodutibilidade do EDP, considerando os parâmetros de cargas de fissura da matriz cimentícia e cargas residuais, verificando se existe igualdade de médias e igualdade de variâncias nas diferentes situações criadas, para 95% de confiabilidade. A repetibilidade foi avaliada por meio de um programa experimental, com moldagem dos corpos de prova no Centro de Pesquisa e Desenvolvimento da Construção Civil - CPqDCC, da Escola Politécnica da Universidade de São Paulo, envolvendo dois teores de fibra de aço com ancoragem em gancho (30 kg/m³ e 55 kg/m³) em três séries iguais de moldagem. A reprodutibilidade foi avaliada em um programa interlaboratorial, com participação de 6 laboratórios independentes e empregando a mesma fibra e teores. Como ferramentas para comparação dos resultados e avaliar se houve repetição e reprodução dos parâmetros do ensaio, foi utilizada análise de variância (ANOVA) e a metodologia da ASTM E691. Os resultados mostraram que foi possível obter repetibilidade e reprodutibilidade de médias para os parâmetros de cargas residuais com um número limitado de corpos de prova. No entanto, as variâncias são mais sensíveis que as médias, e não mostraram repetição total quando a homogeneização das fibras no volume total do concreto é prejudicada. Ainda, as cargas de fissuração do ensaio (Pf) não se mostraram totalmente repetíveis e nem reprodutíveis, pois podem ser afetadas pela máquina de ensaio e operador. 
/ Advances in the use of fiber reinforced concrete (FRC) for structural applications bring the need to develop more practical tests to control the material by analyzing the residual strength of the composite at different crack opening levels. The double punch test (DPT) is a promising alternative in that sense, once a correlation with well-established test methods is achieved. This work aimed to investigate the repeatability and reproducibility of the DPT, considering the matrix cracking load and the residual loads, and verifying whether means and variances are equal across the different situations created, at a 95% confidence level. Repeatability was evaluated in an experimental program, with the specimens molded at the Research and Development Center in Civil Construction (CPqDCC) of the Polytechnic School of the University of São Paulo, involving two contents of hooked-end steel fiber (30 kg/m³ and 55 kg/m³) in three identical molding series. Reproducibility was evaluated in an interlaboratory program, with the participation of six independent laboratories, using the same fiber and contents. Analysis of variance (ANOVA) and the ASTM E691 methodology were applied to compare the results and to evaluate whether the test parameters were repeated and reproduced. Together with these tools, an inductive test was used on the reproducibility specimens to verify the scatter of the fiber among them. The results showed that it was possible to obtain repeatability and reproducibility of means for the parameters related to the residual loads with a limited number of specimens. However, the variances are more sensitive than the means, and did not show total repetition when the homogenization of the fibers in the total volume of the concrete is compromised. Moreover, the cracking loads of the test (Pf) were neither fully repeatable nor fully reproducible, as they can be affected by the testing machine and the operator.
|
157 |
Avaliação da confiabilidade interexaminador dos testes sensoriais quantitativos em estruturas mastigatórias / Evaluation of interrater reliability of quantitative sensory testing in the masticatory structures
Araújo Junior, Everardo Napoleão Santana de 29 March 2018
Este estudo estimou a confiabilidade interexaminador e a concordância da avaliação somatossensorial realizada na região do músculo masseter e da articulação temporomandibular (ATM) em um grupo de voluntários saudáveis do sexo feminino e masculino. Quarenta participantes saudáveis (20 homens e 20 mulheres) foram avaliados em duas sessões por dois examinadores diferentes. Os testes aplicados foram limiar de detecção ao frio (CDT, siglas em inglês), limiar de detecção ao calor (WDT), limiar sensorial térmico (TSL), limiar de dor ao frio (CPT), limiar de dor ao calor (HPT), limiar de detecção mecânica (MDT), limiar de dor mecânica (MPT) Razão da somação temporal (WUR) e o limiar de dor a pressão (PPT). ANOVA, coeficiente de correlação intraclasse (ICC) e erro padrão de mensuração (SEM) foram computados ( = 5%). O intervalo de confiança de 95% sem sobreposição (IC 95%) dos valores de ICC foi considerado significativamente diferente. Setenta e sete por cento dos valores de ICC de todas as medidas do QST variaram entre razoável a excelente (ICCs: 0,47-0,97). No entanto, a confiabilidade do WDT, TSL e HPT do masseter foi significativamente maior do que da ATM, enquanto que a confiabilidade de MDT da ATM foi maior do que do músculo masseter. Além disso, a seguinte combinação de teste/sítio apresentou os menores valores de ICC nas mulheres: HPT e MDT da ATM e MPT de ambos os sítios ATM e masseter. Para o erro padrão de mensuração, os maiores valores foram apresentados no CPT e MPT. A avaliação somatossensorial geral das estruturas de mastigatórias realizadas por dois examinadores pode ser considerada suficientemente confiável para discriminar os participantes. No entanto, as possíveis influências do sitio e do sexo nos parâmetros de reprodutibilidade devem ser levadas em consideração para uma interpretação e aplicação clínica apropriadas. 
/ This study estimated the interrater reliability and agreement of the somatosensory assessment performed at the masseter and temporomandibular joint (TMJ) region in a group of healthy female and male participants. Forty healthy participants (20 men and 20 women) were evaluated in two sessions by two different examiners. Cold detection threshold (CDT), warm detection threshold (WDT), thermal sensory limen (TSL), cold pain threshold (CPT), heat pain threshold (HPT), mechanical detection threshold (MDT), mechanical pain threshold (MPT), wind-up ratio (WUR) and pressure pain threshold (PPT) were assessed on the skin overlying the masseter body and the TMJ. Mixed ANOVA, intraclass correlation coefficients (ICC) and standard error of measurement (SEM) were applied to the data (α = 5%). ICC values with non-overlapping 95% confidence intervals (95% CI) were considered significantly different. The ICC values of 77% of all QST measurements were considered fair to excellent (ICCs: 0.47-0.97). However, the reliability of WDT, TSL and HPT at the masseter was significantly higher than at the TMJ, whereas the MDT reliability at the TMJ was higher than at the masseter. In addition, the following test/site combinations presented significantly lower ICCs for women: HPT and MDT at the TMJ, and MPT at both the TMJ and the masseter. Finally, the highest SEM values were found for CPT and MPT. The overall somatosensory assessment of the masticatory structures performed by two examiners can be considered sufficiently reliable to discriminate participants. Nevertheless, possible site and sex influences on the reproducibility parameters should be taken into account for an appropriate interpretation and clinical application.
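The two reliability statistics named in this abstract can be sketched as below. The abstract does not state which ICC form was used, so the two-way random-effects, absolute-agreement, single-measurement form, often written ICC(2,1), is an assumption here, as is the common convention SEM = SD × sqrt(1 − ICC) with the SD pooled over all measurements. The threshold values in the usage example are hypothetical.

```python
import math
from statistics import mean, stdev


def icc_2_1(ratings):
    """ICC(2,1) for a table with one row per participant, one column per examiner."""
    n = len(ratings)        # participants
    k = len(ratings[0])     # examiners
    grand = mean([v for row in ratings for v in row])
    row_means = [mean(row) for row in ratings]
    col_means = [mean([row[j] for row in ratings]) for j in range(k)]
    # Mean squares from the two-way ANOVA decomposition.
    msr = k * sum((m - grand) ** 2 for m in row_means) / (n - 1)
    msc = n * sum((m - grand) ** 2 for m in col_means) / (k - 1)
    sse = sum(
        (ratings[i][j] - row_means[i] - col_means[j] + grand) ** 2
        for i in range(n)
        for j in range(k)
    )
    mse = sse / ((n - 1) * (k - 1))
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


def standard_error_of_measurement(ratings, icc):
    """SEM = SD * sqrt(1 - ICC), with SD pooled over all measurements."""
    sd = stdev([v for row in ratings for v in row])
    return sd * math.sqrt(1.0 - icc)


if __name__ == "__main__":
    # Hypothetical pressure pain thresholds (kPa): rows are participants,
    # columns are examiner 1 and examiner 2.
    ppt = [[210.0, 220.0], [180.0, 175.0], [250.0, 262.0], [195.0, 190.0]]
    icc = icc_2_1(ppt)
    print("ICC(2,1):", icc)
    print("SEM:", standard_error_of_measurement(ppt, icc))
```

The absolute-agreement form penalizes a systematic offset between examiners, which a consistency-only ICC would ignore.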
|
158 |
Achados clínicos e da análise videofluoroscópica da deglutição em pacientes com doença pulmonar obstrutiva crônica / Clinical and videofluoroscopic evaluation of swallowing in patients with chronic obstructive pulmonary disease
Chaves, Rosane de Deus 15 April 2014
Indivíduos com doença pulmonar obstrutiva crônica (DPOC) apresentam sintomas de disfagia, indicando a existência de alteração da deglutição nessa população. O objetivo da presente tese foi identificar as características da deglutição nos pacientes com DPOC. A deglutição foi avaliada por meio do exame videofluoroscopia da deglutição. Foi utilizado um protocolo para análise dos seguintes parâmetros: tempo de trânsito faríngeo, duração do contato da base de língua com a parede posterior da faringe, resíduo em valécula e penetração/aspiração. Os resultados foram analisados através de dois estudos. No primeiro estudo foi verificada a reprodutibilidade do protocolo para análise da videofluoroscopia da deglutição em uma população de adultos saudáveis. Foram avaliados 20 indivíduos saudáveis, de ambos os gêneros, com idades entre 50 e 65 anos. O protocolo consistiu na avaliação da deglutição de 10ml de consistência líquida. A análise estatística envolveu a avaliação da reprodutibilidade do método de análise entre juízes e a análise dos dados quantitativos, levando-se em consideração os gêneros. Em relação à análise das respostas dos juízes, foi observada significância estatística, com alta e boa reprodutibilidade para todas as comparações realizadas. Não foram encontradas diferenças estatisticamente significantes para o tempo de trânsito faríngeo; para a duração do contato da base de língua na parede posterior da faringe; e para a porcentagem de resíduo na valécula. Não houve penetração/aspiração para nenhuma das consistências testadas. Concluindo, os resultados indicaram que o protocolo para análise da videofluoroscopia da deglutição é reprodutível. Os parâmetros de deglutição avaliados não se diferenciaram entre os gêneros. O resíduo na valécula mostrou-se presente em 40% da amostra, sugerindo que este parâmetro, isoladamente, não é indicativo de alteração para essa faixa etária. 
No segundo estudo foram avaliados 20 pacientes com DPOC, que foram comparados a 20 indivíduos saudáveis, pareados por gênero e idade. O protocolo consistiu na avaliação da deglutição de consistência líquida, pastosa e sólida. Os participantes do estudo não apresentaram sinais de penetração/aspiração para nenhuma das consistências testadas. Os pacientes com DPOC apresentaram maior duração do tempo de trânsito faríngeo para a consistência líquida e pastosa. Em relação à duração do contato de base de língua com a parede posterior da faringe, os pacientes com DPOC apresentaram maior duração para as consistências liquida e pastosa. Não foi observada diferença estatisticamente significante para a distribuição dos indivíduos nos diferentes níveis de resíduo faríngeo. Concluindo, o presente estudo sugere que os pacientes com DPOC podem apresentar adaptações fisiológicas como uma manobra protetora da deglutição para evitar penetração/ aspiração de conteúdo faríngeo. Os resultados indicam que o resíduo em valécula não pode ser considerado um fator isolado para explicar as alterações de deglutição nessa população / Individuals with chronic obstructive pulmonary disease (COPD) present symptoms of dysphagia, indicating that swallowing is altered in this population. The purpose of the present thesis was to identify swallowing characteristics of patients with COPD. Swallowing was assessed through videofluoroscopic examination. A protocol was used aiming at the following parameters: pharyngeal transit time; duration of the tongue base contact with the posterior pharyngeal wall; valleculae residue and penetration/aspiration. The results were presented in two different studies. The first study verified the reproducibility of the adopted protocol in healthy adults. Twenty healthy individuals of both genders, with ages between 50 and 65 years were assessed. The protocol consisted in analyzing the swallow of 10ml of a liquid consistency. 
The statistical analysis involved the verification of the reproducibility of the results between judges and the analyses of the quantitative data (i.e. differences between genders). This first analysis indicated that reproducibility was high between the judges for all of the comparisons. No statistical differences were found between genders for any of the tested parameters (i.e. pharyngeal transit time; duration of the tongue base contact with the posterior pharyngeal wall; and percentage of valleculae residue). Penetration/aspiration was not observed for any of the tested food consistencies. The results of the first study indicated that the protocol used to analyze the videofluoroscopy of swallow is reproducible. The investigated swallowing parameters did not vary between genders. Valleculae residue was present in 40% of the studied sample, suggesting that this parameter alone does not indicate swallowing alterations in this age group. In the second study, the swallow of 20 patients with COPD was compared to 20 healthy individuals, paired by age and gender. The protocol consisted of analyzing the swallow of liquid, paste and solid food consistencies. Participants of the study did not present any signs of penetration/aspiration for any of the tested food consistencies. Patients with COPD presented longer pharyngeal transit times for the liquid and paste consistencies. Regarding the duration of the tongue base contact with the posterior pharyngeal wall, patients with COPD presented longer durations for the liquid and paste consistencies. No statistical difference was observed for the distribution of individuals among the different severity levels of valleculae residue. The results of the second study suggest that patients with COPD can present physiologic adaptations (i.e. protective maneuver) to avoid penetration/aspiration. 
The results also indicate that valleculae residue should not be considered the only factor responsible for swallowing alterations in this population.
|
159 |
Avaliação da confiabilidade e validade da versão em português (SCI-MOODS - VP) de uma entrevista estruturada para o espectro do humor, a Structured Clinical Interview for Mood Spectrum (SCI-MOODS) / Reliability and Validity of a Brazilian Portuguese mood spectrum interview (SCI-MOODS-VP), the Structured Clinical Interview for Mood Spectrum (SCI-MOODS)
Ratzke, Roberto 11 April 2007
A SCI-MOODS (Entrevista Clínica Estruturada para o Espectro do Humor) é uma entrevista de 161 itens com sete subdomínios (humor depressivo e maníaco, energia depressiva e maníaca, cognição depressiva e maníaca e ritmicidade) que avalia o humor de forma dimensional, permitindo um diagnóstico mais preciso de transtornos afetivos. O objetivo deste estudo é desenvolver uma versão em português, a SCI-MOODS -VP, determinando sua confiabilidade, validade e uma linha de corte entre pacientes bipolares e unipolares e portadores de transtornos do humor e controles (normais e esquizofrênicos). A SCI-MOODS, versão 3.0, foi traduzida do inglês para o português brasileiro e posteriormente retro-traduzida. Foram selecionados, bipolares (n=47), unipolares (n=47), esquizofrênicos (n=18), e sujeitos sem transtorno psiquiátrico (n=22), de acordo com o SCID-I/P. Uma análise discriminante progressiva passo a passo foi feita entre os subdomínios de bipolares e unipolares além de uma curva ROC para determinar o melhor ponto de corte. A consistência interna de domínios e subdomínios da SCI-MOODS-VP variou entre 0.86 to 0.94. A confiabilidade entre examinadores de uma amostra de 10 bipolares e 10 unipolares apresentou coeficiente de correlação intra-classe de 0,956. Os sujeitos bipolares apresentaram escores total e maníaco da SCI-MOODS - VP maiores que os outros grupos. A curva receiver operating characteristic (ROC) do escore maníaco entre bipolares e unipolares apresentou área abaixo da curva (AUC) de 0,909. O escore maníaco igual ou acima de 30 demonstrou sensibilidade de 91, 49% e especificidade de 74, 47%. Outra curva entre pacientes com transtornos afetivos e controles apresentou escore depressivo com AUC de 0,869. O escore depressivo acima de 35 teve sensibilidade de 82.98% e especificidade de 75%. A energia maníaca e o humor maníaco foram os subdominios que melhor distinguiram os bipolares de unipolares através de uma análise discriminante passo a passo progressiva. 
A correlação parcial entre domínios e subdomínios, controlados por sexo e idade, medida pelo alfa de Cronbach, variou entre 0.472 a 0.937. Concluímos que a SCI-MOODS - VP apresenta boa validade discriminante e confiabilidade. / The SCI-MOODS (Structured Clinical Interview for Mood Spectrum) is a 161-item interview with seven subdomains (depressive and manic mood, depressive and manic energy, depressive and manic cognitive functioning, and rhythmicity) that evaluates mood as a dimensional construct, allowing a more precise diagnosis of mood disorders. We developed the SCI-MOODS-VP, a Brazilian Portuguese version, established its validity and reliability, and determined threshold scores for differentiating bipolar from unipolar patients and patients with affective disorders from non-affective controls. Version 3.0 of the English SCI-MOODS was translated into Brazilian Portuguese and back-translated. Bipolar (n=47), unipolar (n=47) and schizophrenic (n=18) patients, and 22 subjects without a psychiatric diagnosis according to the SCID-I/P, were selected. A forward stepwise discriminant analysis of the subdomains between bipolar and unipolar subjects was performed, and two ROC curves were used to determine the thresholds. The internal consistency of the SCI-MOODS-VP domains and subdomains ranged from 0.86 to 0.94. The interrater reliability in a sample of 10 bipolar and 10 unipolar subjects had an intraclass correlation coefficient of 0.956. The bipolar subjects had higher total and manic scores than the other groups. A receiver operating characteristic (ROC) curve of the manic score between bipolar and unipolar subjects had an area under the curve (AUC) of 0.909. A manic score equal to or above 30 had a sensitivity of 91.49% and a specificity of 74.47%. Another ROC curve, between patients with affective disorders and controls (subjects without a psychiatric diagnosis and schizophrenics), gave an AUC of 0.869 for the depressive score. A depressive score equal to or above 35 had a sensitivity of 82.98% and a specificity of 75%.
The manic energy and manic mood subdomains best distinguished bipolar from unipolar subjects in the discriminant analysis. The partial correlation among domains and subdomains, controlled for age and gender, measured by Cronbach's alpha, ranged from 0.472 to 0.937. The SCI-MOODS-VP has good reliability and discriminant validity.
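The sensitivity and specificity figures in this abstract can be checked with a short worked example. The abstract reports only group sizes and percentages, so the counts below (43 of 47, 35 of 47, 78 of 94, 30 of 40) are back-calculated assumptions that reproduce the reported percentages; they are not taken from the thesis.

```python
def sensitivity(true_pos, false_neg):
    """Fraction of true cases that score at or above the threshold."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg, false_pos):
    """Fraction of non-cases that score below the threshold."""
    return true_neg / (true_neg + false_pos)


if __name__ == "__main__":
    # Manic score threshold of 30: 47 bipolar versus 47 unipolar subjects.
    # Assumed counts: 43 bipolar at or above 30, 35 unipolar below 30.
    print(round(100 * sensitivity(43, 4), 2))   # 91.49
    print(round(100 * specificity(35, 12), 2))  # 74.47

    # Depressive score threshold of 35: 94 affective patients versus
    # 40 controls (18 schizophrenic + 22 without psychiatric diagnosis).
    # Assumed counts: 78 patients at or above 35, 30 controls below 35.
    print(round(100 * sensitivity(78, 16), 2))  # 82.98
    print(round(100 * specificity(30, 10), 2))  # 75.0
```

That whole-number counts reproduce every reported percentage is consistent with the group sizes given in the abstract.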
|
160 |
Elaboração e análise de confiabilidade de escala de avaliação funcional do sentar e levantar da cadeira para portadores de distrofia muscular de Duchenne (DMD) / Elaboration and functional evaluation scale reliability analysis of sitting and standing from chair for Duchenne muscular dystrophy (DMD) carriers
Hukuda, Michele Emy 10 August 2009
O objetivo foi construir escala do sentar e levantar da cadeira para distrofia muscular de Duchenne (EAF-1), testar confiabilidade e correlacioná-la com tempo de execução, idade e Escala de Vignos. A construção ocorreu por meio da avaliação de 30 crianças com distrofia muscular de Duchenne (5 a 12 anos), totalizando 120 registros do sentar e levantar. Utilizou-se Índice de Correlação Intra-Classe (ICC) e Coeficiente Kappa Ponderado. A escala abrange três fases para o sentar e três para o levantar, com escores de 0 a 44 e 0 a 54 respectivamente. Encontrou-se na repetibilidade ICC >= 0,91 e Kappa > 0,91 para o sentar; ICC >= 0,91 e Kappa > 0,87 para o levantar e na reprodutibilidade ICC >= 0,90 e Kappa > 0,88 para o sentar; ICC >= 0,89 e Kappa > 0,94 para o levantar; boa correlação com tempo do sentar e levantar (r = 0.69 e r = 0.66) e entre idade e sentar (r = 0.44). EAF-1 apresentou confiabilidade de excelente a muito boa, permitindo avaliação detalhada da função. / The aim was to construct a scale of sitting on and standing from a chair for Duchenne muscular dystrophy (EAF-1), to test its reliability, and to correlate it with execution time, age and the Vignos scale. The scale was constructed by evaluating 30 children with Duchenne muscular dystrophy (aged 5-12 years), totaling 120 records of sitting and standing. The Intraclass Correlation Coefficient (ICC) and the Weighted Kappa Coefficient were used. The scale comprises three phases for sitting and three for standing, with scores from 0 to 44 and from 0 to 54, respectively. For repeatability, ICC >= 0.91 and Kappa > 0.91 were found for sitting, and ICC >= 0.91 and Kappa > 0.87 for standing; for reproducibility, ICC >= 0.90 and Kappa > 0.88 were found for sitting, and ICC >= 0.89 and Kappa > 0.94 for standing. There was good correlation with the time taken to sit and stand (r = 0.69 and r = 0.66) and between age and sitting (r = 0.44). The EAF-1 showed excellent to very good reliability, permitting detailed evaluation of function.
|