• Refine Query
  • Source
  • Publication year
  • to
  • Language
  • 31
  • 16
  • 13
  • 8
  • 6
  • 5
  • 5
  • 5
  • 2
  • 2
  • 2
  • 1
  • 1
  • 1
  • Tagged with
  • 211
  • 103
  • 101
  • 101
  • 41
  • 41
  • 29
  • 19
  • 17
  • 17
  • 16
  • 16
  • 15
  • 15
  • 15
  • About
  • The Global ETD Search service is a free service for researchers to find electronic theses and dissertations. This service is provided by the Networked Digital Library of Theses and Dissertations.
    Our metadata is collected from universities around the world. If you manage a university/consortium/country archive and want to be added, details can be found on the NDLTD website.
141

Towards Cyberbullying-free social media in smart cities: a unified multi-modal approach

Kumari, K., Singh, J.P., Dwivedi, Y.K., Rana, Nripendra P. 27 September 2020 (has links)
Yes / Smart cities are shifting the presence of people from physical world to cyber world (cyberspace). Along with the facilities for societies, the troubles of physical world, such as bullying, aggression and hate speech, are also taking their presence emphatically in cyberspace. This paper aims to dig the posts of social media to identify the bullying comments containing text as well as image. In this paper, we have proposed a unified representation of text and image together to eliminate the need for separate learning modules for image and text. A single-layer Convolutional Neural Network model is used with a unified representation. The major findings of this research are that the text represented as image is a better model to encode the information. We also found that single-layer Convolutional Neural Network is giving better results with two-dimensional representation. In the current scenario, we have used three layers of text and three layers of a colour image to represent the input that gives a recall of 74% of the bullying class with one layer of Convolutional Neural Network. / Ministry of Electronics and Information Technology (MeitY), Government of India
142

Avaliação do preparo e da limpeza de canais radiculares ovais longos comparando os sistemas TF Adaptive e Reciproc, por meio da microtomografia computadorizada e da microscopia eletrônica de varredura / Evaluation of preparation and cleaning og long-oval root canals comparing TF Adaptive and Reciproc systems, using micro-computed tomography and scanning eléctron microscope

Busquim, Sandra Soares Kühne 23 May 2018 (has links)
A ação do instrumento endodôntico nas paredes do canal leva à formação de debris dentinários. Sua remoção é principalmente função das manobras de irrigação. Entretanto, a ação do instrumento pode facilitar ou não o acúmulo dos debris dentinários, principalmente em função da sua cinemática. O sistema Twisted File (TF) Adaptive (SybronEndo, Orange, CA) combina o movimento rotatório contínuo e reciprocante e o sistema Reciproc (VDW, Munique, Alemanha) realiza o preparo do canal radicular pelo movimento reciprocante puro. Os objetivos deste estudo foram, utilizando a microtomografia computadorizada (micro-CT): avaliar, ex vivo, o preparo (aumento de volume e superficies não preparadas) e quantificar o acúmulo de debris dentinários em canais ovais longos comparando os sistemas TF Adaptive e Reciproc; avaliar o efeito da irrigação ultrassônica ativada intermitente (IUAT) após o preparo químico-cirúrgico com os sistemas propostos, na redução de debris dentinários; e relacionar a presença de debris dentinários com o magma dentinário avaliado por meio do microscópio eletrônico de varredura. Canais distais de trinta e oito molares inferiores foram selecionados e divididos em dois grupos: G1 - TFA (n=19) e G2 - RC (n=19). Cada espécime foi submetido a três escaneamentos: prée pós-operatório e pós-irrigação ultrassônica passiva final. Após a reconstrução das imagens resultantes dos três escaneamentos, foi feito o corregistro das mesmas com o programa DataViewer. Os programas CTAn e CTvol foram utilizados para binarização dos objetos de interesse, avaliações morfométricas e reconstrução dos modelos tridimensionais. Foram mensurados o aumento de volume do canal, as superfícies não preparadas e os debris dentinários após o preparo químico-cirúrgico. Após a irrigação final, cinco raízes de cada grupo foram clivadas longitudinalmente ao meio no terço apical e analisadas no microscópio eletrônico de varredura (MEV) quanto à presença de magma dentinário. O sistema de pontuação utilizado foi o de Hülsmann et al. (1997). Os resultados foram analisados pelo programa Bioestat e mostraram uma distribuição não-paramétrica pelo teste D\'Agostino. Por esta razão, para a análise dois a dois entre os grupos foi utilizado o teste de Mann-Whitney. Na análise intra-grupo, o teste de Kruskal-Wallis foi o escolhido, com complementação do teste de Dunn quando necessário. Os dados obtidos mostraram que o sistema TF Adaptive removeu mais dentina quando todo canal foi considerado (p<0,05). Entretanto, não houve diferença estatisticamente significante quanto à porcentagem de superfície não preparada em ambos os sistemas (p>0.05). Em relação ao acúmulo de debris dentinários e redução de debris dentinários pós-irrigação ultrassônica passiva, também não houve diferença entre os grupos (p>0.05). A IUAI promoveu redução significativa de debris dentinários, à exceção do terço apical. Qualitativamente, no MEV, observou-se magma dentinário não-homogêneo cobrindo a parede do canal, especialmente no grupo Reciproc. Conclui-se que nenhum sistema conseguiu preparar completamente as paredes de canal radicular oval longo e que a IUAI reduz os debris dentinários na ordem de 60-70%. Na análise do MEV, pode-se observar que não houve correlação entre a redução de debris dentinários e o magma dentinário. / The action of the endodontic instrument leads to hard-tissue debris. The main goal of the irrigation procedures is its removal. Nevertheless, depending on its kinematics the rotary file can contribute to the removal of hard-tissue debris. The TF Adaptive system (SybronEndo, Orange, CA) combines the rotary and reciprocating movement and the Reciproc system (VDW< Munique, Alemanha) with pure reciprocating movement, removes significant quantities of dentin of the canal wall with more difficulties of pulling out hard-tissue debris. The aims of this study were, using microcomputed tomography: evaluate preparation and quantify hard-tissue debris reduction in long-oval canals comparing TF Adaptive and Reciproc systems; evaluate the effect of passive ultrassonic irrigation (PUI) after preparation with the proposed groups in the accumulation of hard-tissue debris; and correlate the presence of hardtissue debris with smear layer evaluated by scanning eléctron microscope (SEM). Distal canals of thirty-eight lower molars were selected and divided in two groups: G1 - TFA (n=19) and G2 - RC (n=19). Each specimen was scanned three times: preand post-instrumentation of the root canal; and post-passive ultrassonic irrigation. After reconstruction of the scanned images, a co-registration was done with DataViewer. The softwares CTan and CTvol were used for binarization of the objects of interest, morphometrics alterations and reconstruction of tridimensional models and hard-tissue debris models. Volume increasing, non-prepared surfaces and hardtissue debris after instrumentation were measured. After final irrigation, the root canals were clived in halves at the apical third and analyzed by scanning eléctron microscope related to smear layer. The score system described by Hülsmann et al. (1997) was used. The results showed a non-parametric distribution by D\'Agostino test. For that reason, Mann-Whitney test was used to compare the experimental groups and the Kruskall-Wallis test to compare intra-group performance. Dunn test determined which sample was different. Data shows that TF Adaptive removed more dentin in the whole root canal (p<0,05). There was no significant statistic difference comparing non-prepared surfaces between the systems (p>0,05). Related to hardtissue debris and reduction of hard-tissue debris after passive ultrassonic irrigation, there was no significant difference between TF Adaptive and Reciproc. The PUI reduced significantly hard-tissue debris, except for the apical third. The scanning electron microscope (SEM) showed a non-homogeneous smear layer covering the canal dentin walls, specially the Reciproc group. It was concluded that no system completely prepared the dentin walls of long-oval root canals and the passive ultrassonic irrigation decreased hard-tissue debris about 60-70% in the evaluated systems. SEM analysis reported no correlation between hard-tissue reduction and smear layer.
143

Estudo dos parâmetros operacionais de uma célula a combustível de glicerol direto utilizando uma membrana de polibencimidazol impregnada com ácido fosfórico (PBI/H3PO4) ou 1-hexil-3-metilimidazol trifluorometanosulfo / Study of the operating parameters of a direct glycerol fuel cell using a polibenzimidazole membrane impregnated with phosphoric acid (PBI/H3PO4) or 1-hexyl-3-methylimidaolium trifluoromethanesulfonate (PBI/HMI-Tf)

Barrientos, Wilner Valenzuela 16 July 2015 (has links)
Com o aumento da população mundial, o desenvolvimento de novas fontes e conversores de energia tornou-se uma necessidade. As células a combustível mostram-se como uma alternativa viável devido principalmente a duas razões, sua alta eficiência e a utilização de combustíveis renováveis. No presente trabalho se estuda a influência da temperatura de operação e o conteúdo de álcali no combustível sobre a densidade de potencia para uma célula a combustível de glicerol direto. Como combustível foi utilizado uma solução de glicerol:KOH (1M:xM, x=0, 1, 3, 5), como membranas foram utilizados filmes de polibencimidazol impregnado com ácido fosfórico (PBI/H3PO4, relação molar 1:11) ou 1-hexil-3-metilimidazol trifluorometanosulfonato (PBI/HMI-Tf relação molar 1:1.5), e finalmente, nano partículas de Pt suportadas em carbono (60% w/w) como catalizador no ânodo e no cátodo. Em geral, o incremento da temperatura e conteúdo de álcali no combustível mostra um efeito favorável na densidade de potencia do sistema. Numa célula a combustível unitária de glicerol direto utilizando membranas de PBI/ H3PO4 e PBI/HMI-Tf foram obtidas densidades de potencia de 0.54mW.cm-2 a 175°C e 0.599mW.cm-2 a 130°C, respectivamente, para uma solução de glicerol de (1M); enquanto que, para uma solução com um conteúdo maior de álcali, glicerol:KOH (1M:5M), foram obtidas densidades de potencia maiores, 44.1mW.cm-2 a 175°C e 29mW.cm-2 a 130°C, respectivamente. O efeito combinado do incremento da temperatura e concentração de álcali no combustível mostra um efeito maior em relação ao efeito só da temperatura. / With the increasing world population, the development of new energy sources or energy converters has become a necessity. Fuel cells show up as a viable alternative due mainly to two reasons, their high efficiency and the use of renewable fuels. In the present work we study the influence of operating temperature and alkali content in the fuel on the power density for a direct glycerol fuel cell. A glycerol:KOH (1M: xM, x = 0, 1, 3, 5) solution was used as fuels, as membranes were used polibencimidazol films impregnated with phosphoric acid (PBI/H3PO4, molar ratio of 1:11) or 1-hexyl-3-methylimidazolium trifluoromethanesulfonate (PBI/HMI-Tf), and finally, Pt nanoparticles supported on carbon (60% w / w) as catalyst in the anode and cathode. In general, increasing the temperature and alkali content in the fuel shows a favorable effect in the system power density. In a direct glycerol fuel cell using PBI/H3PO4 and PBI /HMI-Tf membranes were obtained power density of 0.54mW.cm-2 at 175°C and 0.599mW.cm-2 at 130°C, respectively, for a 1M glycerol solution; while for a glycerol solution with a higher content of alkali, glycerol:KOH (1M: 5M), were obtained higher power densities, 44.1mW.cm-2 at 175 ° C and 29mW.cm-2 at 130 ° C, respectively. The combined effect of increased temperature and alkali concentration in the fuel shows a greater effect compared to the effect of temperature only.
144

Automatisk extraktion av nyckelord ur ett kundforum / Automatic keyword extraction from a customer forum

Ekman, Sara January 2018 (has links)
Konversationerna i ett kundforum rör sig över olika ämnen och språket är inkonsekvent. Texterna uppfyller inte de krav som brukar ställas på material inför automatisk nyckelordsextraktion. Uppsatsens undersöker hur nyckelord automatiskt kan extraheras ur ett kundforum trots dessa svårigheter. Fokus i undersökningen ligger på tre aspekter av nyckelordsextraktion. Den första faktorn rör hur den etablerade nyckelordsextraktionsmetoden TF*IDF presterar jämfört med fyra metoder som skapas med hänsyn till materialets ovanliga struktur. Nästa faktor som testas är om olika sätt att räkna ordfrekvens påverkar resultatet. Den tredje faktorn är hur metoderna presterar om de endast använder inläggen, rubrikerna eller båda texttyperna i sina extraktioner. Icke-parametriska test användes för utvärdering av extraktionerna. Ett antal Friedmans test visar att metoderna i några fall skiljer sig åt gällande förmåga att identifiera relevanta nyckelord. I post-hoc-test mellan de högst presterande metoderna ses en av de nya metoderna i ett fall prestera signifikant bättre än de andra nya metoderna men inte bättre än TF*IDF. Ingen skillnad hittades mellan användning av olika texttyper eller sätt att räkna ordfrekvens. För framtida forskning rekommenderas reliabilitetstest av manuellt annoterade nyckelord. Ett större stickprov bör användas än det i aktuell studie och olika förslag ges för att förbättra rättning av extraherade nyckelord. / Conversations in a customer forum span across different topics and the language is inconsistent. The text type do not meet the demands for automatic keyword extraction. This essay examines how keywords can be automatically extracted despite these difficulties. Focus in the study are three areas of keyword extraction. The first factor regards how the established keyword extraction method TF*IDF performs compared to four methods created with the unusual material in mind. The next factor deals with different ways to calculate word frequency. The third factor regards if the methods use only posts, only titles, or both in their extractions. Non-parametric tests were conducted to evaluate the extractions. A number of Friedman's tests shows the methods in some cases differ in their ability to identify relevant keywords. In post-hoc tests performed between the highest performing methods, one of the new methods perform significantly better than the other new methods but not better than TF*IDF. No difference was found between the use of different text types or ways to calculate word frequency. For future research reliability test of manually annotated keywords is recommended. A larger sample size should be used than in the current study and further suggestions are given to improve the results of keyword extractions.
145

Estudo dos parâmetros operacionais de uma célula a combustível de glicerol direto utilizando uma membrana de polibencimidazol impregnada com ácido fosfórico (PBI/H3PO4) ou 1-hexil-3-metilimidazol trifluorometanosulfo / Study of the operating parameters of a direct glycerol fuel cell using a polibenzimidazole membrane impregnated with phosphoric acid (PBI/H3PO4) or 1-hexyl-3-methylimidaolium trifluoromethanesulfonate (PBI/HMI-Tf)

Wilner Valenzuela Barrientos 16 July 2015 (has links)
Com o aumento da população mundial, o desenvolvimento de novas fontes e conversores de energia tornou-se uma necessidade. As células a combustível mostram-se como uma alternativa viável devido principalmente a duas razões, sua alta eficiência e a utilização de combustíveis renováveis. No presente trabalho se estuda a influência da temperatura de operação e o conteúdo de álcali no combustível sobre a densidade de potencia para uma célula a combustível de glicerol direto. Como combustível foi utilizado uma solução de glicerol:KOH (1M:xM, x=0, 1, 3, 5), como membranas foram utilizados filmes de polibencimidazol impregnado com ácido fosfórico (PBI/H3PO4, relação molar 1:11) ou 1-hexil-3-metilimidazol trifluorometanosulfonato (PBI/HMI-Tf relação molar 1:1.5), e finalmente, nano partículas de Pt suportadas em carbono (60% w/w) como catalizador no ânodo e no cátodo. Em geral, o incremento da temperatura e conteúdo de álcali no combustível mostra um efeito favorável na densidade de potencia do sistema. Numa célula a combustível unitária de glicerol direto utilizando membranas de PBI/ H3PO4 e PBI/HMI-Tf foram obtidas densidades de potencia de 0.54mW.cm-2 a 175°C e 0.599mW.cm-2 a 130°C, respectivamente, para uma solução de glicerol de (1M); enquanto que, para uma solução com um conteúdo maior de álcali, glicerol:KOH (1M:5M), foram obtidas densidades de potencia maiores, 44.1mW.cm-2 a 175°C e 29mW.cm-2 a 130°C, respectivamente. O efeito combinado do incremento da temperatura e concentração de álcali no combustível mostra um efeito maior em relação ao efeito só da temperatura. / With the increasing world population, the development of new energy sources or energy converters has become a necessity. Fuel cells show up as a viable alternative due mainly to two reasons, their high efficiency and the use of renewable fuels. In the present work we study the influence of operating temperature and alkali content in the fuel on the power density for a direct glycerol fuel cell. A glycerol:KOH (1M: xM, x = 0, 1, 3, 5) solution was used as fuels, as membranes were used polibencimidazol films impregnated with phosphoric acid (PBI/H3PO4, molar ratio of 1:11) or 1-hexyl-3-methylimidazolium trifluoromethanesulfonate (PBI/HMI-Tf), and finally, Pt nanoparticles supported on carbon (60% w / w) as catalyst in the anode and cathode. In general, increasing the temperature and alkali content in the fuel shows a favorable effect in the system power density. In a direct glycerol fuel cell using PBI/H3PO4 and PBI /HMI-Tf membranes were obtained power density of 0.54mW.cm-2 at 175°C and 0.599mW.cm-2 at 130°C, respectively, for a 1M glycerol solution; while for a glycerol solution with a higher content of alkali, glycerol:KOH (1M: 5M), were obtained higher power densities, 44.1mW.cm-2 at 175 ° C and 29mW.cm-2 at 130 ° C, respectively. The combined effect of increased temperature and alkali concentration in the fuel shows a greater effect compared to the effect of temperature only.
146

Avaliação do preparo e da limpeza de canais radiculares ovais longos comparando os sistemas TF Adaptive e Reciproc, por meio da microtomografia computadorizada e da microscopia eletrônica de varredura / Evaluation of preparation and cleaning og long-oval root canals comparing TF Adaptive and Reciproc systems, using micro-computed tomography and scanning eléctron microscope

Sandra Soares Kühne Busquim 23 May 2018 (has links)
A ação do instrumento endodôntico nas paredes do canal leva à formação de debris dentinários. Sua remoção é principalmente função das manobras de irrigação. Entretanto, a ação do instrumento pode facilitar ou não o acúmulo dos debris dentinários, principalmente em função da sua cinemática. O sistema Twisted File (TF) Adaptive (SybronEndo, Orange, CA) combina o movimento rotatório contínuo e reciprocante e o sistema Reciproc (VDW, Munique, Alemanha) realiza o preparo do canal radicular pelo movimento reciprocante puro. Os objetivos deste estudo foram, utilizando a microtomografia computadorizada (micro-CT): avaliar, ex vivo, o preparo (aumento de volume e superficies não preparadas) e quantificar o acúmulo de debris dentinários em canais ovais longos comparando os sistemas TF Adaptive e Reciproc; avaliar o efeito da irrigação ultrassônica ativada intermitente (IUAT) após o preparo químico-cirúrgico com os sistemas propostos, na redução de debris dentinários; e relacionar a presença de debris dentinários com o magma dentinário avaliado por meio do microscópio eletrônico de varredura. Canais distais de trinta e oito molares inferiores foram selecionados e divididos em dois grupos: G1 - TFA (n=19) e G2 - RC (n=19). Cada espécime foi submetido a três escaneamentos: prée pós-operatório e pós-irrigação ultrassônica passiva final. Após a reconstrução das imagens resultantes dos três escaneamentos, foi feito o corregistro das mesmas com o programa DataViewer. Os programas CTAn e CTvol foram utilizados para binarização dos objetos de interesse, avaliações morfométricas e reconstrução dos modelos tridimensionais. Foram mensurados o aumento de volume do canal, as superfícies não preparadas e os debris dentinários após o preparo químico-cirúrgico. Após a irrigação final, cinco raízes de cada grupo foram clivadas longitudinalmente ao meio no terço apical e analisadas no microscópio eletrônico de varredura (MEV) quanto à presença de magma dentinário. O sistema de pontuação utilizado foi o de Hülsmann et al. (1997). Os resultados foram analisados pelo programa Bioestat e mostraram uma distribuição não-paramétrica pelo teste D\'Agostino. Por esta razão, para a análise dois a dois entre os grupos foi utilizado o teste de Mann-Whitney. Na análise intra-grupo, o teste de Kruskal-Wallis foi o escolhido, com complementação do teste de Dunn quando necessário. Os dados obtidos mostraram que o sistema TF Adaptive removeu mais dentina quando todo canal foi considerado (p<0,05). Entretanto, não houve diferença estatisticamente significante quanto à porcentagem de superfície não preparada em ambos os sistemas (p>0.05). Em relação ao acúmulo de debris dentinários e redução de debris dentinários pós-irrigação ultrassônica passiva, também não houve diferença entre os grupos (p>0.05). A IUAI promoveu redução significativa de debris dentinários, à exceção do terço apical. Qualitativamente, no MEV, observou-se magma dentinário não-homogêneo cobrindo a parede do canal, especialmente no grupo Reciproc. Conclui-se que nenhum sistema conseguiu preparar completamente as paredes de canal radicular oval longo e que a IUAI reduz os debris dentinários na ordem de 60-70%. Na análise do MEV, pode-se observar que não houve correlação entre a redução de debris dentinários e o magma dentinário. / The action of the endodontic instrument leads to hard-tissue debris. The main goal of the irrigation procedures is its removal. Nevertheless, depending on its kinematics the rotary file can contribute to the removal of hard-tissue debris. The TF Adaptive system (SybronEndo, Orange, CA) combines the rotary and reciprocating movement and the Reciproc system (VDW< Munique, Alemanha) with pure reciprocating movement, removes significant quantities of dentin of the canal wall with more difficulties of pulling out hard-tissue debris. The aims of this study were, using microcomputed tomography: evaluate preparation and quantify hard-tissue debris reduction in long-oval canals comparing TF Adaptive and Reciproc systems; evaluate the effect of passive ultrassonic irrigation (PUI) after preparation with the proposed groups in the accumulation of hard-tissue debris; and correlate the presence of hardtissue debris with smear layer evaluated by scanning eléctron microscope (SEM). Distal canals of thirty-eight lower molars were selected and divided in two groups: G1 - TFA (n=19) and G2 - RC (n=19). Each specimen was scanned three times: preand post-instrumentation of the root canal; and post-passive ultrassonic irrigation. After reconstruction of the scanned images, a co-registration was done with DataViewer. The softwares CTan and CTvol were used for binarization of the objects of interest, morphometrics alterations and reconstruction of tridimensional models and hard-tissue debris models. Volume increasing, non-prepared surfaces and hardtissue debris after instrumentation were measured. After final irrigation, the root canals were clived in halves at the apical third and analyzed by scanning eléctron microscope related to smear layer. The score system described by Hülsmann et al. (1997) was used. The results showed a non-parametric distribution by D\'Agostino test. For that reason, Mann-Whitney test was used to compare the experimental groups and the Kruskall-Wallis test to compare intra-group performance. Dunn test determined which sample was different. Data shows that TF Adaptive removed more dentin in the whole root canal (p<0,05). There was no significant statistic difference comparing non-prepared surfaces between the systems (p>0,05). Related to hardtissue debris and reduction of hard-tissue debris after passive ultrassonic irrigation, there was no significant difference between TF Adaptive and Reciproc. The PUI reduced significantly hard-tissue debris, except for the apical third. The scanning electron microscope (SEM) showed a non-homogeneous smear layer covering the canal dentin walls, specially the Reciproc group. It was concluded that no system completely prepared the dentin walls of long-oval root canals and the passive ultrassonic irrigation decreased hard-tissue debris about 60-70% in the evaluated systems. SEM analysis reported no correlation between hard-tissue reduction and smear layer.
147

Categorization of Swedish e-mails using Supervised Machine Learning / Kategorisering av svenska e-postmeddelanden med användning av övervakad maskininlärning

Mann, Anna, Höft, Olivia January 2021 (has links)
Society today is becoming more digitalized, and a common way of communication is to send e-mails. Currently, the company Auranest has a filtering method for categorizing e-mails, but the method is a few years old. The filter provides a classification of valuable e-mails for jobseekers, where employers can make contact. The company wants to know if the categorization can be performed with a different method and improved. The degree project aims to investigate whether the categorization can be proceeded with higher accuracy using machine learning. Three supervised machine learning algorithms, Naïve Bayes, Support Vector Machine (SVM), and Decision Tree, have been examined, and the algorithm with the highest results has been compared with Auranest's existing filter. Accuracy, Precision, Recall, and F1 score have been used to determine which machine learning algorithm received the highest results and in comparison, with Auranest's filter. The results showed that the supervised machine learning algorithm SVM achieved the best results in all metrics. The comparison between Auranest's existing filter and SVM showed that SVM performed better in all calculated metrics, where the accuracy showed 99.5% for SVM and 93.03% for Auranest’s filter. The comparative results showed that accuracy was the only factor that received similar results. For the other metrics, there was a noticeable difference. / Dagens samhälle blir alltmer digitaliserat och ett vanligt kommunikationssätt är att skicka e-postmeddelanden. I dagsläget har företaget Auranest ett filter för att kategorisera e-postmeddelanden men filtret är några år gammalt. Användningsområdet för filtret är att sortera ut värdefulla e-postmeddelanden för arbetssökande, där kontakt kan ske från arbetsgivare. Företaget vill veta ifall kategoriseringen kan göras med en annan metod samt förbättras. Målet med examensarbetet är att undersöka ifall filtreringen kan göras med högre träffsäkerhet med hjälp av maskininlärning. Tre övervakade maskininlärningsalgoritmer, Naïve Bayes, Support Vector Machine (SVM) och Decision Tree, har granskats och algoritmen med de högsta resultaten har jämförts med Auranests befintliga filter. Träffsäkerhet, precision, känslighet och F1-poäng har använts för att avgöra vilken maskininlärningsalgoritm som gav högst resultat sinsemellan samt i jämförelse med Auranests filter. Resultatet påvisade att den övervakade maskininlärningsmetoden SVM åstadkom de främsta resultaten i samtliga mätvärden. Jämförelsen mellan Auranests befintliga filter och SVM visade att SVM presterade bättre i alla kalkylerade mätvärden, där träffsäkerheten visade 99,5% för SVM och 93,03% för Auranests filter. De jämförande resultaten visade att träffsäkerheten var den enda faktorn som gav liknande resultat. För de övriga mätvärdena var det en märkbar skillnad.
148

@TheRealDonaldTrump’s tweets correlation with stock market volatility / @TheRealDonaldTrump's tweets korrelation med volatiliteten på aktiemarkanden

Olofsson, Isak January 2020 (has links)
The purpose of this study is to analyze if there is any tweet specific data posted by Donald Trump that has a correlation with the volatility of the stock market. If any details about the president Trump's tweets show correlation with the volatility, the goal is to find a subset of regressors with as high as possible predictability. The content of tweets is used as the base for regressors. The method which has been used is a multiple linear regression with tweet and volatility data ranging from 2010 until 2020. As a measure of volatility, the Cboe VIX has been used, and the regressors in the model have focused on the content of tweets posted by Trump using TF-IDF to evaluate the content of tweets. The results from the study imply that the chosen regressors display a small significant correlation of with an adjusted R2 = 0.4501 between Trump´s tweets and the market volatility. The findings Include 78 words with correlation to stock market volatility when part of President Trump's tweets. The stock market is a large and complex system of many unknowns, which aggravate the process of simplifying and quantifying data of only one source into a regression model with high predictability. / Syftet med denna studie är att analysera om det finns några specifika egenskaper i de tweets publicerade av Donald Trump som har en korrelation med volatiliteten på aktiemarknaden. Om egenskaper kring president Trumps tweets visar ett samband med volatiliteten är målet att hitta en delmängd av regressorer med för att beskriva sambandet med så hög signifikans som möjligt. Innehållet i tweets har varit i fokus använts som regressorer. Metoden som har använts är en multipel linjär regression med tweet och volatilitetsdata som sträcker sig från 2010 till 2020. Som ett mått på volatilitet har Cboe VIX använts, och regressorerna i modellen har fokuserat på innehållet i tweets där TF-IDF har använts för att transformera ord till numeriska värden. Resultaten från studien visar att de valda regressorerna uppvisar en liten men signifikant korrelation med en justerad R2 = 0,4501 mellan Trumps tweets och marknadens volatilitet. Resultaten inkluderar 78 ord som de när en är en del av president Trumps tweets visar en signifikant korrelation till volatiliteten på börsen. Börsen är ett stort och komplext system av många okända, som försvårar processen att förenkla och kvantifiera data från endast en källa till en regressionsmodell med hög förutsägbarhet.
149

A comparison of different methods in their ability to compare semantic similarity between articles and press releases / En jämförelse av olika metoder i deras förmåga att jämföra semantisk likhet mellan artiklar och pressmeddelanden

Andersson, Julius January 2022 (has links)
The goal of a press release is to have the information spread as widely as possible. A suitable approach to distribute the information is to target journalists who are likely to distribute the information further. Deciding which journalists to target has traditionally been performed manually without intelligent digital assistance and therefore has been a time consuming task. Machine learning can be used to assist the user by predicting a ranking of journalists based on their most semantically similar written article to the press release. The purpose of this thesis was to compare different methods in their ability to compare semantic similarity between articles and press releases when used for the task of ranking journalists. Three methods were chosen for comparison: (1.) TF-IDF together with cosine similarity, (2.) TF-IDF together with soft-cosine similarity and (3.) sentence mover’s distance (SMD) together with SBERT. Based on the proposed heuristic success metric, both TF-IDF methods outperformed the SMD method. The best performing method was TF-IDF with soft-cosine similarity. / Målet med ett pressmeddelande är att få informationen att spriddas till så många som möjligt. Ett lämpligt tillvägagångssätt för att sprida informationen är att rikta in sig på journalister som sannolikt kommer att sprida informationen vidare. Beslutet om vilka journalister man ska rikta sig till har traditionellt utförts manuellt utan intelligent digital assistans och har därför varit en tidskrävande uppgift. Maskininlärning kan användas för att hjälpa användaren genom att förutsäga en rankning av journalister baserat på deras mest semantiskt liknande skrivna artikel till pressmeddelandet. Syftet med denna uppsats var att jämföra olika metoder i deras förmåga att jämföra semantisk likhet mellan artiklar och pressmeddelanden när de används för att rangordna journalister. Tre metoder valdes för jämförelse: (1.) TF-IDF tillsammans med cosinus likhet, (2.) TF-IDF tillsammans med mjuk-cosinus likhet och (3.) sentence mover’s distance (SMD) tillsammans med SBERT. Baserat på det föreslagna heuristiska framgångsmåttet överträffade båda TF-IDF-metoderna SMD-metoden. Den bäst presterande metoden var TF-IDF med mjuk-cosinus likhet.
150

Maskininlärning för dokumentklassificering av finansielladokument med fokus på fakturor / Machine Learning for Document Classification of FinancialDocuments with Focus on Invoices

Khalid Saeed, Nawar January 2022 (has links)
Automatiserad dokumentklassificering är en process eller metod som syftar till att bearbeta ochhantera dokument i digitala former. Många företag strävar efter en textklassificeringsmetodiksom kan lösa olika problem. Ett av dessa problem är att klassificera och organisera ett stort antaldokument baserat på en uppsättning av fördefinierade kategorier.Detta examensarbete syftar till att hjälpa Medius, vilket är ett företag som arbetar med fakturaarbetsflöde, att klassificera dokumenten som behandlas i deras fakturaarbetsflöde till fakturoroch icke-fakturor. Detta har åstadkommits genom att implementera och utvärdera olika klassificeringsmetoder för maskininlärning med avseende på deras noggrannhet och effektivitet för attklassificera finansiella dokument, där endast fakturor är av intresse.I denna avhandling har två dokumentrepresentationsmetoder "Term Frequency Inverse DocumentFrequency (TF-IDF) och Doc2Vec" använts för att representera dokumenten som vektorer. Representationen syftar till att minska komplexiteten i dokumenten och göra de lättare att hantera.Dessutom har tre klassificeringsmetoder använts för att automatisera dokumentklassificeringsprocessen för fakturor. Dessa metoder var Logistic Regression, Multinomial Naïve Bayes och SupportVector Machine.Resultaten från denna avhandling visade att alla klassificeringsmetoder som använde TF-IDF, föratt representera dokumenten som vektorer, gav goda resultat i from av prestanda och noggranhet.Noggrannheten för alla tre klassificeringsmetoderna var över 90%, vilket var kravet för att dennastudie skulle anses vara lyckad. Dessutom verkade Logistic Regression att ha det lättare att klassificera dokumenten jämfört med andra metoder. Ett test på riktiga data "dokument" som flödarin i Medius fakturaarbetsflöde visade att Logistic Regression lyckades att korrekt klassificeranästan 96% av dokumenten.Avslutningsvis, fastställdes Logistic Regression tillsammans med TF-IDF som de övergripandeoch mest lämpliga metoderna att klara av problmet om dokumentklassficering. Dessvärre, kundeDoc2Vec inte ge ett bra resultat p.g.a. datamängden inte var anpassad och tillräcklig för attmetoden skulle fungera bra. / Automated document classification is an essential technique that aims to process and managedocuments in digital forms. Many companies strive for a text classification methodology thatcan solve a plethora of problems. One of these problems is classifying and organizing a massiveamount of documents based on a set of predefined categories.This thesis aims to help Medius, a company that works with invoice workflow, to classify theirdocuments into invoices and non-invoices. This has been accomplished by implementing andevaluating various machine learning classification methods in terms of their accuracy and efficiencyfor the task of financial document classification, where only invoices are of interest. Furthermore,the necessary pre-processing steps for achieving good performance are considered when evaluatingthe mentioned classification methods.In this study, two document representation methods "Term Frequency Inverse Document Frequency (TF-IDF) and Doc2Vec" were used to represent the documents as fixed-length vectors.The representation aims to reduce the complexity of the documents and make them easier tohandle. In addition, three classification methods have been used to automate the document classification process for invoices. These methods were Logistic Regression, Multinomial Naïve Bayesand Support Vector Machine.The results from this thesis indicate that all classification methods used TF-IDF, to represent thedocuments as vectors, give high performance and accuracy. The accuracy of all three classificationmethods is over 90%, which is the prerequisite for the success of this study. Moreover, LogisticRegression appears to cope with this task very easily, since it classifies the documents moreefficiently compared to the other methods. A test of real data flowing into Medius’ invoiceworkflow shows that Logistic Regression is able to correctly classify up to 96% of the data.In conclusion, the Logistic Regression together with TF-IDF is determined to be the overall mostappropriate method out of the other tested methods. In addition, Doc2Vec suffers to providea good result because the data set is not customized and sufficient for the method to workwell.

Page generated in 0.0444 seconds