Spelling suggestions: "subject:" indexing"" "subject:" indexings""
191 |
Search engine exclusion policies: implications on indexing e-commerce websitesMbikiwa, Fernie Neo January 2005 (has links)
THESIS
Submitted in fulfilment
of the requirements for the degree
MAGISTER TECHNOLOGIAE
in
INFORMATION TECHNOLOGY
in the
FACULTY OF BUSINESS INFORMATICS
at the
CAPE PENINSULA UNIVERSITY OF TECHNOLOGY
2005 / The aim of this research was to determine how search engine exclusion
policies and spam affect the indexing of e-Commerce websites. The Internet
has brought along new ways of doing business. The unexpected growth of
the World Wide Web made it essential for firms to adopt e-commerce as a
means of obtaining a competitive edge. The introduction of e-commerce in
turn facilitated the breaking down of physical barriers that were evident in
traditional business operations.
It is important for e-commerce websites to attract visitors, otherwise the
website content is irrelevant. Websites can be accessed through the use of
search engines, and it is estimated that 88% of users start with search
engines when completing tasks on the web. This has resulted in web
designers aiming to have their websites appear in the top ten search engine
result list, as a high placement of websites in search engines is one of the
strongest contributors to a commercial website’s success.
To achieve such high rankings, web designers often adopt Search Engine
Optimization (SEO) practices. Some of these practices invariably culminate in
undeserving websites achieving top rankings. It is not clear how these SEO
practices are viewed by search engines, as some practices that are deemed
unacceptable by certain search engines are accepted by others. Furthermore,
there are no clear standards for assessing what is considered good or bad
SEO practices. This confuses web designers in determining what is spam,
resulting in the amount of search engine spam having increased over time,
impacting adversely on search engine results.
From the literature reviewed in this thesis, as well as the policies of five top
search engines (Google, Yahoo!, AskJeeves, AltaVista, and Ananzi), this
author was able to compile a list of what is generally considered as spam.
Furthermore, 47 e-commerce websites were analysed to determine if they
contain any form of spam. The five major search engines indexed some of
these websites. This enabled the author to determine to what extent search
engines adhere to their policies. This analysis returned two major findings. A
small amount of websites contained spam, and from the pre-compiled list of
spam tactics, only two were identified in the websites, namely keyword
stuffing and page redirects. Of the total number of websites analysed, it was
found that 21.3% of the websites contained spam.
From these findings, the research contained in this thesis concluded that
search engines adhere to their own policies, but lack stringent controls for the
majority of websites that contained spam, and were still listed by search
engines. In this study, the author only analysed e-commerce websites, and
cannot therefore generalise the results to other websites outside ecommerce.
|
192 |
Os aspectos sociocognitivos para a indexação de fotografiasFELIPE, Carla Beatriz Marques 15 February 2016 (has links)
Submitted by Fabio Sobreira Campos da Costa (fabio.sobreira@ufpe.br) on 2016-08-04T13:21:33Z
No. of bitstreams: 2
license_rdf: 1232 bytes, checksum: 66e71c371cc565284e70f40736c94386 (MD5)
dissertaçao.doc carla.pdf: 1821084 bytes, checksum: ab6482337563c409c6df35c23834957c (MD5) / Made available in DSpace on 2016-08-04T13:21:33Z (GMT). No. of bitstreams: 2
license_rdf: 1232 bytes, checksum: 66e71c371cc565284e70f40736c94386 (MD5)
dissertaçao.doc carla.pdf: 1821084 bytes, checksum: ab6482337563c409c6df35c23834957c (MD5)
Previous issue date: 2016-02-15 / CAPEs / Esta dissertação aborda a história da fotografia apresentando a sua importância para a sociedade desde a sua invenção e a descreve como documento e suas variadas formas de disseminação da informação. Em consequência, explica a relação entre fotografia e memória, destacando a primeira enquanto um dispositivo de memória institucional. Nesse cenário, foi abordada a indexação de fotografias, bem como o seu processo de execução e os aspectos linguístico, lógico e cognitivo a ela envolvidos. Por conta disso, foi realizada uma pesquisa exploratória em duas instituições que possuíam acervo fotográfico, cujo objetivo geral foi analisar os aspectos sociocognitivos inerentes ao procedimento de indexação de fotografias e, de cada instituição, participaram dois bibliotecários indexadores. Esses profissionais indexaram quatro fotografias, sendo duas de cada instituição participante. O instrumento de coleta de dados utilizado foi o Protocolo Verbal Individual. Os principais resultados mostraram que por meio da cognição que o indexador faz a leitura das fotografias e analisam qual o tema da foto. Para isso, faz uso das memórias de curto e longo prazo e também da percepção sensorial. Os aspectos sociocognitivos influenciam diretamente o processo de indexação, pois são estes aspectos que regem o modo como os bibliotecários fazem a pesquisa para coletar informações sobre o acervo. Se utilizam ou não um vocabulário controlado para a tradução dos termos. Para a análise de assuntos os bibliotecários analisam as fotografias de maneiras muito parecidas com as metodologias sugeridas para estes procedimentos, sempre partindo do geral para o específico. Como última categoria analisada temos os descritores. Estes sofrem influência direta das categorias anteriores. Para a escolha dos descritores, entre todas as metodologias apresentadas no trabalho, como a de Rodrigues, Shatford, Manini, Panofsky e Bléry, a metodologia de Bléry (1979) é utilizada, de fato, pelo menos em uma instituição pesquisada. / This dissertation approaches the history of photography presenting their importance to society since its invention and it is described as a document and its various forms of information dissemination. As a result, it explains the relationship between photography and memory, highlighting the first one as an institutional memory device. In this scenario, the indexing of photographs was studied, as well as its implementation process and linguistic, logical and cognitive aspects involved with it. Because of this, an exploratory survey was conducted in two institutions that had photographic collection, whose main objective was to analyze the socio-cognitive aspects of the photographs indexing procedure. In each institution two indexers librarians attended it. These professionals indexed four photos, two of each participating institution. The data collection instrument used was the Single Verbal Protocol. The main results showed that through the cognition an indexer reads the photographs and analyzes what is its subject. To be successful, he uses short and long term memory and also sensory perception. Social cognitive aspects influence directly in indexing process, as these are aspects that govern how librarians do the research to gather information about the collection. If they should use or not a controlled vocabulary for translation of terms. For topics analysis, librarians analyze the photos in very similar ways to the methodologies suggested for these procedures, always from the general to the specific. As a last category we analyzed the descriptors. These ones suffer direct influence of the previous categories. For the choice of descriptors, the Bléry methodology (1979) is used, in fact, at least in one research institution.
|
193 |
O uso de linguagem documentária no âmbito da política de indexação em biblioteca escolar /Guim, Vera Lúcia Ribeiro. January 2016 (has links)
Orientadora: Mariângela Spotti Lopes Fujita / Resumo: Considerando a representação e organização do conhecimento por meio do processo de indexação e as diretrizes e normas estabelecidas pela política de indexação, esta pesquisa apresenta uma discussão sobre o uso das linguagens documentárias tendo como alvo as bibliotecas escolares. A complexidade que envolve as bibliotecas escolares tem gerado dúvidas recorrentes que, por sua vez, afetam os interesses dos bibliotecários que buscam efetivar as linguagens documentárias de forma sistematizada e eficiente para todo público usuário. Dessa forma, essa pesquisa teve como proposição avaliar o uso da linguagem documentária no tratamento temático em biblioteca escolar. Teve como objetivo geral contribuir com estudos acerca do uso de linguagens documentárias em ambiente escolar e como objetivos específicos realizar um estudo teórico sobre uso de linguagem documentária no contexto da biblioteca escolar da rede SIBESC em Garça, SP; Realizar observação com pesquisa participante do uso de linguagem documentária em biblioteca escolar na perspectiva do indexador e por fim avaliar o uso comparado de linguagem documentária em biblioteca escolar. Para tanto realizou revisão de literatura nacional e internacional sobre a temática do uso de linguagem documentária, politica de indexação bem como estudos sobre as bibliotecas escolares. A metodologia utilizada avaliou por meio da observação com pesquisa participante e técnica introspectiva do protocolo verbal o uso da linguagem documentária na institui... (Resumo completo, clicar acesso eletrônico abaixo) / Abstract: Considering the representation and organization of knowledge through the indexing process and the guidelines and standards set by the indexing policy, this research presents a discussion on the use of documentary languages targeting school libraries. The complexity involved in school libraries has generated recurring doubts that, in turn, affect the interests of librarians who seek to carry out the documentary languages in a systematic and efficient way to all public users. Thus, this research was to propose to evaluate the use of the documentary language in thematic treatment in the school library. We aimed to contribute to studies on the use of documentary languages in the school environment and specific objectives conduct a theoretical study on the use of indexing language in the context of school library SIBESC network Heron, SP; Perform observation with participatory research the use of documentary language school library on the index perspective and finally evaluate the compared use of documentary language school library. For both held national and international literature review on the topic of using indexing language, indexing policy and studies of school libraries. The methodology assessed through observation with participatory research and introspective technique of verbal protocol using the documentary language in the institution as well as its comparison with the terminology of the National Library on the index perspective. Results and discussion highlights the us... (Complete abstract click electronic access below) / Mestre
|
194 |
An interdisciplinary comparison of master's thesis abstractsChance, Patricia Belle 01 January 2005 (has links)
Explores different ways that organizational moves and other discourse elements such as hedging are reflected in graduate student research abstracts for theses, dissertations and research articles. Master's thesis abstracts from five disciplines at California State University, San Bernardino were analyzed. Rhetorical conventions in these texts that reflect the epistemological and social expectations of the writers' academic communities were explored. Results indicate that these abstracts use a variety of hedging patterns and many of the moves that have been described for published research articles.
|
195 |
Using Information Retrieval to Improve Integration TestingAlazzam, Iyad January 2012 (has links)
Software testing is an important factor of the software development process. Integration testing is an important and expensive level of the software testing process. Unfortunately, since the developers have limited time to perform integration testing and debugging and integration testing becomes very hard as the combinations grow in size, the chain of calls from one module to another grow in number, length, and complexity. This research is about providing new methodology for integration testing to reduce the number of test cases needed to a significant degree while returning as much of its effectiveness as possible. The proposed approach shows the best order in which to integrate the classes currently available for integration and the external method calls that should be tested and in their order for maximum effectiveness. Our approach limits the number of integration test cases. The integration test cases number depends mainly on the dependency among modules and on the number of the integrated classes in the application. The dependency among modules is determined by using an information retrieval technique called Latent Semantic Indexing (LSI). In addition, this research extends the mutation testing for use in integration testing as a method to evaluate the effectiveness of the integration testing process. We have developed a set of integration mutation operators to support development of integration mutation testing. We have conducted experiments based on ten Java applications. To evaluate the proposed methodology, we have created mutants using new mutation operators that exercise the integration testing. Our experiments show that the test cases killed more than 60% of the created mutants.
|
196 |
Scalable Frequent Subgraph MiningAbdelhamid, Ehab 19 June 2017 (has links)
A graph is a data structure that contains a set of nodes and a set of edges connecting these nodes. Nodes represent objects while edges model relationships among these objects. Graphs are used in various domains due to their ability to model complex relations among several objects.
Given an input graph, the Frequent Subgraph Mining (FSM) task finds all subgraphs with frequencies exceeding a given threshold. FSM is crucial for graph analysis, and it is an essential building block in a variety of applications, such as graph clustering and indexing. FSM is computationally expensive, and its existing solutions are extremely slow. Consequently, these solutions are incapable of mining modern large graphs. This slowness is caused by the underlying approaches of these solutions which require finding and storing an excessive amount of subgraph matches.
This dissertation proposes a scalable solution for FSM that avoids the limitations of previous work. This solution is composed of four components. The first component is a single-threaded technique which, for each candidate subgraph, needs to find only a minimal number of matches. The second component is a scalable parallel FSM technique that utilizes a novel two-phase approach. The first phase quickly builds an approximate search space, which is then used by the second phase to optimize and balance the workload of the FSM task. The third component focuses on accelerating frequency evaluation, which is a critical step in FSM. To do so, a machine learning model is employed to predict the type of each graph node, and accordingly, an optimized method is selected to evaluate that node. The fourth component focuses on mining dynamic graphs, such as social networks. To this end, an incremental index is maintained during the dynamic updates. Only this index is processed and updated for the majority of graph updates. Consequently, search space is significantly pruned and efficiency is improved.
The empirical evaluation shows that the proposed components significantly outperform existing solutions, scale to a large number of processors and process graphs that previous techniques cannot handle, such as large and dynamic graphs.
|
197 |
Toward an Effective Automated Tracing ProcessMahmoud, Anas Mohammad 17 May 2014 (has links)
Traceability is defined as the ability to establish, record, and maintain dependency relations among various software artifacts in a software system, in both a forwards and backwards direction, throughout the multiple phases of the project’s life cycle. The availability of traceability information has been proven vital to several software engineering activities such as program comprehension, impact analysis, feature location, software reuse, and verification and validation (V&V). The research on automated software traceability has noticeably advanced in the past few years. Various methodologies and tools have been proposed in the literature to provide automatic support for establishing and maintaining traceability information in software systems. This movement is motivated by the increasing attention traceability has been receiving as a critical element of any rigorous software development process. However, despite these major advances, traceability implementation and use is still not pervasive in industry. In particular, traceability tools are still far from achieving performance levels that are adequate for practical applications. Such low levels of accuracy require software engineers working with traceability tools to spend a considerable amount of their time verifying the generated traceability information, a process that is often described as tedious, exhaustive, and error-prone. Motivated by these observations, and building upon a growing body of work in this area, in this dissertation we explore several research directions related to enhancing the performance of automated tracing tools and techniques. In particular, our work addresses several issues related to the various aspects of the IR-based automated tracing process, including trace link retrieval, performance enhancement, and the role of the human in the process. Our main objective is to achieve performance levels, in terms of accuracy, efficiency, and usability, that are adequate for practical applications, and ultimately to accomplish a successful technology transfer from research to industry.
|
198 |
Efficient Skyline Community Discovery in Large NetworksAkber, Mohammad Ali 30 August 2022 (has links)
Every entity in the real world can be described uniquely by it’s attributes. It is possible to rank similar entities based on these attributes, i.e. a professor can be ranked by his/her number of publications, citations etc. A community is formed by a group of connected entities. Individual ranking of an entity plays an important role in the quality of a community. Skyline community in a network represents the highest ranked communities in the network. But how do we define this ranking? Ranking system in some model considers only a single attribute [16], whereas the other [15] [23] considers multiple attributes. Intuitively multiple attributes represent a community better and produce good results. We propose a novel community discovery model, which considers multiple attribute when ranking the community and is efficient in terms of computation time and result size. We use a progressive (can produce re- sults gradually without depending on the future processing) algorithm to calculate the community in an order such that a community is guaranteed not to be dominated by those generated after it. And to verify the dominance relationship between two communities, we came up with a range based comparison where the dominance rela- tionship is decided by the set of nodes each group dominates. If domination list of a group is a subset of another group, we say the second group dominates the first. Because a groups domination list contains it’s member along with the nodes they dominate. So in the example, the second group dominates every node of the first group. / Graduate
|
199 |
GRAPH PATTERN MATCHING, APPROXIMATE MATCHING AND DYNAMIC GRAPH INDEXINGJin, Wei 30 August 2011 (has links)
No description available.
|
200 |
Improving Query Performance through Application-Driven Processing and RetrievalGibas, Michael A. 11 September 2008 (has links)
No description available.
|
Page generated in 0.0399 seconds