710 Hits in 3.5 sec

Towards Practical Visual Search Engine within Elasticsearch [article]

Cun Mu, Jun Zhao, Guang Yang, Jing Zhang, Zheng Yan
2019 arXiv   pre-print
By doing that, we can utilize Elasticsearch to efficiently retrieve similar images based on similarities within encoded string tokens.  ...  In this paper, we describe our end-to-end content-based image retrieval system built upon Elasticsearch, a well-known and popular textual search engine.  ...  feature vectors into RAM, one of the most expensive resources in large-scale computations.  ... 
arXiv:1806.08896v3 fatcat:7clzb6unybbgtlgexz6nd3ck6y

SIR: Similar Image Retrieval for Product Search in E-Commerce [article]

Theban Stanley, Nihar Vanjara, Yanxin Pan, Ekaterina Pirogova, Swagata Chakraborty, Abon Chaudhuri
2020 arXiv   pre-print
It can be addressed by building supervised models to tag product images with labels representing themes and later retrieving them by labels.  ...  We present a similar image retrieval (SIR) platform that is used to quickly discover visually similar products in a catalog of millions.  ...  On the other hand, the older, mature search systems like Elasticsearch and Solr come with built-in support for scaling text-based searches to millions of documents.  ... 
arXiv:2009.13836v1 fatcat:dp2xsjoadzhk3behxeejw7rjmi

A Scalable and Semantic Data as a Service Marketplace for Enhancing Cloud-Based Applications

Evangelos Psomakelis, Anastasios Nikolakopoulos, Achilleas Marinakis, Alexandros Psychas, Vrettos Moulos, Theodora Varvarigou, Andreas Christou
2020 Future Internet  
We show that the proposed system outperforms the classic ElasticSearch queries in data discovery use cases, providing more accurate results.  ...  Furthermore, the semantic enhancement of the process adds extra results which extend the user query with a more abstract definition to each notion.  ...  Thus, we continued our experiment with the GA10 algorithm in order to provide ElasticSearch with the most favorable conditions.  ... 
doi:10.3390/fi12050077 fatcat:akzvlxw3abho3mrdiptgmyteei

Semantic Vector Encoding and Similarity Search Using Fulltext Search Engines

Jan Rygl, Jan Pomikálek, Radim Řehůřek, Michal Růžička, Vít Novotný, Petr Sojka
2017 Proceedings of the 2nd Workshop on Representation Learning for NLP  
This opens up exciting avenues for major efficiency gains, along with simpler deployment, scaling and monitoring.  ...  The end result is a fast and scalable vector database with a tunable tradeoff between vector search performance and quality, backed by a standard fulltext engine such as Elasticsearch.  ...  The size of the points indicates the number of retrieved results: large = Elasticsearch page size 320, medium = page size 80, small = 20.  ... 
doi:10.18653/v1/w17-2611 dblp:conf/rep4nlp/RyglPRRNS17 fatcat:i36wby2b4zcxllktzhde5yaqqu

Semantic Vector Encoding and Similarity Search Using Fulltext Search Engines [article]

Jan Rygl, Jan Pomikálek, Radim Řehůřek, Michal Růžička, Vít Novotný, Petr Sojka
2017 arXiv   pre-print
This opens up exciting avenues for major efficiency gains, along with simpler deployment, scaling and monitoring.  ...  The end result is a fast and scalable vector database with a tunable trade-off between vector search performance and quality, backed by a standard fulltext engine such as Elasticsearch.  ...  The size of the points indicates the number of retrieved results: large = Elasticsearch page size 320, medium = page size 80, small = 20.  ... 
arXiv:1706.00957v1 fatcat:y7aw7uwyuvdi3dotvaqv2gijdy

SmarT: Machine Learning Approach for Efficient Filtering and Retrieval of Spatial and Temporal Data in Big Data

Sávio S. T. De Oliveira, Vagner J. S. Rodrigues, Wellington S. Martins
2021 Journal of Information and Data Management  
In a detailed experimental evaluation, considering the Apache Spark, Elasticsearch, and SciDB big data platforms, the response time decreased up to 22% when using SmarT.  ...  SpatialSpark [You et al. 2015] , for example, aims to provide efficient spatial operations using Apache Spark to process large scale spatial join operations.  ...  for filtering and retrieving time series data were not as significant as Elasticsearch.  ... 
doi:10.5753/jidm.2021.1951 fatcat:jrw6szksnbhgxmmo42jg5uzpn4

Using Inverted Index for Fingerprint Search

Johnny Marcos S. Soares, Luciano Barbosa, Paulo Antonio Leal Rego, Regis Pires Magalhães, Jose Antônio F. de Macêdo
2021 Journal of Information and Data Management  
In this work, we devise a solution that applies traditional inverted index, widely used in textual information retrieval, for fingerprint search.  ...  With the increase in fingerprint data, indexing techniques are essential to perform an efficient search.  ...  search similar to Approach 1 to scale for large datasets.  ... 
doi:10.5753/jidm.2021.1918 fatcat:lomgqt4dgvbftf7wg4z6xflh5q

Network Security Log Analysis System Based on ELK

Chun-jing LU, Heng ZENG, Jian-yi LIU, Ru ZHANG, Yuan-kun CHEN, Yuan-gang YAO
2017 DEStech Transactions on Computer Science and Engineering  
The results show that the proposed method enhances the system's functions of crawling and analyzing, especially the log retrieval ability, and combines with the large data storage technology, improving  ...  Aiming at the practical problems of network security log management analysis system, with the log management and analysis system as the main object of the research, combined with the problems of log system  ...  On the VM 2-Indexer is equipped with the indexer version of Redis, Logstash and an ElasticSearch node to handle the processing and retrieval of the log, and realize the message queue function.  ... 
doi:10.12783/dtcse/cece2017/14597 fatcat:4z5jqv7bjbgbbdo3e44fnawiii

Supporting Knowledge Re-Use with Effective Searches of Related Engineering Documents - A Comparison of Search Engine and Natural Language Processing-Based Algorithms

Ivar Örn Arnarsson, Otto Frost, Emil Gustavsson, Daniel Stenholm, Mats Jirstrand, Johan Malmqvist
2019 Proceedings of the International Conference on Engineering Design  
., free text) and previous research has pointed out that product developers find current IT systems lacking capabilities to accurately retrieve relevant documents with unstructured data.  ...  Domain knowledge experts evaluated the results and it shows that the models applied managed to find relevant documents with up to 90% accuracy of the cases tested.  ...  Elasticsearch is built with a focus on scalability and has a distributed index which promises horizontal scalability for data up to the petabyte scale.  ... 
doi:10.1017/dsi.2019.266 fatcat:3mq2kdsiezdzlmg7jmgnmlht3u

BigDataGrapes D6.1 - Integrated Software Stack and APIs

Pythagoras Karampiperis, Sotiris Konstantinidis, Panagiotis Zervas
2019 Zenodo  
A container is an image with state, thus whenever an image is executed it becomes a container.  ...  It allows the deployment of every component and service with a single command. Compose takes as input a simple yml configuration file, that describes all the different docker images.  ... 
doi:10.5281/zenodo.2531665 fatcat:sfwxalpkgfd6lco2edbjjrdrla

BUDA.ART: A Multimodal Content-Based Analysis and Retrieval System for Buddha Statues [article]

Benjamin Renoust, Matheus Oliveira Franca, Jacob Chan, Van Le, Ayaka Uesaka, Yuta Nakashima, Hajime Nagahara, Jueren Wang, Yutaka Fujioka
2019 arXiv   pre-print
The system combines different CBIR and classical retrieval techniques to assemble 2D pictures, 3D statue scans and meta-data, that is focused on the Buddha facial characteristics.  ...  In order to investigate Buddhism at a large scale, we analyze a large archive of Buddhism related documents through the produced art.  ...  It is based on top of Elasticsearch [4] for text-based search and exploration (that is full text search from our extracted metadata, image path, and other user defined properties), and an image similarity  ... 
arXiv:1909.12932v1 fatcat:5q6xnyeoy5amjpftjacgxjrhqq

VRLE: Lifelog Interaction Prototype in Virtual Reality

Aaron Duane, Björn Þór Jónsson, Cathal Gurrin
2020 Proceedings of the Third Annual Workshop on Lifelog Search Challenge  
With this paper we present a novel approach to visual lifelog exploration based on our research to date utilising virtual reality as a medium for interactive information retrieval.  ...  KEYWORDS Lifelog, interactive retrieval, virtual reality.  ...  However, Elasticsearch can also index and retrieve large-scale data in diverse formats, including geospatial data, dates, numeric vectors, and images.  ... 
doi:10.1145/3379172.3391716 dblp:conf/mir/Duane0G20 fatcat:47rccqhjjrdbzegvxs7d4yf47e

CogStack - experiences of deploying integrated information retrieval and extraction services in a large National Health Service Foundation Trust hospital

Richard Jackson, Ismail Kartoglu, Clive Stringer, Genevieve Gorrell, Angus Roberts, Xingyi Song, Honghan Wu, Asha Agrawal, Kenneth Lui, Tudor Groza, Damian Lewsley, Doug Northwood (+3 others)
2018 BMC Medical Informatics and Decision Making  
Here, we describe our work on creating and deploying a low cost structured and unstructured information retrieval and extraction architecture within King's College Hospital, the management of governance  ...  For information governance reasons, no clinical record data can be provided with this research.  ...  Biomedical entity extraction, Bio-YODIE and Bio-LarK Implementing an information retrieval system over clinical records represents a high return on investment by lowering the barrier to large scale data  ... 
doi:10.1186/s12911-018-0623-9 pmid:29941004 pmcid:PMC6020175 fatcat:vpvyss7hxfb5vn7bbjqw2dzvly

BigDataGrapes D6.1 - Integrated Software Stack and APIs

Pythagoras Karampiperis, Sotiris Konstantinidis, Panagiotis Zervas, Mihalis Papakonstadinou, Zormpas Kyriakos, Ioanna Polychronou, Giannis Stoitsis, Nikolaos Doukas, Katrachoura Athina, Timotheos Lanitis
2019 Zenodo  
A container is an image with state, thus whenever an image is executed it becomes a container.  ...  It allows the deployment of every component and service with a single command. Compose takes as input a simple yml configuration file, that describes all the different docker images.  ... 
doi:10.5281/zenodo.3557402 fatcat:pqbsz6mlvfdwxpwvck4b7m6lcq

CogStack - Experiences Of Deploying Integrated Information Retrieval And Extraction Services In A Large National Health Service Foundation Trust Hospital [article]

Richard Jackson, Ismail Emre Kartoglu, Asha Agrawal, Kenneth Lui, Honghan Wu, Tudor Groza, Angus Roberts, Genevieve Gorrell, Xingyi Song, Damian Lewsley, Doug Northwood, Amos Folarin (+3 others)
2017 biorxiv/medrxiv   pre-print
For information governance reasons, no clinical record data can be provided with this research.  ...  Biomedical entity extraction, Bio-YODIE and Bio-LarK Implementing an information retrieval system over clinical records represents a high return on investment by lowering the barrier to large scale data  ...  The nontransactional, NoSQL data model used by Elasticsearch enables the ingestion of large quantities of data at high speed, making it rapidly available for querying.  ... 
doi:10.1101/123299 fatcat:fvsrib54xrcjzivuhduri2jrtu