
Interactive wrapper generation with minimal user effort

Utku Irmak, Torsten Suel
2006 Proceedings of the 15th international conference on World Wide Web - WWW '06  
While much of the data on the web is unstructured in nature, there is also a significant amount of embedded structured data, such as product information on e-commerce sites or stock data on financial sites. A large amount of research has focused on the problem of generating wrappers, i.e., software tools that allow easy and robust extraction of structured data from text and HTML sources. In many applications, such as comparison shopping, data has to be extracted from many different sources,
... ng manual coding of a wrapper for each source impractical. On the other hand, fully automatic approaches are often not reliable enough, resulting in low quality of the extracted data. We describe a complete system for semi-automatic wrapper generation that can be trained on different data sources in a simple interactive manner. Our goal is to minimize the amount of user effort for training reliable wrappers through design of a suitable training interface that is implemented based on a powerful underlying extraction language and a set of training and ranking algorithms. Our experiments show that our system achieves reliable extraction with a very small amount of user effort.
doi:10.1145/1135777.1135859 dblp:conf/www/IrmakS06 fatcat:tu6ng2mtvbdctfq7rypmbgfey4

Contextual Ranking of Keywords Using Click Data

Utku Irmak, Vadim von Brzeski, Reiner Kraft
2009 Proceedings / International Conference on Data Engineering  
The problem of automatically extracting the most interesting and relevant keyword phrases in a document has been studied extensively as it is crucial for a number of applications. These applications include contextual advertising, automatic text summarization, and user-centric entity detection systems. All these applications can potentially benefit from a successful solution as it enables computational efficiency (by decreasing the input size), noise reduction, or overall improved user
... ion. In this paper, we study this problem and focus on improving the overall quality of user-centric entity detection systems. First, we review our concept extraction technique, which relies on search engine query logs. We then define a new feature space to represent the interestingness of concepts, and describe a new approach to estimate their relevancy for a given context. We utilize click through data obtained from a large scale user-centric entity detection system, Contextual Shortcuts, to train a model to rank the extracted concepts, and evaluate the resulting model extensively again based on their click through data. Our results show that the learned model outperforms the baseline model, which employs similar features but whose weights are tuned carefully based on empirical observations, and reduces the error rate from 30.22% to 18.66%.
doi:10.1109/icde.2009.76 dblp:conf/icde/IrmakBK09 fatcat:6gtfi6er5rhnxctcvpduiwoxx4

Efficient query subscription processing for prospective search engines

Utku Irmak, Svilen Mihaylov, Torsten Suel, Samrat Ganguly, Rauf Izmailov
2006 Proceedings of the 15th international conference on World Wide Web - WWW '06  
Current web search engines are retrospective in that they limit users to searches against already existing pages. Prospective search engines, on the other hand, allow users to upload queries that will be applied to newly discovered pages in the future. Some examples of prospective search are the subscription features in Google News and in RSS-based blog search engines. In this paper, we study the problem of efficiently processing large numbers of keyword query subscriptions against a stream of
... ewly discovered documents, and propose several query processing optimizations for prospective search. Our experimental evaluation shows that these techniques can improve the throughput of a well known algorithm by more than a factor of 20, and allow matching hundreds or thousands of incoming documents per second against millions of subscription queries per node.
doi:10.1145/1135777.1136003 dblp:conf/www/IrmakMSGI06 fatcat:voveoqjjazd3jkzrpqqy63kwni
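
The matching problem described in this abstract can be illustrated with a minimal sketch. It is not the algorithm or the optimizations evaluated in the paper; it only assumes conjunctive keyword subscriptions (every term must occur in the document) and indexes each subscription under a single one of its terms. The class name `SubscriptionIndex` and its methods are hypothetical.

```python
from collections import defaultdict


class SubscriptionIndex:
    """Toy matcher for conjunctive keyword subscriptions (illustrative only)."""

    def __init__(self):
        # term -> ids of the subscriptions indexed under that term
        self._by_term = defaultdict(set)
        # subscription id -> full set of required terms
        self._terms = {}

    def subscribe(self, sub_id, query):
        terms = frozenset(query.lower().split())
        if not terms:
            raise ValueError("empty query")
        self._terms[sub_id] = terms
        # Index under one term only (assumption: the alphabetically first),
        # so each subscription is examined at most once per document.
        self._by_term[min(terms)].add(sub_id)

    def match(self, document):
        doc_terms = set(document.lower().split())
        matched = set()
        for term in doc_terms:
            for sub_id in self._by_term.get(term, ()):
                # Verify that every required term occurs in the document.
                if self._terms[sub_id] <= doc_terms:
                    matched.add(sub_id)
        return matched


index = SubscriptionIndex()
index.subscribe("a", "web search")
index.subscribe("b", "stock prices")
index.subscribe("c", "search")
print(index.match("New web search engine launched"))
```

Indexing under a single term keeps the candidate set per document small; choosing the rarest term instead of the alphabetically first would likely reduce candidates further, but that requires term statistics this sketch does not keep.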

A scalable machine-learning approach for semi-structured named entity recognition

Utku Irmak, Reiner Kraft
2010 Proceedings of the 19th international conference on World wide web - WWW '10  
Named entity recognition studies the problem of locating and classifying parts of free text into a set of predefined categories. Although extensive research has focused on the detection of person, location and organization entities, there are many other entities of interest, including phone numbers, dates, times and currencies (to name a few examples). We refer to these types of entities as semistructured named entities, since they usually follow certain syntactic formats according to some
... ntions, although their structure is typically not well-defined. Regular expression solutions require a significant amount of manual effort, and supervised machine learning approaches rely on large sets of labeled training data. Therefore, these approaches do not scale when we need to support many semi-structured entity types in many languages and regions. In this paper, we study this problem and propose a novel three-level bootstrapping framework for the detection of semi-structured entities. We describe the proposed techniques for phone, date and time entities, and perform extensive evaluations on English, German, Polish, Swedish and Turkish documents. Despite the minimal input from the user, our approach can achieve 95% precision and 84% recall for phone entities, and 94% precision and 81% recall for date and time entities, on average. We also discuss implementation details and report run time performance results, which show significant improvements over regular expression based solutions.
doi:10.1145/1772690.1772738 dblp:conf/www/IrmakK10 fatcat:ehled2fptzamzgkckomjaiwz6y

Hierarchical substring caching for efficient content distribution to low-bandwidth clients

Utku Irmak, Torsten Suel
2005 Proceedings of the 14th international conference on World Wide Web - WWW '05  
While overall bandwidth in the internet has grown rapidly over the last few years, and an increasing number of clients enjoy broadband connectivity, many others still access the internet over much slower dialup or wireless links. To address this issue, a number of techniques for optimized delivery of web and multimedia content over slow links have been proposed, including protocol optimizations, caching, compression, and multimedia transcoding, and several large ISPs have recently begun to
... y promote dialup acceleration services based on such techniques. A recent paper by Rhea, Liang, and Brewer proposed an elegant technique called value-based caching that caches substrings of files, rather than entire files, and thus avoids repeated transmission of substrings common to several pages or page versions. We propose and study a hierarchical substring caching technique that provides significant savings over this basic approach. We describe several additional techniques for minimizing overheads and perform an evaluation on a large set of real web access traces that we collected. In the second part of our work, we compare our approach to a widely studied alternative approach based on delta compression, and show how to integrate the two for best overall performance. The studied techniques are typically employed in a client-proxy environment, with each proxy serving a large number of clients, and an important aspect is how to conserve resources on the proxy while exploiting the significant memory and CPU power available on current clients.
doi:10.1145/1060745.1060757 dblp:conf/www/IrmakS05 fatcat:lk5uopncerey5pfrw3imxwg5fq
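
The basic idea of caching substrings by value, which this abstract builds on, can be sketched as follows. This is a simplified illustration, not the hierarchical technique proposed in the paper: it assumes fixed-size blocks (chosen for brevity), a single client per proxy, and SHA-1 digests as block identifiers. `BLOCK_SIZE`, `encode`, and `decode` are hypothetical names.

```python
import hashlib

BLOCK_SIZE = 8  # hypothetical block size, kept tiny for the demo


def encode(data: bytes, client_cache: set) -> list:
    """Proxy side: send a digest instead of a block the client already holds."""
    messages = []
    for start in range(0, len(data), BLOCK_SIZE):
        block = data[start:start + BLOCK_SIZE]
        digest = hashlib.sha1(block).digest()
        if digest in client_cache:
            messages.append(("ref", digest))
        else:
            messages.append(("raw", block))
            client_cache.add(digest)  # the client will hold it after this send
    return messages


def decode(messages: list, store: dict) -> bytes:
    """Client side: rebuild the payload and remember new blocks by digest."""
    out = []
    for kind, payload in messages:
        if kind == "raw":
            store[hashlib.sha1(payload).digest()] = payload
            out.append(payload)
        else:
            out.append(store[payload])
    return b"".join(out)


proxy_view, client_store = set(), {}
page_v1 = b"HEADER__body-v1_FOOTER__"
page_v2 = b"HEADER__body-v2_FOOTER__"
assert decode(encode(page_v1, proxy_view), client_store) == page_v1
second = encode(page_v2, proxy_view)
assert decode(second, client_store) == page_v2
print([kind for kind, _ in second])  # only the changed middle block is resent
```

With fixed-size blocks, an insertion near the start of a page shifts every later block and defeats the cache, so a practical scheme would need block boundaries that depend on content rather than on position.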

Leveraging context in user-centric entity detection systems

Vadim von Brzeski, Utku Irmak, Reiner Kraft
2007 Proceedings of the sixteenth ACM conference on Conference on information and knowledge management - CIKM '07  
A user-centric entity detection system is one in which the primary consumer of the detected entities is a person who can perform actions on the detected entities (e.g. perform a search, view a map, shop, etc.). We contrast this with machine-centric detection systems where the primary consumer of the detected entities is a machine. Machine-centric detection systems typically focus on the quantity of detected entities, measured by precision and recall metrics, with the goal of correctly
... g every single entity in a document. However, the simple precision/recall scores of machine-centric entity detection systems fail to accurately reflect the quality of detected entities in user-centric systems, where users may not necessarily want to "see" every possible entity. We posit that not all of the detected entities in a given piece of text are necessarily relevant to the main topic of the text, nor are they necessarily interesting enough to the user to warrant further action. In fact, presenting all of the detected entities to a user may annoy the user to the point where he decides to turn this capability off completely, an undesirable outcome. Therefore, we propose to measure the quality and utility of user-centric entity detection systems in three core dimensions: the accuracy, the interestingness, and the relevance of the entities it presents to the user. We show that leveraging surrounding context can greatly improve the performance of such systems in all three dimensions by employing novel algorithms for generating a concept vector and for finding concept extensions using search query logs. We extensively evaluate the proposed algorithms within Contextual Shortcuts, a large-scale user-centric entity detection platform, using 1,586 entities detected over 1,519 documents. The results confirm the importance of using context within user-centric entity detection systems, and validate the usefulness of the proposed algorithms by showing how they improve the overall entity detection quality within Contextual Shortcuts.
doi:10.1145/1321440.1321537 dblp:conf/cikm/BrzeskiIK07 fatcat:rzc34jvau5e4tdowwgmgmyhd4y

Effect of thyroid hormone supplementation on limiting the growth of epicardial adipose tissue in patients with subclinical hypothyroidism

Irmak Sayin, Aycan Fahri Erkan, Berkay Ekici, Utku Kutuk, Ahmet Corakci, Hasan Fehmi Tore
2016 Kardiologia polska  
Abstract. Background: Subclinical hypothyroidism (SCH) is a common disorder which has adverse cardiovascular effects. Epicardial adipose tissue (EAT), a novel marker of cardiovascular risk, is increased in SCH. Aim: We aimed to investigate whether L-thyroxine treatment can reverse the thickening of EAT in SCH. Methods: Forty-four patients with SCH and 42 euthyroid control subjects were included. EAT thickness was measured using transthoracic echocardiography at baseline and after
... on of the euthyroid status with 3 months of L-thyroxine treatment. Results: At baseline, mean EAT thickness was significantly greater in the SCH group when compared to the control group (6.3 ± 1.7 mm vs. 4.1 ± 0.9 mm, respectively, p < 0.001). There was a significant positive correlation between baseline serum thyroid stimulating hormone (TSH) level and EAT thickness in the SCH group. There was a significant reduction in mean EAT thickness in response to L-thyroxine treatment (6.3 ± 1.7 mm vs. 5.1 ± 1.4 mm, p < 0.001). The decrease in EAT thickness after L-thyroxine treatment when compared to baseline (ΔEAT) significantly correlated to the difference in TSH levels before and after treatment (ΔTSH; r = 0.323; p = 0.032). Conclusions: Epicardial adipose tissue thickness is increased in patients with SCH. This thickening was alleviated with restoration of the euthyroid status with L-thyroxine treatment in our study population of predominantly male, relatively old subjects with greater baseline EAT thickness.
doi:10.5603/kp.a2016.0053 pmid:27112940 fatcat:flygtuqh7nbiriwkkr5cnapjzi

Quality Assessment of Urban Furniture Elements: The Case of Cumhuriyet Neighborhood, Uşak

Hüseyin Berk TÜRKER, Gülbeyaz SAKINMAZ
2021 Mimarlık Bilimleri ve Uygulamaları Dergisi (MBUD)  
In addition, urban furniture that is compatible with its surroundings contributes to urban identity and increases imageability (Külekçi and Irmak, 2019). ... are Veysel Eroğlu, Funda, Uğur, Örnek Cami, Merih, Utku, Özge, Uzun, Albayrak, Şirinkent and Akpınar Park. ...
doi:10.30785/mbud.973693 fatcat:huw6rhoyfzfhhnywxu63uvy6cm

The association of myocardial hypertrophy and apical diverticulum: more than coincidence

Ali Sivrioglu, Muzaffer Saglam, Ersin Ozturk, Mehmet Incedayi, Guner Sonmez, Hakan Mutlu, Bulent Karaman
2013 Diagnostic and Interventional Radiology  
Gallego 1 8 Bela Kis 1 8 Figen Demirkazık 1 10 Oğuz Dicle 1 10 Houman Djaladat 1 10 Irmak Durur-Subaşı 1 10 T. Grieser 1 10 W.J.  ...  Spiliopoulos 5 5 Pascal Baltzer 5 7 Moritz Albrecht 5 8 Ting Chen 5 9 Murat Canyiğit 5 12 Feng Gao 5 12 Naciye Sinem Gezer 5 12 Dimitrios Filippiadis 5 15 Eduardo Fleury 5 15 Utku  ... 
doi:10.5152/dir.2013.166 fatcat:dp6p3w3rfvbyhbwj2e7cyouj3e

Current Statement of Intensive Care Units in Turkey: Data obtained from 67 Centers

Ozlem Ediboglu, Ozlem Yazicioglu Mocin, Ezgi Ozyilmaz, Cuneyt Salturk, Tugba Onalan, Gulsah Seydaoglu, Turgay Celikel, Huseyin Arikan, Sena Ataman, Cenk Kirakli, Zerrin Ozcelik, Sema Kultufan (+94 others)
2018 Turkish Thoracic Journal  
CONCLUSION: In terms of the number of staff, there is a need for specialist physicians, especially during the night and nurses in our country. It was thought that the number of ICU-certified nurses was comparatively sufficient, yet the target was supposed to be 100% for this rate.
doi:10.5152/turkthoracj.2018.170104 pmid:30322437 pmcid:PMC6196900 fatcat:kzv4hvhawjhybiarral37yqq6i


2016 Fırat Üniversitesi Sosyal Bilimler Dergisi  
The same authors also describe in detail the many lakes, rivers and canals situated among the main branches of the Nile. ... Utku, op. cit., pp. 528, 537, 538 İstanbul, 1991, p. 241. 28 Utku, op. cit., p. 310. 29 Eymen Fuâd Seyyid, "Nil", DİA, vol. 33, p. 123. ...
doi:10.18069/fusbed.00509 fatcat:xy54lw5qhjhxhcxx2vb4pmsjou

Fliliz Boran, Filiz Akti
2015 Hittite Journal of Science & Engineering  
Ali KOSE, Kadir EROL, Celal Utku DENIZ and Emel TAMAHKAR IRMAK for their help and support.  ... 
doi:10.17350/hjse19030000022 fatcat:u5vgv7ljlzchnhzjjiopf6fnku


Özlem Yatkın, Melike Yılmazer
2017 International Journal of Contemporary Tourism Research  
That study found that assistant managers experienced burnout at a higher rate than general managers (Ersoy and Demirel Utku, 2005). ... Kuruüzüm, Anafarta and Irmak (2008), in their study of middle-level managers at 4- and 5-star accommodation establishments in Antalya, job satisfaction, job characteristics and demographic variables ...
doi:10.30625/ijctr.315763 fatcat:6b5dzudxg5aebk555p3g3ptghe

Classification of Epilepsy EEG Data with Machine Learning Techniques

2021 European Journal of Science and Technology  
An artificial neural network (ANN) is made up of cells designed on the model of the learning function of the human brain (Asilkan and Irmak, 2009). ...
doi:10.31590/ejosat.857507 fatcat:fgb5s7uegrbxlk3aefxughhha4


2018 Elektronik Sosyal Bilimler Dergisi  
The other/popular category covers names that do not fall into the previous three categories and fashionable names that may be called popular (Gizem, Utku, Mesut, Esra, Tuğçe, Beyza, Rojda, etc.). ... names of Muhammad and his relatives, names related to paradise, names of religious leaders, names related to the Holy Qur'an, names related to the Kaaba, sura names, country names, names of seas, rivers and lakes, tree names ...
doi:10.17755/esosder.344705 fatcat:vhlzonn7wfgxzhvhlrc4xvdo7q