Testing Feedforward Neural Networks Training Programs [article]

Houssem Ben Braiek, Foutse Khomh
2022 arXiv   pre-print
Multiple testing techniques are proposed to generate test cases that can expose inconsistencies in the behavior of DNN models.  ...  Nowadays, we are witnessing an increasing effort to improve the performance and trustworthiness of Deep Neural Networks (DNNs), with the aim to enable their adoption in safety critical systems such as  ...  ACKNOWLEDGMENTS This work is partly funded by the Natural Sciences and Engineering Research Council of Canada (NSERC) and the Fonds de Recherche du Quebec (FRQ).  ... 
arXiv:2204.00694v1 fatcat:u75eyoipkzaxhoyr5zn7fjsbjm

Online controlled experiments at large scale

Ron Kohavi, Alex Deng, Brian Frasca, Toby Walker, Ya Xu, Nils Pohlmann
2013 Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '13  
Running experiments at large scale requires addressing multiple challenges in three areas: cultural/organizational, engineering, and trustworthiness.  ...  Classical testing and debugging techniques no longer apply when there are billions of live variants of the site, so alerts are used to identify issues rather than relying on heavy upfront testing.  ...  We have been fortunate to have been part of Bing during the massive growth in experimentation, and wish to thank many people for encouraging data-driven decision making, especially Qi Lu and Harry Shum  ... 
doi:10.1145/2487575.2488217 dblp:conf/kdd/KohaviDFWXP13 fatcat:z4lqv2qcnnhjjofbpnarrbkks4

Online Experimentation with Surrogate Metrics: Guidelines and a Case Study [article]

Weitao Duan, Shan Ba, Chunzhe Zhang
2021 arXiv   pre-print
In this paper, we discuss how to adjust the A/B testing comparison to ensure experiment results are trustworthy. We also provide practical guidelines on the choice of good surrogate metrics.  ...  A/B tests have been widely adopted across industries as the golden rule that guides decision making.  ...  Deployment and analysis of controlled experiments are done at large scale. This presents unique challenges and pitfalls.  ... 
arXiv:2106.01421v1 fatcat:in63j6edubco3p3zj5bndkh5ku

Methodology for development of scientific software and test frameworks in function of precision of the expected results [article]

T. Przedzinski
2022 arXiv   pre-print
The analysis of the development process of these tools can help estimate the effort needed to improve the design and precision of complex algorithms.  ...  The relation between increased precision of the results and increased complexity of tests and test frameworks is also demonstrated based on these projects.  ...  It also summarizes the amount of work needed to adapt the tool to higher technical precision. Figure 24. Relations between the projects as function of time (precision).  ... 
arXiv:2203.11650v1 fatcat:3oqxydonc5b47eszhk345sklta

Formal analysis of SAML 2.0 web browser single sign-on

Alessandro Armando, Roberto Carbone, Luca Compagna, Jorge Cuellar, Llanos Tobarra
2008 Proceedings of the 6th ACM workshop on Formal methods in security engineering - FMSE '08  
In this paper we provide formal models of the protocol corresponding to one of the most applied use case scenario (the SP-Initiated SSO with Redirect/POST Bindings) and of a variant of the protocol implemented  ...  by Google and currently in use by Google's customers (the SAML-based SSO for Google Applications).  ...  $sent(RS, B, A, M, Ch) \xrightarrow{receive(A,B,RS,M,Ch)} rcvd(A, B, M, Ch)$ (2) rcvd(A, B, M, Ch) state_r(j, A, es, S) send_j(A,B,B1,...  ... 
doi:10.1145/1456396.1456397 dblp:conf/ccs/ArmandoCCCT08 fatcat:fgvtcc5lzzekjoojix2pgrhewy

Modelling confidence in railway safety case

Rui Wang, Jérémie Guiochet, Gilles Motet, Walter Schön
2017 Safety Science  
Additionally, subsets of {(B, C, A), (B, C, A), (B, C, A), (B, C, A), (B, C, A), (B, C, A), (B, C, A)(B, C, A)} are used to represent the possible inferences among A, B and C: e.g.  ...  Figure 2: The measures of truth of statement S with D-S theory. $bel(P) = \sum_{M \subseteq P,\, M \neq \emptyset} m^{\Omega}(M), \ \forall P \subseteq \Omega$, subsets of {(B, A), (B, A), (B, A), (B, A)} are used to represent the possible inferences between  ... 
doi:10.1016/j.ssci.2017.11.012 fatcat:lnmahazdqvf7zaowveqybjni6e

Opinion Mining [chapter]

2017 Encyclopedia of Machine Learning and Data Mining  
A/B tests, split tests, randomized experiments, control/treatment tests, and online field experiments.  ...  Controlled Experiments and A/B Testing Synonyms A/B Testing; Randomized Experiments; Split Tests Motivation and Background Many good resources are available with motivation and explanations about  ... 
doi:10.1007/978-1-4899-7687-1_100511 fatcat:oluapsjgxzh6nlkqujjj562lzi

Co-Immune: a case study on open innovation for vaccination hesitancy and access [article]

Camille M. Masselot, Bastian Greshake Tzovaras, Christopher L.B. Graham, Rathin Jeyaram, Gary Finnegan, Isabelle Vitali, Thomas E. Landrain, Marc Santolini
2021 medRxiv   pre-print
In spite of this, institutional silos, paywalls and lack of participation of non-academic citizens in the design of solutions hamper efforts to meet these challenges.  ...  Methods: We designed and implemented Co-Immune, a programme created to tackle the question of vaccination hesitancy and access to vaccination through an online and offline challenge-based open innovation  ...  We thank the interviewees at the 7th Fondation Merieux Vaccine Acceptance conference for highlighting the key issues to address and potential solutions participants could build on.  ... 
doi:10.1101/2021.03.29.20248781 fatcat:aad2xthrfvbd7dmnprcg6636t4

Celebrity Endorsement as Drivers of Advertising Strategy: The Case of Toc Tien Endorsing Oppo

Phuong Nguyen Van
2017 VNU Journal of Science Economics and Business  
Moreover, there was no evidence to show any linkage between trustworthiness and advertising.  ...  of 304 respondents.  ...  A. (2005) Trustworthiness 5-point Likert Ayanwale, A. B., Alimi, T., & Ayanbimipe, M.  ... 
doi:10.25073/2588-1108/vnueab.4074 fatcat:hqlfr2jrqvenxm5h4p2x2qkory

Future User Engagement Prediction and Its Application to Improve the Sensitivity of Online Experiments

Alexey Drutsa, Gleb Gusev, Pavel Serdyukov
2015 Proceedings of the 24th International Conference on World Wide Web - WWW '15  
Modern Internet companies improve their services by means of data-driven decisions that are based on online controlled experiments (also known as A/B tests).  ...  Especially, we show how it can be used to detect the treatment effect of an A/B test faster with the same level of statistical significance.  ...  Some of existing works focused on the study of the trustworthiness of the results of an A/B test.  ... 
doi:10.1145/2736277.2741116 dblp:conf/www/DrutsaGS15 fatcat:r5nlnjnm6nbqfbqb43q46tnzg4

Measuring Article Quality in Wikipedia using the Collaboration Network

Baptiste de La Robertie, Yoann Pitarch, Olivier Teste
2015 Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015 - ASONAM '15  
Due to the huge number of articles and the intensive edit rate, the manual evaluation of article content quality is inconceivable.  ...  This work gives a generic formulation of the Mutual Reinforcement principle held between articles quality and authors authority and take explicitly advantage of the co-edits graph generated by individuals  ...  To illustrate the intuition, let suppose 3 classes of articles A, B and C such as articles of class A are of better quality than those of type B and those belonging to class B of better quality than articles  ... 
doi:10.1145/2808797.2808895 dblp:conf/asunam/RobertiePT15 fatcat:c7e6v3t5vzeejdf7gr3jpbljle

Machine Learning for Performance Prediction of Channel Bonding in Next-Generation IEEE 802.11 WLANs [article]

Francesc Wilhelmi, David Góez, Paola Soto, Ramon Vallés, Mohammad Alfaifi, Abdulrahman Algunayah, Jorge Martin-Pérez, Luigi Girletti, Rajasekar Mohan, K Venkat Ramnan, Boris Bellalta
2021 arXiv   pre-print
In this context, the International Telecommunication Union (ITU) organized the first AI for 5G Challenge to bring industry and academia together to introduce and solve representative problems related to  ...  the increasing complexity of future 5G and beyond communications.  ...  Nevertheless, the same credit goes to the rest of the participants of PS-013 in the ITU AI for 5G Challenge: Miguel Camelo, Natalia Gaviria, Mohammad Abid, Ayman M. Aloshan, Faisal Alomar, Khaled M.  ... 
arXiv:2105.14219v1 fatcat:eakapobfcbcahfzp45btsxrb5i

Machine learning for performance prediction of channel bonding in next-generation IEEE 802.11 WLANS

Francesc Wilhelmi, David Góez, Paola Soto, Ramon Vallés, Mohammad Alfaifi, Abdulrahman Algunayah, Jorge Martín-Pérez, Luigi Girletti, Rajasekar Mohan, K Venkat Ramnan, Boris Bellalta
2021 ITU Journal  
In this context, the International Telecommunication Union (ITU) organized the First AI for 5G Challenge to bring industry and academia together to introduce and solve representative problems related to  ...  the increasing complexity of future 5G and beyond communications.  ...  Nevertheless, the same credit goes to the rest of the participants of PS-013 in the ITU AI for 5G Challenge: Miguel Camelo, Natalia Gaviria, Mohammad Abid, Ayman M. Aloshan, Faisal Alomar, Khaled M.  ... 
doi:10.52953/nbgs1213 fatcat:pckiqxmaz5cx3jsdpna6t4oije

Fair ranking: a critical review, challenges, and future directions [article]

Gourab K Patro, Lorenzo Porcaro, Laura Mitchell, Qiuyue Zhang, Meike Zehlike, Nikhil Garg
2022 arXiv   pre-print
Ranking, recommendation, and retrieval systems are widely used in online platforms and other societal systems, including e-commerce, media-streaming, admissions, gig platforms, and hiring.  ...  spillovers and compounding effects over time, induced strategic incentives, and the effect of statistical uncertainty.  ...  offline (and commonly, precision-driven) recommendation experiments are not always predictive of long-term simulation or online A/B testing outcomes [66, 21, 90].  ... 
arXiv:2201.12662v1 fatcat:tf36txkf65gzbkysjhiog7o7ne

Crowdsourcing Quality of Experience Experiments [chapter]

Sebastian Egger-Lampl, Judith Redi, Tobias Hoßfeld, Matthias Hirth, Sebastian Möller, Babak Naderi, Christian Keimel, Dietmar Saupe
2017 Lecture Notes in Computer Science  
Qualinet members that participated in the creation of Best Practices and Recommendations for Crowdsourced QoE - Lessons learned from the Qualinet Task Force [38].  ...  The authors want to thank Schloss Dagstuhl Leibniz-Zentrum für Informatik, the participants of Dagstuhl Seminar 15481 Evaluation in the Crowd: Crowdsourcing and Human-Centred Experiments as well as the  ...  On the other hand, replacing the probability $P(A > B)$ by its empirical estimate $P_{A,B} = N_{A,B}/(N_{A,B} + N_{B,A})$ and applying the inverse of this function will yield an estimate of the distance of the  ... 
doi:10.1007/978-3-319-66435-4_7 fatcat:mkhhxbmtpzhqzcawfep7m7v5oi