A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2022; you can also visit the original URL.
The file type is application/pdf
.
Filters
Testing Feedforward Neural Networks Training Programs
[article]
2022
arXiv
pre-print
Multiple testing techniques are proposed to generate test cases that can expose inconsistencies in the behavior of DNN models. ...
Nowadays, we are witnessing an increasing effort to improve the performance and trustworthiness of Deep Neural Networks (DNNs), with the aim to enable their adoption in safety critical systems such as ...
ACKNOWLEDGMENTS This work is partly funded by the Natural Sciences and Engineering Research Council of Canada (NSERC) and the Fonds de Recherche du Quebec (FRQ). ...
arXiv:2204.00694v1
fatcat:u75eyoipkzaxhoyr5zn7fjsbjm
Online controlled experiments at large scale
2013
Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '13
Running experiments at large scale requires addressing multiple challenges in three areas: cultural/organizational, engineering, and trustworthiness. ...
Classical testing and debugging techniques no longer apply when there are billions of live variants of the site, so alerts are used to identify issues rather than relying on heavy upfront testing. ...
We have been fortunate to have been part of Bing during the massive growth in experimentation, and wish to thank many people for encouraging data-driven decision making, especially Qi Lu and Harry Shum ...
doi:10.1145/2487575.2488217
dblp:conf/kdd/KohaviDFWXP13
fatcat:z4lqv2qcnnhjjofbpnarrbkks4
Online Experimentation with Surrogate Metrics: Guidelines and a Case Study
[article]
2021
arXiv
pre-print
In this paper, we discuss how to adjust the A/B testing comparison to ensure experiment results are trustworthy. We also provide practical guidelines on the choice of good surrogate metrics. ...
A/B tests have been widely adopted across industries as the golden rule that guides decision making. ...
Deployment and analysis of controlled experiments are done at large scale. This presents unique challenges and pitfalls. ...
arXiv:2106.01421v1
fatcat:in63j6edubco3p3zj5bndkh5ku
Methodology for development of scientific software and test frameworks in function of precision of the expected results
[article]
2022
arXiv
pre-print
The analysis of the development process of these tools can help estimate the effort needed to improve the design and precision of complex algorithms. ...
The relation between increased precision of the results and increased complexity of tests and test frameworks is also demonstrated based on these projects. ...
It also summarizes the amount of work needed to adapt the tool to higher technical precision.
a) b) Figure 24 . Relations between the projects as function of time (precision). ...
arXiv:2203.11650v1
fatcat:3oqxydonc5b47eszhk345sklta
Formal analysis of SAML 2.0 web browser single sign-on
2008
Proceedings of the 6th ACM workshop on Formal methods in security engineering - FMSE '08
In this paper we provide formal models of the protocol corresponding to one of the most applied use case scenario (the SP-Initiated SSO with Redirect/POST Bindings) and of a variant of the protocol implemented ...
by Google and currently in use by Google's customers (the SAML-based SSO for Google Applications). ...
sent(RS, B, A, M, Ch) receive(A,B,RS,M,Ch) −−−−−−−−−−−→ rcvd(A, B, M, Ch) (2) rcvd(A, B, M, Ch) stater (j, A, es, S) send j (A,B,B1,... ...
doi:10.1145/1456396.1456397
dblp:conf/ccs/ArmandoCCCT08
fatcat:fgvtcc5lzzekjoojix2pgrhewy
Modelling confidence in railway safety case
2017
Safety Science
Additionally, subsets of {(B, C, A), (B, C, A), (B, C, A), (B, C, A), (B, C, A), (B, C, A), (B, C, A)(B, C, A)} are used to represent the possible inferences among A, B and C: e.g. ...
Figure 2 : 2 The measures of truth of statement S with D-S theory bel(P ) = M ⊆P,M =∅ m Ω (M ) ∀P ⊆ Ω
, subsets of {(B, A), (B, A), (B, A), (B, A)} are used to represent the possible inferences between ...
doi:10.1016/j.ssci.2017.11.012
fatcat:lnmahazdqvf7zaowveqybjni6e
Opinion Mining
[chapter]
2017
Encyclopedia of Machine Learning and Data Mining
A/B
tests, split tests, randomized experiments, con-
trol/treatment tests, and online field experi-
ments. ...
Controlled Experiments and A/B Testing
Synonyms
A/B Testing; Randomized Experiments; Split Tests
Motivation and Background Many good resources are available with motivation and explanations about ...
doi:10.1007/978-1-4899-7687-1_100511
fatcat:oluapsjgxzh6nlkqujjj562lzi
Co-Immune: a case study on open innovation for vaccination hesitancy and access
[article]
2021
medRxiv
pre-print
In spite of this, institutional silos, paywalls and lack of participation of non-academic citizens in the design of solutions hamper efforts to meet these challenges. ...
Methods: We designed and implemented Co-Immune, a programme created to tackle the question of vaccination hesitancy and access to vaccination through an online and offline challenge-based open innovation ...
We thank the interviewees at the 7th Fondation Merieux Vaccine Acceptance conference for highlighting the key issues to address and potential solutions participants could build on. ...
doi:10.1101/2021.03.29.20248781
fatcat:aad2xthrfvbd7dmnprcg6636t4
Celebrity Endorsement as Drivers of Advertising Strategy: The Case of Toc Tien Endorsing Oppo
2017
VNU Journal of Science Economics and Business
Moreover, there was no evidence to show any linkage between trustworthiness and advertising. ...
of 304 respondents. ...
A. (2005)
Trustworthiness
5-point
Likert
Ayanwale, A.
B., Alimi, T.,
&
Ayanbimipe,
M. ...
doi:10.25073/2588-1108/vnueab.4074
fatcat:hqlfr2jrqvenxm5h4p2x2qkory
Future User Engagement Prediction and Its Application to Improve the Sensitivity of Online Experiments
2015
Proceedings of the 24th International Conference on World Wide Web - WWW '15
Modern Internet companies improve their services by means of data-driven decisions that are based on online controlled experiments (also known as A/B tests). ...
Especially, we show how it can be used to detect the treatment effect of an A/B test faster with the same level of statistical significance. ...
Some of existing works focused on the study of the trustworthiness of the results of an A/B test. ...
doi:10.1145/2736277.2741116
dblp:conf/www/DrutsaGS15
fatcat:r5nlnjnm6nbqfbqb43q46tnzg4
Measuring Article Quality in Wikipedia using the Collaboration Network
2015
Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015 - ASONAM '15
Due to the huge number of articles and the intensive edit rate, the manual evaluation of article content quality is inconceivable. ...
This work gives a generic formulation of the Mutual Reinforcement principle held between articles quality and authors authority and take explicitly advantage of the co-edits graph generated by individuals ...
To illustrate the intuition, let suppose 3 classes of articles A, B and C such as articles of class A are of better quality than those of type B and those belonging to class B of better quality than articles ...
doi:10.1145/2808797.2808895
dblp:conf/asunam/RobertiePT15
fatcat:c7e6v3t5vzeejdf7gr3jpbljle
Machine Learning for Performance Prediction of Channel Bonding in Next-Generation IEEE 802.11 WLANs
[article]
2021
arXiv
pre-print
In this context, the International Telecommunication Union (ITU) organized the first AI for 5G Challenge to bring industry and academia together to introduce and solve representative problems related to ...
the increasing complexity of future 5G and beyond communications. ...
Nevertheless, the same credit goes to the rest of the participants of PS-013 in the ITU AI for 5G Challenge: Miguel Camelo, Natalia Gaviria, Mohammad Abid, Ayman M. Aloshan, Faisal Alomar, Khaled M. ...
arXiv:2105.14219v1
fatcat:eakapobfcbcahfzp45btsxrb5i
Machine learning for performance prediction of channel bonding in next-generation IEEE 802.11 WLANS
2021
ITU Journal
In this context, the International Telecommunication Union (ITU) organized the First AI for 5G Challenge to bring industry and academia together to introduce and solve representative problems related to ...
the increasing complexity of future 5G and beyond communications. ...
Nevertheless, the same credit goes to the rest of the participants of PS-013 in the ITU AI for 5G Challenge: Miguel Camelo, Natalia Gaviria, Mohammad Abid, Ayman M. Aloshan, Faisal Alomar, Khaled M. ...
doi:10.52953/nbgs1213
fatcat:pckiqxmaz5cx3jsdpna6t4oije
Fair ranking: a critical review, challenges, and future directions
[article]
2022
arXiv
pre-print
Ranking, recommendation, and retrieval systems are widely used in online platforms and other societal systems, including e-commerce, media-streaming, admissions, gig platforms, and hiring. ...
spillovers and compounding effects over time, induced strategic incentives, and the effect of statistical uncertainty. ...
offline (and commonly, precision-driven) recommendation experiments are not always predictive of long-term simulation or online A/B testing outcomes [66, 21, 90] . ...
arXiv:2201.12662v1
fatcat:tf36txkf65gzbkysjhiog7o7ne
Crowdsourcing Quality of Experience Experiments
[chapter]
2017
Lecture Notes in Computer Science
Qualinet members that participated in the creation of Best Practices and Recommendations for Crowdsourced QoE -Lessons learned from the Qualinet Task Force [38] . ...
The authors want to thank Schloss Dagstuhl Leibniz-Zentrum für Informatik, the participants of Dagstuhl Seminar 15481 Evaluation in the Crowd: Crowdsourcing and Human-Centred Experiments as well as the ...
On the other hand, replacing the probability P (A > B) by its empirical estimate P A, B = N A,B /(N A,B + N B,A ) and applying the inverse of this function will yield an estimate of the distance of the ...
doi:10.1007/978-3-319-66435-4_7
fatcat:mkhhxbmtpzhqzcawfep7m7v5oi
« Previous
Showing results 1 — 15 out of 912 results