Structuring E-Commerce Inventory

Karin Mauge, Khash Rohanimanesh, Jean-David Ruvini
2012 Annual Meeting of the Association for Computational Linguistics  
Large e-commerce enterprises feature millions of items entered daily by a large variety of sellers. While some sellers provide rich, structured descriptions of their items, the vast majority provide unstructured natural language descriptions. In this paper we present a two-step method for structuring items into descriptive properties. The first step consists of unsupervised property discovery and extraction. The second step involves supervised property synonym discovery using a maximum entropy based clustering algorithm. We evaluate our method on a year's worth of e-commerce data and show that it achieves excellent precision with good recall.
dblp:conf/acl/MaugeRR12 fatcat:5qqvbl2igvd5pdrxl4ycuismfm