Automatic Extraction of Rarely Explored Materials and Methods Sections from Research Journals using Machine Learning Techniques

Kavitha Jayaram, Prakash G, Jayaram V
2020 International Journal of Advanced Computer Science and Applications  
The scientific community is expanding by leaps and bounds every day owing to pioneering and path breaking scientific literature published in journals around the globe. Viewing as well as retrieving this data is a challenging task in today's fast paced world. The essence and importance of scientific research papers for the expert lies in their experimental and theoretical results along with the sanctioned research projects from the organizations. Since scant work has been done in this direction,
more » ... the alternative option is to explore text mining by machine learning techniques. Myriad journals are available on material research which throws light on a gamut of materials, synthesis methods, and characterization methods used to study properties of the materials. Application of materials has many diversified areas, hence selected papers from "Journal of Material Science" where "Materials and Methods" sections contains names of the method, characterization techniques (instrumental methods), algorithms, images, etc. used in research work. The "Acknowledgment" section conveys information about authors' proximity, collaborations with organizations that are again not explored for the citation network. In the present articulated work, our attempt is to derive a means to automatically extract methods or terminologies used in characterization techniques, author, organization data from "Materials and Methods" and "Acknowledgment" sections, using machine learning techniques. Another goal of this research is to provide a data set for characterization terms, classification and an extended version of the existing citation network for material research. The complete dataset will help new researchers to select research work, find new domains and techniques to solve advanced scientific research problems.
doi:10.14569/ijacsa.2020.0110857 fatcat:4rm7w6i2z5efnpbggx7foxdcuu