TY - JOUR AU - Man, Mustafa AU - Ahmad Sabri, Ily Amalina PY - 2017/10/20 Y2 - 2024/03/29 TI - The Proposed Algorithm for Semi-Structured Data Integration: Case Study of Setiu Wetland Data Set JF - Journal of Telecommunication, Electronic and Computer Engineering (JTEC) JA - JTEC VL - 9 IS - 3-3 SE - Articles DO - UR - https://jtec.utem.edu.my/jtec/article/view/2876 SP - 79-84 AB - Recent evolutions in web technology and computer science provide environmental community in expanding resources for data collection and analysis. Today, people are facing challenges to the design of analysis methods, workflows, and interaction with data sets. Data integration is one of older research fields in database area. It is consists of three types of data; structured data, semi-structured data and unstructured data. Web pages is a part of semi-structured data. In this paper, we briefly introduce the problem of data extraction from web pages focus on images. We also discuss the evolution of extraction images from semi-structured to structured format using WEIDJ (Wrapper for extraction Images using Document Object Model (DOM) and JavaScript Object Notation Data (JSON) approach). An experiment was conducted on same website using different approach JSON and DOM to show the comparison of time performance. ER -