Processing Large-scale XML Files on GPGPU Cluster
10, 2015-03-24, Graduate School of Computer and Information Sciences, Hosei University (法政大学大学院情報科学研究科)
XML is widely used as a textual data format for transporting and storing information in many areas. However, the cost of processing large-scale XML files becomes a serious issue for general-purpose processing methods. In this paper, we propose the design and implementation of a large-scale XML processing system on a GPU cluster to address this performance issue. The system combines CPUs and GPGPUs in a master-slave architecture to process an XML file. Processing consists of two phases: structure extraction and tag parsing. Structure extraction uses multiple threads to read the file and recognize the document structure. Tag parsing then uses the GPGPU to obtain each tag’s name and attributes from the location information gathered in the structure extraction phase.
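The two phases described above can be illustrated with a minimal, CPU-only sketch. It is not the system's implementation: the function names (`scan_chunk`, `extract_structure`, `parse_tag`, `parse_document`) are hypothetical, Python threads stand in for the reader threads, and a plain loop stands in for the GPGPU kernel that would handle one tag per thread. The sketch also assumes simplified input in which `<` and `>` appear only as tag delimiters (no comments, CDATA sections, or `>` inside attribute values).

```python
import re
from bisect import bisect_right
from concurrent.futures import ThreadPoolExecutor

# Matches name="value" or name='value' inside a tag body.
ATTR_RE = re.compile(r"""([^\s=]+)\s*=\s*(?:"([^"]*)"|'([^']*)')""")


def scan_chunk(data, start, end):
    """Phase 1 worker: record offsets of '<' and '>' in data[start:end]."""
    opens = [i for i in range(start, end) if data[i] == "<"]
    closes = [i for i in range(start, end) if data[i] == ">"]
    return opens, closes


def extract_structure(data, num_workers=4):
    """Phase 1: scan fixed-size chunks in threads, then pair tag delimiters."""
    chunk = max(1, -(-len(data) // num_workers))  # ceiling division
    ranges = [(s, min(s + chunk, len(data))) for s in range(0, len(data), chunk)]
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        results = list(pool.map(lambda r: scan_chunk(data, *r), ranges))
    # Chunks are processed in file order, so the merged lists stay sorted.
    opens = [i for o, _ in results for i in o]
    closes = [i for _, c in results for i in c]
    spans = []
    for o in opens:
        k = bisect_right(closes, o)  # first '>' located after this '<'
        if k < len(closes):
            spans.append((o, closes[k]))
    return spans


def parse_tag(data, start, end):
    """Phase 2 worker: parse one tag from its (start, end) offsets.

    Each call touches only its own span, which is what would let a GPU
    assign one thread per tag.
    """
    body = data[start + 1:end].strip()
    if body.startswith(("?", "!")):
        return {"kind": "other", "name": body, "attrs": {}}
    if body.startswith("/"):
        return {"kind": "close", "name": body[1:].strip(), "attrs": {}}
    kind = "empty" if body.endswith("/") else "open"
    parts = body.rstrip("/").strip().split(None, 1)
    attrs = {}
    if len(parts) > 1:
        for m in ATTR_RE.finditer(parts[1]):
            attrs[m.group(1)] = m.group(2) if m.group(2) is not None else m.group(3)
    return {"kind": kind, "name": parts[0], "attrs": attrs}


def parse_document(data, num_workers=4):
    """Run both phases and return the parsed tags in document order."""
    spans = extract_structure(data, num_workers)
    return [parse_tag(data, s, e) for s, e in spans]
```

Because phase 1 records only character offsets, a tag that straddles a chunk boundary needs no special handling: its `<` and `>` are found by different workers and paired afterwards. In this sketch the threads illustrate the division of work rather than a real speedup, since CPython threads do not run CPU-bound code in parallel.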