Keywords:real estate, record linkage, clustering, building linkage, dwelling linkage
Real estate information database, an influential data source for comprehensive understanding of real estate transactions, has some problems, since the database is created by multiple real estate intermediary agents. Particularly, we confirm that there are substantial duplication in condominiums and apartments, and thus, it is necessary to integrate the duplicate records together. We regard the task as one of record linkage problems, and develop the model integrating the high likelihood of dwellings with the application to existing data handling techniques such as hierarchical clustering. We then validate whether the integrated records by the proposed method achieve practical recall and precision.
Authentication for paper PDF access
A password is required to view paper PDFs. If you are a registered participant, please log on the site from Participant Log In.
You could view the PDF with entering the PDF viewing password bellow.