Baseball, P. (2000). In P. Basketball, H. F. Spirer, & L. Spirer (Eds.), Deciding to make the Situation: Examining Major Peoples Legal rights Abuses Having fun with Suggestions Solutions and you can Studies Analysis. AAAS.
Belin, T. Roentgen., & Rubin, D. B. (1995). A strategy having calibrating not true-fits rates from inside the record linkage. Record of the Western Analytical Relationship, 90(430), 694–707.
Bilenko, Yards., & Mooney, R. J. (2003). Adaptive Content Recognition Playing with Learnable String Resemblance Measures. In KDD ’03 (pp. 39–48). ACM.
Christen, P. (2008). Automated Listing Linkage Having fun with Seeded Nearest Neighbor and you can Service Vector Machine Classification. In the KDD ’08 (pp. 151–159). ACM.
Christen, P. (2012). A study out-of indexing tips for scalable listing linkage and you will deduplication. IEEE Transactions toward Studies and you can Study Technologies, 24(9), 1537–1555.
Cohen, W., Raviku). A comparison off sequence metrics to possess complimentary names and you may ideas. From inside the KDD workshop into the studies cleaning and you may target integration (Vol. step 3, pp. 73–78).
Copas, J., & Hilton, F. (1990). Listing linkage: Analytical patterns having complimentary desktop info. Journal of your Royal Mathematical People, Collection A, 153(3), 287–320.
Dai, A good. M., & Storkey, An excellent. J. (2011). The grouped creator-procedure design to have unsupervised entity quality. From inside the Phony neural networks and you will machine understanding–icann 2011 (pp. 241–249). Springer.
Fortini, M., Liseo, B., Nuccitelli, A., & Scanu, M. (2001). To your Bayesian Listing Linkage. Look during the Certified Statistics, 4(1), 185–198.
Gutman, R., Afendulis, C., & Zaslavsky, An effective. (2013). A good bayesian process of file hooking up to research end- of-lifetime scientific will set you back. Journal of Western Analytical Connection, 108(501), 34–47.
Hsu, W., Lee, Yards. L., Liu, B., & Ling, T. W. (2000). Exploration Exploration inside Diabetic patients Database: Results and you will Findings. From inside the KDD ’00 (pp. 430–436). ACM.
A split-mix Markov chain Monte Carlo procedure for brand new Dirichlet procedure blend design
Jewell, Letter. P., Spagat, Yards., & Jewell, B. L. (2013). MSE and Casualty Counts: Presumptions, Interpretation, and you will Pressures. Into the T. B. Seybolt, J. D. Aronson, & B. Fischhoff (Eds.), Relying Civil Casualties: An overview of Recording and you can Estimating Nonmilitary Fatalities in conflict. Oxford, UK: Oxford School Press.
Larsen, M. D. (2002)ments on Hierarchical Bayesian Listing Linkage. https://internationalwomen.net/fi/kambodzalaiset-naiset/ Within the Process of shared mathematical meetings, area towards the questionnaire look procedures (pp. 1995–2000). The brand new American Analytical Connection.
Steorts, Roentgen
Larsen, Meters. D. (2005). Improves from inside the Record Linkage Theory: Hierarchical Bayesian Number Linkage Principle. From inside the Legal proceeding of joint statistical meetings, area with the questionnaire browse procedures (pp. 3277–3284). The latest Western Statistical Organization.
Larsen, Meters. D., & Rubin, D. B. (2001). Iterative automatic checklist linkage using blend patterns. Record of one’s Western Analytical Relationship, 96(453), 32–41.
Lum, K., Rates, M. Elizabeth., & Banking companies, D. (2013). Apps out-of Numerous Options Estimate within the Individual Rights Search. Brand new Western Statistician, 67(4), 191–2 hundred.
Marchant, Letter. G., C., Kaplan, An excellent., Rubinstein, B. I. P., & Elazar, D. Letter. (2019). D-blink: Marketed prevent-to-stop bayesian organization resolution.
McCallum, A., & Wellner, B. (2004). Conditional Varieties of Term Uncertainty having Application to Noun Coreference. During the Enhances from inside the neural pointers operating systems (nips ’04) (pp. 905–912). MIT Drive.
Miller, P. L., Frawley, S. J., & Sayward, F. G. (2000). IMM/Scrub: A domain-Specific Unit to your Deduplication of Inoculation Records Info when you look at the Youth Immunization Registriesputers and Biomedical Look, 33(2), 126–143.
Murphy, J., Brackbill, R. Yards., Thalji, L., Dolan, Yards., Pulliam, P., & Walker, D. J. (2007). Calculating and Enhancing Coverage worldwide Trading Heart Health Registry. Statistics from inside the Drug, 26(8), 1688–1701.
Murray, J. S. (2016). Probabilistic listing linkage and you can deduplication shortly after indexing, blocking, and you will selection. Log of Confidentiality and you may Privacy, 7(1), 3–24.
Newcombe, H. B., Kennedy, J. Meters., Axford, S. J., & James, An effective. P. (1959). Automated linkage regarding vital records servers can be used to extract” follow-up” statistics from group out of data files out-of regimen records. Research, 130(3381), 954–959.
Sadinle, Meters. (2014). Finding Duplicates when you look at the a homicide Registry Using a Bayesian Partitioning Strategy. Annals away from Used Statistics, 8(4), 2404–2434.
Sariyar, Yards., Borg, A beneficial., & Pommerening, K. (2012). Effective Studying Tips for the new Deduplication off Digital Diligent Investigation Having fun with Category Trees. Journal out of Biomedical Informatics, 45(5), 893–900.
C., Hallway, Roentgen., & Fienberg, S. E. (2016). A good Bayesian Approach to Graphical List Linkage and Deduplication. Diary of your own American Mathematical Relationship, 111(516), 1660–1672.
Tancredi, A beneficial., & Liseo, B. (2011). A good hierarchical Bayesian method of checklist linkage and you can population proportions trouble. Annals away from Applied Analytics, 5(2B), 1553–1585.