Ball, P. (2000). Into the P. Basketball, H. F. Spirer, & L. Spirer (Eds.), Making the Instance: Examining Major Individual Liberties Violations Having fun with Information Solutions and you may Investigation Investigation. AAAS.
Belin, T. R., & Rubin, D. B. (1995). A strategy getting calibrating not the case-suits costs into the record linkage. Record of your Western Statistical Association, 90(430), 694–707.
Bilenko, M., & Mooney, Roentgen. J. (2003). Transformative Content Detection Using Learnable String Similarity Procedures. Into the KDD ’03 (pp. 39–48). ACM.
Christen, P. (2008). Automated Checklist Linkage Playing with Seeded Nearby Neighbor and Help Vector Servers Classification. Inside KDD ’08 (pp. 151–159). ACM.
Christen, P. (2012). A study out of indexing suggestions for scalable number linkage and you can deduplication. IEEE Transactions towards the Education and Investigation Technologies, 24(9), 1537–1555.
Cohen, W., Raviku). A comparison from string metrics for coordinating brands and you may information. From inside the KDD working area towards data cleanup and target integration (Vol. step three, pp. 73–78).
Copas, J., & Hilton, F. (1990). Checklist linkage: Statistical models for coordinating desktop facts. Diary of your Royal Analytical Neighborhood, Show A great, 153(3), 287–320.
Dai, A beneficial. Meters., & Storkey, A. J. (2011). New grouped publisher-situation model for unsupervised organization solution. From inside the Phony neural systems and you may machine discovering–icann 2011 (pp. 241–249). Springer.
Fortini, Yards., Liseo, B., Nuccitelli, A., & Scanu, Meters. (2001). Visit Your URL On Bayesian Checklist Linkage. Browse into the Authoritative Statistics, 4(1), 185–198.
Gutman, Roentgen., Afendulis, C., & Zaslavsky, Good. (2013). A great bayesian procedure of file connecting to analyze avoid- of-lives medical will cost you. Record of your own American Mathematical Relationship, 108(501), 34–47.
Hsu, W., Lee, Yards. L., Liu, B., & Ling, T. W. (2000). Exploration Exploration during the Diabetics Database: Results and you may Findings. Into the KDD ’00 (pp. 430–436). ACM.
A torn-mix Markov strings Monte Carlo procedure for this new Dirichlet process mixture model
Jewell, Letter. P., Spagat, Yards., & Jewell, B. L. (2013). MSE and you can Casualty Matters: Assumptions, Interpretation, and you may Demands. Inside T. B. Seybolt, J. D. Aronson, & B. Fischhoff (Eds.), Counting Civil Casualties: An introduction to Tape and you may Estimating Nonmilitary Deaths incompatible. Oxford, UK: Oxford College Push.
Larsen, Yards. D. (2002)ments to your Hierarchical Bayesian Number Linkage. Within the Procedures of combined analytical group meetings, area into the questionnaire search methods (pp. 1995–2000). This new American Mathematical Connection.
Larsen, M. D. (2005). Advances from inside the Record Linkage Idea: Hierarchical Bayesian Number Linkage Theory. For the Procedures of the mutual statistical conferences, part on questionnaire research procedures (pp. 3277–3284). The latest American Analytical Connection.
Larsen, Meters. D., & Rubin, D. B. (2001). Iterative automated list linkage using blend models. Record of your own American Statistical Organization, 96(453), 32–41.
Lum, K., Price, Yards. Elizabeth., & Banks, D. (2013). Applications off Several Systems Estimation within the Human Liberties Look. The American Statistician, 67(4), 191–two hundred.
Marchant, N. Grams., C., Kaplan, Good., Rubinstein, B. I. P., & Elazar, D. N. (2019). D-blink: Marketed prevent-to-avoid bayesian entity quality.
McCallum, A beneficial., & Wellner, B. (2004). Conditional Varieties of Label Uncertainty having Software in order to Noun Coreference. In Advances into the neural suggestions control expertise (nips ’04) (pp. 905–912). MIT Drive.
Miller, P. L., Frawley, S. J., & Sayward, F. G. (2000). IMM/Scrub: A site-Particular Product towards the Deduplication off Vaccination Background Details during the Young people Immunization Registriesputers and you can Biomedical Lookup, 33(2), 126–143.
Murphy, J., Brackbill, R. Meters., Thalji, L., Dolan, Yards., Pulliam, P., & Walker, D. J. (2007). Computing and Improving Coverage in the world Trading Cardiovascular system Health Registry. Analytics within the Treatments, 26(8), 1688–1701.
Murray, J. S. (2016). Probabilistic record linkage and you may deduplication shortly after indexing, blocking, and you will selection. Journal out of Confidentiality and you will Confidentiality, 7(1), 3–24.
Newcombe, H. B., Kennedy, J. Meters., Axford, S. J., & James, A great. P. (1959). Automated linkage of vital records servers are often used to extract” follow-up” statistics away from parents out-of documents away from regimen information. Science, 130(3381), 954–959.
Sadinle, Meters. (2014). Finding Copies for the a murder Registry Using a great Bayesian Partitioning Strategy. Annals of Applied Analytics, 8(4), 2404–2434.
Sariyar, M., Borg, A good., & Pommerening, K. (2012). Active Learning Methods for the brand new Deduplication regarding Digital Diligent Study Using Classification Trees. Log off Biomedical Informatics, 45(5), 893–900.
C., Hallway, R., & Fienberg, S. Age. (2016). A great Bayesian Way of Graphical Checklist Linkage and you can Deduplication. Record of one’s Western Statistical Organization, 111(516), 1660–1672.
Tancredi, An excellent., & Liseo, B. (2011). Good hierarchical Bayesian method to checklist linkage and you may inhabitants size difficulties. Annals from Used Analytics, 5(2B), 1553–1585.