Is this you? Create Your Porfile

Lisa Friedland

University of Massachusetts Amherst

Archive Network Publication Hotspot Collaboration

Network

Latest external collaboration on country level. Dive into details by clicking on the dots.

Explore More

Hotspot

Dive into the research topics where Lisa Friedland is active.

Explore More

Publication

Featured researches published by Lisa Friedland.

knowledge discovery and data mining | 2003

Learning relational probability trees

Jennifer Neville; David D. Jensen; Lisa Friedland; Michael Hay

Classification trees are widely used in the machine learning and data mining communities for modeling propositional data. Recent work has extended this basic paradigm to probability estimation trees. Traditional tree learning algorithms assume that instances in the training data are homogenous and independently distributed. Relational probability trees (RPTs) extend standard probability estimation trees to a relational setting in which data instances are heterogeneous and interdependent. Our algorithm for learning the structure and parameters of an RPT searches over a space of relational features that use aggregation functions (e.g. AVERAGE, MODE, COUNT) to dynamically propositionalize relational data and create binary splits within the RPT. Previous work has identified a number of statistical biases due to characteristics of relational data such as autocorrelation and degree disparity. The RPT algorithm uses a novel form of randomization test to adjust for these biases. On a variety of relational learning tasks, RPTs built using randomization tests are significantly smaller than other models and achieve equivalent, or better, performance.

Sigkdd Explorations | 2003

Exploiting relational structure to understand publication patterns in high-energy physics

Amy McGovern; Lisa Friedland; Michael Hay; Brian Gallagher; Andrew S. Fast; Jennifer Neville; David D. Jensen

We analyze publication patterns in theoretical high-energy physics using a relational learning approach. We focus on four related areas: understanding and identifying patterns of citations, examining publication patterns at the author level, predicting whether a paper will be accepted by specific journals, and identifying research communities from the citation patterns and paper text. Each of these analyses contributes to an overall understanding of theoretical high-energy physics.

knowledge discovery and data mining | 2007

Finding tribes: identifying close-knit individuals from employment patterns

Lisa Friedland; David D. Jensen

We present a family of algorithms to uncover tribes-groups of individuals who share unusual sequences of affiliations. While much work inferring community structure describes large-scale trends, we instead search for small groups of tightly linked individuals who behave anomalously with respect to those trends. We apply the algorithms to a large temporal and relational data set consisting of millions of employment records from the National Association of Securities Dealers. The resulting tribes contain individuals at higher risk for fraud, are homogenous with respect to risk scores, and are geographically mobile, all at significant levels compared to random or to other sets of individuals who share affiliations.

knowledge discovery and data mining | 2007

Relational data pre-processing techniques for improved securities fraud detection

Andrew S. Fast; Lisa Friedland; Marc E. Maier; Brian J. Taylor; David D. Jensen; Henry G. Goldberg; John Komoroske

Commercial datasets are often large, relational, and dynamic. They contain many records of people, places, things, events and their interactions over time. Such datasets are rarely structured appropriately for knowledge discovery, and they often contain variables whose meanings change across different subsets of the data. We describe how these challenges were addressed in a collaborative analysis project undertaken by the University of Massachusetts Amherst and the National Association of Securities Dealers(NASD). We describe several methods for data pre-processing that we applied to transform a large, dynamic, and relational dataset describing nearly the entirety of the U.S. securities industry, and we show how these methods made the dataset suitable for learning statistical relational models. To better utilize social structure, we first applied known consolidation and link formation techniques to associate individuals with branch office locations. In addition, we developed an innovative technique to infer professional associations by exploiting dynamic employment histories. Finally, we applied normalization techniques to create a suitable class label that adjusts for spatial, temporal, and other heterogeneity within the data. We show how these pre-processing techniques combine to provide the necessary foundation for learning high-performing statistical models of fraudulent activity.

knowledge discovery and data mining | 2013

Detecting insider threats in a real corporate database of computer usage activity

Ted E. Senator; Henry G. Goldberg; Alex Memory; William T. Young; Brad Rees; Robert Pierce; Daniel Huang; Matthew Reardon; David A. Bader; Edmond Chow; Irfan A. Essa; Joshua Jones; Vinay Bettadapura; Duen Horng Chau; Oded Green; Oguz Kaya; Anita Zakrzewska; Erica Briscoe; Rudolph L. Mappus; Robert McColl; Lora Weiss; Thomas G. Dietterich; Alan Fern; Weng-Keen Wong; Shubhomoy Das; Andrew Emmott; Jed Irvine; Jay Yoon Lee; Danai Koutra; Christos Faloutsos

conference on information and knowledge management | 2008