Brian J. Taylor
I completed my Ph.D. from the department of computer science at the University of Massachusetts Amherst in August 2015. My research work was in the area of Causality, learning causal models from observational data. I am currently working for Amazon.com, Inc. as an Applied Scientist in the area of online display advertising. I am interested in machine learning, particularly causal knowledge discovery, and applying machine learning techniques in relational domains. I was a member of the Knowledge Discovery Laboratory (KDL) and my advisor was David Jensen.

Contact Information:

Email:   brian.james.taylor at gmail dot com
Phone:  585.613.1248
Address: 9420 Frontier Ave SE
               Snoqualmie, WA 98065

Current Employment

Amazon Advertising Platform

I am currently working as an Applied Scientist within the Amazon Advertising Platform, a business that creates and supports display advertising on Amazon owned and operated sites as well as on other publishers in the Amazon Ad Platform network. We utilize logistic regression and in some cases random forests to predict the performance of Ads such as probability users will click on an Ad or purchase the product being advertised. This is often referred to as optimization. This changes the ranking of Ads and helps to decide what Ads we place in front of users. Initially I started within the Advertising Platform as a Machine Learning Engineer with part emphasis on production code development and part on researching optimization models. That slowly became a 100% software development role and as I completed my PhD, I changed roles and teams internally to become an Applied Scientist which continues to be a mix of engineering and data science.

Previous Research

Causal Learning as Planning

I investigated a generalized approach to causal learning called causal planning that reasons over causal designs as actions used to learn a causal model. I have been able to show that the incorporation of informative priors over the causal model space combined with assessing the benefits and utilities of causal designs, leads to performance improvements over standard constraint-based learning algorithms such as PC. My early work with causal learning was in the development of an extension of PC that works in relational domains called relational PC (RPC).

Causal Knowledge Discovery

I helped to create a system that enables causal knowledge discovery through identification of quasi-experimental designs (QEDs) from static, observational data. QEDs are traditionally found through manual, time-consuming means. Our automated system can find large numbers of possible QEDs and then reduce that number through analysis of dependencies in the data, generating a list of highly likely candidates for good QEDs. This system can combine QEDs with other kinds of tests on the data such as conditional independence tests and then automatically apply these designs and build a causal model over the data set through an iterative process.

Peer Production/Collaborative Systems

I developed a small research platform to investigate peer production and collaborative sensing systems called Photobase that ran on a Nokia N95. These are systems where people come together to generate or gather content that is then used and shared by the community. Wikipedia is an example. Photobase allowed us to experiment while participants use the system so we can evaluate what influences and affects their behavior and levels of participation. The Photobase design and experimental controls enabled strong causal inference. For example I found that when participants view the collaboration as a competition, they actually participate less frequently than those who do not. The study also suggested that for any participatory system that includes areas infrequently traveled will either need to rely on a very large and carefully selected participant list or the use of coordination to guarantee coverage.

Relational Learning

My initial research at UMass was in the area of relational machine learning algorithms and graphical models. I investigated the relational probability tree (RPT) and the relational dependency network (RDN) and how different characteristics of a data set can bias the learning algorithms. Biases like relational autocorrelation and degree dependency can influence relational learning algorithms and cause them to believe there is knowledge within the data that is not present but techniques like randomization tests and Gibbs sampling can correct for them. As part of a team within KDL, I helped to identify fraudulent behavior among brokers who are licensed by the Financial Industrial Regulatory Authority (FINRA). As part of a class project I applied relational learning techniques to clusters of individuals and movies to predict movie preferences for the Netflix Prize competition.

Verification and Validation of Adaptive Systems

Prior to coming to UMass I worked in the area of software engineering, specifically verification and validation (V&V), on adaptive systems. I was the Principle Investigator in the study and development of novel techniques into neural network software verification and validation that resulted in guidance for software engineers working on neural network adaptive systems. I developed a neural network rule extraction algorithm to translate the inner knowledge of a self-organizing neural network called DCS into a formal set of association rules that can then be used for human understanding, software validation, testing, and hazard mitigation. I also helped to develop and implement a flight-qualified intelligent flight control system onboard an experimental F-15 aircraft as part of a research team on the Intelligent Flight Control Systems project funded by NASA Dryden Flight Research Center.

Skills

Machine Learning
 Causality, Logistic Regression, Decision Trees, Random Forests, Bayesian Networks, Relational Learning, NNs (Self-organizing maps, MLPs)
Programming Languages
 Java, Python (moderate), R (novice), SQL, Unix Shell Script, Scala (novice), C (rusty)
Technologies
 Spark (novice), PostgreSQL, Hive (novice), git, AWS (DynamoDB, S3, Kinesis, Redshift, EC2, SQS, SNS, EMR), Python Libraries (NumPy, SciPy, Pandas, Scikit Learn)
Tools
 Eclipse, LateX, Word, Excel, PowerPoint, OmniGraffle, OmniOutliner, Keynote, Visio
Software Development
 Scrum/Agile/Kanban, Production Coding and Deployment, V&V
Other
 Quasi-Experimental Designs, A/B testing, Peer Production Systems

Selected Publications

Dissertation and Theses

Books and Chapters

Journal Articles

Conference Papers

Have a good day!