Recent Work

Major components of most recent work

Last updated on Oct 10, 2019

Contributed to a TensorFlow library that enabled rapid development and evaluation of new models. Our work focused on rank-based metrics, so the framework used custom hooks during training to enable early stopping on custom rank metrics (top-K, MAP).
Replicated state-of-the-art model architectures for Siamese sequence matching (e.g. decomposable attention), RNNs, and 1D CNNs
Wrote a wrapper for fine-tuning BERT
Built a library for fine-tuning/training a set of best-in-class models on new data, to quickly evaluate novel datasets
Deployed production models using Flask and AWS Elastic Beanstalk
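
As a rough illustration of early stopping on rank metrics, the sketch below computes top-K accuracy and MAP from ranked relevance labels and tracks the metric with a patience counter. It is a minimal, framework-free sketch and not the original library code: the names `top_k_accuracy`, `mean_average_precision` and `EarlyStopper` are hypothetical, and in the real framework this logic would sit inside a training hook.

```python
from typing import List, Sequence


def top_k_accuracy(ranked_labels: Sequence[Sequence[int]], k: int) -> float:
    """Fraction of queries with at least one relevant item in the top k.

    Each inner sequence holds 0/1 relevance labels ordered by model score,
    best-scored candidate first.
    """
    hits = sum(1 for labels in ranked_labels if any(labels[:k]))
    return hits / len(ranked_labels)


def mean_average_precision(ranked_labels: Sequence[Sequence[int]]) -> float:
    """Mean over queries of the average precision of each ranked list."""
    ap_values: List[float] = []
    for labels in ranked_labels:
        hits = 0
        precisions: List[float] = []
        for rank, relevant in enumerate(labels, start=1):
            if relevant:
                hits += 1
                precisions.append(hits / rank)
        # A query with no relevant candidates contributes an AP of 0.
        ap_values.append(sum(precisions) / hits if hits else 0.0)
    return sum(ap_values) / len(ap_values)


class EarlyStopper:
    """Signals a stop once the metric fails to improve `patience` times in a row."""

    def __init__(self, patience: int = 3, min_delta: float = 0.0) -> None:
        self.patience = patience
        self.min_delta = min_delta
        self.best = float("-inf")
        self.bad_evals = 0

    def should_stop(self, metric: float) -> bool:
        if metric > self.best + self.min_delta:
            self.best = metric
            self.bad_evals = 0
        else:
            self.bad_evals += 1
        return self.bad_evals >= self.patience
```

In a training loop, one would evaluate on a held-out set every few hundred steps, pass the resulting MAP (or top-K) to `should_stop`, and break out of training when it returns `True`.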

Our research focused on FAQ-based question answering (question-question matching), topic classification for questions, and designing model architectures with inductive biases for paraphrastic encodings (i.e. sentences with different lexical form but the same meaning should have similar encodings).
Main model architectures included Siamese architectures for sequence matching (RNN, CNN)
Embeddings (GloVe, ELMo, Flair, custom, etc.)
Hyper-params
Model architectures (decomposable attention, CNNs, RNNs, regularisation)
Training data (different sources, mixes, negative sampling strategies)
Pre-trained language models (BERT) + fine-tuning
Tools for visualising/analysing latent representation
Error analyses
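
The Siamese idea behind question-question matching can be sketched in a few lines: both questions pass through the same encoder, and the match score is a similarity between the two encodings. This is only a toy sketch under stated assumptions. The word vectors in `TOY_VECTORS` are invented for illustration, and the averaging encoder is a stand-in for the RNN/CNN encoders mentioned above; `encode`, `cosine` and `match_score` are hypothetical names.

```python
import math
from typing import Dict, List

# Hypothetical 2-d word vectors, invented purely for illustration.
TOY_VECTORS: Dict[str, List[float]] = {
    "reset": [1.0, 0.0],
    "change": [0.9, 0.1],
    "password": [0.0, 1.0],
    "passcode": [0.1, 0.9],
    "delete": [-1.0, 0.0],
    "account": [0.0, -1.0],
}


def encode(sentence: str) -> List[float]:
    """Shared encoder: average the vectors of known words (stand-in for an RNN/CNN)."""
    vectors = [TOY_VECTORS[w] for w in sentence.lower().split() if w in TOY_VECTORS]
    if not vectors:
        return [0.0, 0.0]
    return [sum(dim) / len(vectors) for dim in zip(*vectors)]


def cosine(a: List[float], b: List[float]) -> float:
    """Cosine similarity, defined as 0 when either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def match_score(question_a: str, question_b: str) -> float:
    """Siamese scoring: both inputs go through the same encoder."""
    return cosine(encode(question_a), encode(question_b))
```

With these toy vectors, `match_score("reset password", "change passcode")` is much higher than `match_score("reset password", "delete account")`, which is the behaviour a paraphrastic encoder is meant to show: different wording, similar encoding.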

Our research found that our baseline models generalised poorly. We approached this in two ways: focusing on architectures that promote generalisation by design, and gathering more data to expose the models to. The latter involved a significant amount of work on methods for dataset creation, including:
Model-based data augmentation
Synonym swapping using WordNet
Back-translation
Similarity-based negative sampling for improving training dynamics
Amazon Mechanical Turk for creating paraphrases and labelled data, including model-based Mechanical Turk environments where soft model predictions were used as label targets
Model-based bootstrapping for creating soft-labels and noisy paraphrases
Created gold-standard evaluation dataset using expert and novice labellers and a voting system
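
To make similarity-based negative sampling concrete, the sketch below picks, for an anchor question, the candidates that look most similar to it while carrying a different label, so the model trains on hard negatives rather than random ones. This is a minimal sketch and not the original pipeline: token-level Jaccard overlap is swapped in here as a cheap stand-in for whatever similarity the real system used, and `jaccard` and `hard_negatives` are hypothetical names.

```python
from typing import List, Tuple


def jaccard(a: str, b: str) -> float:
    """Token-set overlap between two questions (stand-in similarity measure)."""
    tokens_a, tokens_b = set(a.lower().split()), set(b.lower().split())
    union = tokens_a | tokens_b
    return len(tokens_a & tokens_b) / len(union) if union else 0.0


def hard_negatives(
    anchor: str,
    anchor_label: str,
    pool: List[Tuple[str, str]],
    n: int = 2,
) -> List[str]:
    """Return the n pool questions most similar to the anchor with a different label.

    `pool` holds (question, label) pairs; questions sharing the anchor's
    label are positives and are therefore excluded.
    """
    candidates = [
        (jaccard(anchor, text), text)
        for text, label in pool
        if label != anchor_label
    ]
    candidates.sort(key=lambda pair: pair[0], reverse=True)
    return [text for _, text in candidates[:n]]
```

Swapping `jaccard` for a similarity computed from model encodings would make the sampling model-based, at the cost of re-scoring the pool as the model changes.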

Taxonomy/knowledge base construction and optimisation
Created and managed databases for asset storage and access (DynamoDB)
Accessed and manipulated data in NoSQL databases (MongoDB)
Analytics
AWS (EC2, DynamoDB, S3)
Wrote a number of utils/helper functions, e.g. for connecting to EC2, setting CloudWatch alarms, synchronising local files with S3, etc.
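
A helper for setting a CloudWatch alarm might look roughly like the sketch below. It is an illustration only, not the original utility code: the function names, default thresholds and alarm naming scheme are assumptions. It relies on boto3's `put_metric_alarm` call, and `create_cpu_alarm` needs boto3 installed and AWS credentials configured; boto3 is imported inside that function so the parameter builder can be used and tested without it.

```python
from typing import Any, Dict, List, Optional


def cpu_alarm_params(
    instance_id: str,
    threshold: float = 80.0,
    period_seconds: int = 300,
    evaluation_periods: int = 2,
    alarm_actions: Optional[List[str]] = None,
) -> Dict[str, Any]:
    """Build keyword arguments for CloudWatch `put_metric_alarm`.

    The alarm fires when average EC2 CPU utilisation stays above
    `threshold` percent for `evaluation_periods` consecutive periods.
    """
    return {
        "AlarmName": f"high-cpu-{instance_id}",  # hypothetical naming scheme
        "Namespace": "AWS/EC2",
        "MetricName": "CPUUtilization",
        "Dimensions": [{"Name": "InstanceId", "Value": instance_id}],
        "Statistic": "Average",
        "Period": period_seconds,
        "EvaluationPeriods": evaluation_periods,
        "Threshold": threshold,
        "ComparisonOperator": "GreaterThanThreshold",
        "AlarmActions": alarm_actions or [],
    }


def create_cpu_alarm(instance_id: str, region_name: str, **overrides: Any) -> None:
    """Create or update the alarm; requires boto3 and valid AWS credentials."""
    import boto3  # imported lazily so the builder above works without boto3

    client = boto3.client("cloudwatch", region_name=region_name)
    client.put_metric_alarm(**cpu_alarm_params(instance_id, **overrides))
```

Keeping the parameter construction separate from the API call makes the helper easy to check without touching AWS.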