You may download a PDF version of this resume here
Atigeo, Aug 2013 – Sep 2016
Senior Software Engineer & Data Scientist/Engineer – Developed Hive/MapReduce/Spark Python modules for ML & predictive analytics in Hadoop/Hive/Hue on AWS. Implemented a Python-based distributed random forest via Hive and Python streaming. Pipelined (ingest/clean/munge/transform) data for feature extraction toward downstream classification.
Primary designer/developer of a scikit-learn based random forest & ensemble ML pipeline for cross-fold-validated predictive analytics, including insight via feature importance exposure. Performed statistical analysis and visualization of ML results via ROC curves & AUC.
Onboarded multiple new hires to assist them in familiarizing with our vast and complex data pipeline and suite of databases and tables.
Primary designer/developer of a pipeline that ingests/catalogs/stores/analyzes new datasets with final analytics/visualization. This project implemented an SOW whose completion was the keystone of a seven-figure contract.
Expedia via Slalom Consulting placement, Dec 2012 – Jul 2013
Big Data Engineer, Consultant – Brief MongoDB project, then Hadoop/Hive on AWS, using EMR and nonEMR-Hadoop in EC2. Tasks: EC2-to-S3 data synch., Hive stand-up, AWS profiling. Accomplishments: Hadoop 2.0/YARN EC2 deployment. Amazon's own engineers were curious about my progress.
Slalom Consulting, Feb 2012 – Jul 2013
Big Data Engineer, Consultant – National Mobility team (under Jeff Rubingh), National BI team (under Kevin Gregory), developing big data processing techniques. Focus: Hadoop MapReduce, Hive, Cloudera, Tier 3, Hadoop-on-Azure. Topics: CRM, NY MTA, Linked-In/Twitter APIs, some OpenLayers visualization.
University of Washington, Dept. of Astronomy, Feb 2010 – Jan 2012
Research Scientist IV – LSST group (mgr. Andrew Connolly). Development of massively parallel image processing routines in Hadoop, namely image coaddition (multiple partially overlapping images are registered, stacked, and mosaiced together). Test dataset: SDSSDB (30TB, 4 million images), future applications to LSST (60PBs). Cluster (NSF CluE): 892 machines, 700TB storage, 3568 concurrent processes.
University of Washington, Applied Physics Lab, May 2007 – Feb 2010
Software Engineer IV – Proj. 1: Sonar Simulation Toolkit (under Robert Goddard), an eigenray model of underwater acoustics: Incorporation of external libraries, OO design, feature development, optimization/performance- redesign, refactorization, unit-testing. Proj. 2: a real-time data-acquisition/FFT-processing system with low data-loss tolerances, rapid throughput, and amenability to future parallelism.
University of New Mexico, 1999 – 2007
Course Instructor (Jan 2007 – May 2007) – CS241, Data structures/algorithms, taught in C.
The Institute for Genomic Research, Sep 1997 – Aug 1999
Software Developer – C++ bioinformatics software development for DNA sequencing tools and closure analysis.
Sample only. Please see my website for a comprehensive listing and github for a few public disseminations.
Image/Acoustic Signal Processing
Keith's Image Stacker: Multi-threaded (aka parallel) image stacking, Laplacian sharpening, wavelet denoising. Used by amateur astrophotographers, reviewed online and in Astronomy and Sky & Telescope.
WildSpectra (collaboration: Dr. R. Haven Wiley, Biology dept, UNC-CH): Mac real-time spectrogram analyzer, used in Dr. Wiley's research lab and by researchers throughout the acoustic-biology community.
Keith's iPod Photo Reader: Extracts images from iPod .ithmb image files. Implementation required reverse-engineering the image format from scratch.
Data Analytics/Dynamic Websites
Neuromorphic CM1K Emulator A Python emulator of General Vision's CM1K neuromorphic chip, including slides presenting modeling experiments. See personal website or github for more info.
Movie Hurl (http://moviehurl.com): A Perl server-side website of user ratings of “shaky-cam”movies. Generic ratings: weighted average of anonymous submissions. Personalized predictions: correlated user-pair ratings across overlapping data subsets. See associated New York Times article under Publicity below.
Petri (game): grow a cell culture in a Petri dish, fend off invasive cultures and phage outbreaks.
WildSpectra Mobile: Real-time scrolling spectrograms (FFT and octave-band) on Android devices. Also: real-time waveform & FFT/octave spectrum, and post-recording editing/playback and file I/O.
Shead Spreet: Spread sheet for Android devices with 300,000 installs, 8500 sales, and a 4.3/5 rating.
Distributed Mandelbrot Set: Generates fractal images by farming job-segments to multiple computers. Networking coded from scratch using sockets. Automatic load-balancing ensures optimal performance.
Druid (PhD thesis): Vector drawing program which permits interwoven surfaces (Celtic knots, Olympic rings, etc.) and which provides an isomorphic efficient user interface.
Simulation: Artificial life, evolutionary/genetic algorithms, cellular automata, robotics, flocking (please see my website).
Web Design: http://keithwiley.com, http://moviehurl.com, http://badlandswatches.com
Positions, Publicity, Awards
• Movie Hurl (see Personal Projects above) was mentioned in a New York Times article at
• Numerous interviews & articles following my book's publication (see my website for links), 2014–2015.
• Advisor to the Brain Preservation Foundation, 2014–present.
• Science Advisor to the Lifeboat Foundation, 2011–present.
• Proceedings chair for the Computer Science at UNM Student Conference committee, 2006.
• Sky & Telescope magazine. Software review: Keith's Image Stacker and Keith's Astroimager, p. 110, Aug 2004.
• First place in the first International Online Artificial Life Creator's Contest, Cyberbotics Webots, khepera robot sim., Jul 1999.
Graduate Research (sample)
Winter 2003–Summer 2006, Ph.D. thesis
Submitted or Under Review
Mind Uploading and the Question of Life, the Universe, and Everything. 2015.
Pattern Evolver, An Evolutionary Algorithm that Solves the Nonintuitive Problem of Black and White Pixel Distribution to Produce Tiled Patterns that Appear Gray. The Handbook of Genetic Algorithms, 1999.
'Interstellar' Might Depict AI Slavery. H+ Magazine, 2014. (also available on H+ Magazine's website).