I am a first-year PhD student in the Robotics Institute at Carnegie Mellon University
supervised by Professor George Kantor.
My research interests are in robot learning, semantic scene understanding, and data-efficiency. I seek to explore generalizable
and adaptable methods that allow robots to reason about objects in their surroundings in order to perform complex tasks in diverse
and unstructured environments.
I completed my Master of Science in Robotics at CMU working with
Professor George Kantor. My research focused on computer vision,
3D reconstruction, and next-best-view planning for phenotyping small crops in agriculture.
Prior to CMU, I was a senior embedded software engineer at Amazon Web Services in the AI Devices division.
I worked on the AWS Panorama appliance that brings computer vision and machine learning capabilities to IP cameras.
I completed my undergraduate degree at Cornell University where I studied Electrical and Computer Engineering.
We develop a computer vision-based method to size apple fruitlets and track their growth rates. Fruitlets are sized and temporally associated
using a combination of deep learning-based and classical methods.
Developed a novel next-best-view planning approach to enable a 7 DoF robotic arm to autonomously capture images of apple fruitlets.
Utilized a coarse and fine dual-map representation along with an attention-guided information gain formulation to determine the next best camera pose.
Presented a robust estimation and graph clustering approach to associate fruit detections across images in the presence of wind and sensor error.
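To give a flavor of the association step, here is a minimal, self-contained sketch (not the paper's algorithm, which combines robust estimation with graph clustering): a robust median shift absorbs global motion such as wind sway before detections in two frames are greedily matched. All function names, thresholds, and coordinates below are illustrative assumptions.

```python
import math

def median(values):
    """Robust central estimate, insensitive to outlier detections."""
    s = sorted(values)
    n = len(s)
    return s[n // 2] if n % 2 else 0.5 * (s[n // 2 - 1] + s[n // 2])

def associate(frame_a, frame_b, max_dist=5.0):
    """Match 2D fruitlet detections between two frames.

    A median shift between the frames absorbs global motion (e.g. wind
    sway) before nearest-neighbour matching; detections farther than
    max_dist from any candidate are left unmatched.
    """
    # Robustly estimate the global shift from frame_a to frame_b.
    shift = (median([b[0] for b in frame_b]) - median([a[0] for a in frame_a]),
             median([b[1] for b in frame_b]) - median([a[1] for a in frame_a]))
    matches, used = [], set()
    for i, (ax, ay) in enumerate(frame_a):
        best, best_d = None, max_dist
        for j, (bx, by) in enumerate(frame_b):
            if j in used:
                continue
            d = math.hypot(bx - (ax + shift[0]), by - (ay + shift[1]))
            if d < best_d:
                best, best_d = j, d
        if best is not None:
            used.add(best)
            matches.append((i, best))
    return matches
```

In practice a graph-clustering formulation handles more than two frames at once; the two-frame greedy matcher above only illustrates why a robust shift estimate is needed before distances become meaningful.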
We develop a method for creating high-quality 3D models of sorghum panicles to non-destructively estimate seed counts.
This is achieved using seeds as semantic 3D landmarks for global registration and a novel density-based clustering approach.
Additionally, we present an unsupervised metric to assess point cloud
reconstruction quality in the absence of ground truth.
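The density-based clustering idea can be illustrated with a compact DBSCAN-style pass over 3D points, where each dense region of points is counted as one seed. This is a generic sketch under assumed parameters, not the paper's own clustering variant.

```python
import math
from collections import deque

def neighbours(points, i, eps):
    """Indices of all points within eps of point i (excluding i itself)."""
    return [j for j, p in enumerate(points)
            if j != i and math.dist(points[i], p) <= eps]

def count_seed_clusters(points, eps=0.5, min_pts=3):
    """DBSCAN-style clustering: each dense region of 3D points is one seed."""
    labels = [None] * len(points)   # None = unvisited, -1 = noise
    n_clusters = 0
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        nbrs = neighbours(points, i, eps)
        if len(nbrs) < min_pts:
            labels[i] = -1          # too sparse: treat as noise for now
            continue
        labels[i] = n_clusters      # i is a core point; start a new cluster
        queue = deque(nbrs)
        while queue:
            j = queue.popleft()
            if labels[j] == -1:
                labels[j] = n_clusters      # noise becomes a border point
            if labels[j] is not None:
                continue
            labels[j] = n_clusters
            j_nbrs = neighbours(points, j, eps)
            if len(j_nbrs) >= min_pts:      # core point: keep expanding
                queue.extend(j_nbrs)
        n_clusters += 1
    return n_clusters
```

The cluster count is the non-destructive seed-count estimate; `eps` and `min_pts` would be tuned to the reconstruction's point density.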
We build a real-time aerial system for multi-camera control
that can reconstruct human motions in natural environments without the use of special-purpose markers.
This is achieved with a multi-robot coordination scheme that maintains the optimal flight formation for target reconstruction quality amongst obstacles.
We develop a next-best-view planning approach to capture images of and size apple fruitlets. Our planner utilizes
both coarse and fine octrees to map the environment and to calculate the information gain of sampled viewpoints.
Fruitlet sizing is performed by reprojecting extracted fruitlet surfaces onto 2D images and fitting ellipses.
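As a simplified, self-contained illustration of the sizing step: 3D surface points are projected through a pinhole model, and ellipse axis lengths are estimated from the second moments of the projected points. The actual pipeline fits ellipses to the reprojected surfaces; the moment-based stand-in, intrinsics, and helper names here are assumptions.

```python
import math

def project(point, fx, fy, cx, cy):
    """Pinhole projection of a 3D camera-frame point to pixel coordinates."""
    x, y, z = point
    return (fx * x / z + cx, fy * y / z + cy)

def ellipse_axes(points2d):
    """Estimate (major, minor) ellipse axis lengths from 2D point moments.

    For points uniformly filling an ellipse, each covariance eigenvalue
    equals (semi_axis ** 2) / 4, so the full axis length is
    4 * sqrt(eigenvalue). A least-squares conic fit would be used in
    practice; moments give a compact approximation.
    """
    n = len(points2d)
    mx = sum(u for u, _ in points2d) / n
    my = sum(v for _, v in points2d) / n
    sxx = sum((u - mx) ** 2 for u, _ in points2d) / n
    syy = sum((v - my) ** 2 for _, v in points2d) / n
    sxy = sum((u - mx) * (v - my) for u, v in points2d) / n
    # Closed-form eigenvalues of the 2x2 covariance matrix.
    t, d = sxx + syy, sxx * syy - sxy ** 2
    root = math.sqrt(max(t * t / 4 - d, 0.0))
    lam1, lam2 = t / 2 + root, t / 2 - root
    return 4 * math.sqrt(lam1), 4 * math.sqrt(max(lam2, 0.0))
```

The recovered axis lengths (in pixels) are converted to metric fruitlet sizes using the camera intrinsics and the depth of the reconstructed surface.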
We demonstrate how the use of semantics and environmental priors can help in constructing accurate 3D maps for downstream agricultural tasks
with the target application of phenotyping sorghum.
We present computer vision-based methods to non-destructively measure phenotypes
of small grains and fruit, specifically sorghum seed counts and apple fruitlet sizes. We do this by leveraging semantic
information to improve tasks such as localization, association, and viewpoint planning.
Projects
A few selected projects from a mix of academic and personal.
Applied deep reinforcement learning to learn a unified policy that jointly controls a quadruped and its arm-mounted camera
to track a moving target.