The primary goal of this project is to enhance the efficiency of the City of Boston’s 311 service request system by improving the categorization and assignment of service requests using AI technologies. This will reduce the manual work required to correctly classify requests, ensuring faster and more accurate routing to the appropriate department.
The key objectives include:
This project aligns with the broader goals of open-source projects by:
This section outlines the key features implemented, milestones achieved, and the reasoning behind the technical choices made:
AI for Categorization:
The goal was to apply image-recognition AI to categorize civic service requests based on images provided by Boston residents. We were aware of on-device AI solutions like Core ML (iOS) and ML Kit (Android), but both required separate development efforts. TensorFlow Lite was another option, but similarly, it would have meant maintaining two separate codebases for iOS and Android.
To address this, we decided to build a Progressive Web App (PWA). PWAs provide a cross-platform solution, working across mobile devices regardless of the operating system, as they only require a browser. One key advantage of using a PWA is the ability to easily deliver updates during development, allowing for rapid iterations and testing. This flexibility made PWA a practical and efficient choice for our prototype.
Additionally, with recent advancements in web-based machine learning (Web ML) technologies like WebNN (by W3C) and Google’s Visual Blocks, the web has become a viable platform for deploying ML models. Even more powerful transformer-based models can now run in the browser using transformer.js, which leverages ONNX Runtime. However, we found TensorFlow.js to be more lightweight and better suited for vision tasks, making it a more appropriate choice for our image classification needs.
Image Classification Task and Model Training:
We trained an image classification model to categorize images into nine specific civic service request categories. These categories included:
The model was developed using transfer learning from the tensorflow-ic-imagenet-mobilenet-v3-large-100-224
model. This architecture is lightweight and optimized for mobile and web applications, making it suitable for deployment in the Progressive Web App (PWA).
Due to the large size of the dataset, we chose AWS for training, leveraging SageMaker. SageMaker allowed us to efficiently manage the computational demands and scale required for the training process. The resulting model was robust and ready for deployment in the PWA, handling real-time image classification directly in the browser.