-
Improving Knot Prediction in Wood Logs with Longitudinal Feature Propagation
Authors:
Salim Khazem,
Jeremy Fix,
Cédric Pradalier
Abstract:
The quality of a wood log in the wood industry depends heavily on the presence of both outer and inner defects, including inner knots that are a result of the growth of tree branches. Today, locating the inner knots requires the use of expensive equipment such as X-ray scanners. In this paper, we address the task of predicting the location of inner defects from the outer shape of the logs. The dataset is built by extracting both the contours and the knots with X-ray measurements. We propose to solve this binary segmentation task by leveraging convolutional recurrent neural networks. Once the neural network is trained, inference can be performed from the outer shape measured with cheap devices such as laser profilers. We demonstrate the effectiveness of our approach on fir and spruce tree species and perform an ablation on the recurrence to demonstrate its importance.
Submitted 22 August, 2023;
originally announced August 2023.
-
First-Generation Inference Accelerator Deployment at Facebook
Authors:
Michael Anderson,
Benny Chen,
Stephen Chen,
Summer Deng,
Jordan Fix,
Michael Gschwind,
Aravind Kalaiah,
Changkyu Kim,
Jaewon Lee,
Jason Liang,
Haixin Liu,
Yinghai Lu,
Jack Montgomery,
Arun Moorthy,
Satish Nadathur,
Sam Naghshineh,
Avinash Nayak,
Jongsoo Park,
Chris Petersen,
Martin Schatz,
Narayanan Sundaram,
Bangsheng Tang,
Peter Tang,
Amy Yang,
Jiecao Yu, et al. (90 additional authors not shown)
Abstract:
In this paper, we provide a deep dive into the deployment of inference accelerators at Facebook. Many of our ML workloads have unique characteristics, such as sparse memory accesses, large model sizes, as well as high compute, memory and network bandwidth requirements. We co-designed a high-performance, energy-efficient inference accelerator platform based on these requirements. We describe the inference accelerator platform ecosystem we developed and deployed at Facebook: both hardware, through Open Compute Platform (OCP), and software framework and tooling, through PyTorch/Caffe2/Glow. A characteristic of this ecosystem from the start is its openness to enable a variety of AI accelerators from different vendors. This platform, with six low-power accelerator cards alongside a single-socket host CPU, allows us to serve models of high complexity that cannot be easily or efficiently run on CPUs. We describe various performance optimizations, at both platform and accelerator level, which enable this platform to serve production traffic at Facebook. We also share deployment challenges and lessons learned during performance optimization, and provide guidance for future inference hardware co-design.
Submitted 4 August, 2021; v1 submitted 8 July, 2021;
originally announced July 2021.
-
Glow: Graph Lowering Compiler Techniques for Neural Networks
Authors:
Nadav Rotem,
Jordan Fix,
Saleem Abdulrasool,
Garret Catron,
Summer Deng,
Roman Dzhabarov,
Nick Gibson,
James Hegeman,
Meghan Lele,
Roman Levenstein,
Jack Montgomery,
Bert Maher,
Satish Nadathur,
Jakob Olesen,
Jongsoo Park,
Artem Rakhov,
Misha Smelyanskiy,
Man Wang
Abstract:
This paper presents the design of Glow, a machine learning compiler for heterogeneous hardware. It is a pragmatic approach to compilation that enables the generation of highly optimized code for multiple targets. Glow lowers the traditional neural network dataflow graph into a two-phase strongly-typed intermediate representation. The high-level intermediate representation allows the optimizer to perform domain-specific optimizations. The lower-level instruction-based address-only intermediate representation allows the compiler to perform memory-related optimizations, such as instruction scheduling, static memory allocation and copy elimination. At the lowest level, the optimizer performs machine-specific code generation to take advantage of specialized hardware features. Glow features a lowering phase which enables the compiler to support a high number of input operators as well as a large number of hardware targets by eliminating the need to implement all operators on all targets. The lowering phase is designed to reduce the input space and allow new hardware backends to focus on a small number of linear algebra primitives.
Submitted 3 April, 2019; v1 submitted 2 May, 2018;
originally announced May 2018.
-
YARBUS : Yet Another Rule Based belief Update System
Authors:
Jeremy Fix,
Herve Frezza-buet
Abstract:
We introduce a new rule based system for belief tracking in dialog systems. Despite the simplicity of the rules being considered, the proposed belief tracker ranks favourably compared to the previous submissions on the second and third Dialog State Tracking challenges. The results of this simple tracker allow us to reconsider the performance of previous submissions that use more elaborate techniques.
Submitted 24 July, 2015;
originally announced July 2015.
-
Optimal Degree Distributions for Uniform Small World Rings
Authors:
R. Seth Terashima,
James D. Fix
Abstract:
Motivated by Kleinberg's (2000) and subsequent work, we consider the performance of greedy routing on a directed ring of $n$ nodes augmented with long-range contacts. In this model, each node $u$ is given an additional $D_u$ edges, a degree chosen from a specified probability distribution. Each such edge from $u$ is linked to a random node at distance $r$ ahead in the ring with probability proportional to $1/r$, a "harmonic" distance distribution of contacts. Aspnes et al. (2002) have shown an $O(\log^2 n / \ell)$ bound on the expected length of greedy routes in the case when each node is assigned exactly $\ell$ contacts and, as a consequence of recent work by Dietzfelbinger and Woelfel (2009), this bound is known to be tight. In this paper, we generalize Aspnes' upper bound to show that any degree distribution with mean $\ell$ and maximum value $O(\log n)$ has greedy routes of expected length $O(\log^2 n / \ell)$, implying that any harmonic ring in this family is asymptotically optimal. Furthermore, for a more general family of rings, we show that a fixed degree distribution is optimal. More precisely, if each random contact is chosen at distance $r$ with a probability that decreases with $r$, then among degree distributions with mean $\ell$, greedy routing time is smallest when every node is assigned $\lfloor \ell \rfloor$ or $\lceil \ell \rceil$ contacts.
Submitted 3 August, 2010; v1 submitted 24 June, 2010;
originally announced June 2010.
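The ring model summarized in this abstract can be illustrated with a small simulation. The following is a minimal sketch under stated assumptions, not the authors' code: every node gets exactly the same number of long-range contacts (the fixed-degree case), contact distances are drawn from the harmonic distribution over $1, \dots, n-1$, and all function names (`harmonic_sampler`, `build_ring`, `greedy_route_length`) are hypothetical.

```python
import random
from bisect import bisect_left
from itertools import accumulate


def harmonic_sampler(n, rng):
    """Return a function drawing a distance r in 1..n-1 with P(r) proportional to 1/r."""
    cumulative = list(accumulate(1.0 / r for r in range(1, n)))
    total = cumulative[-1]

    def draw():
        # Inverse-CDF sampling; min() guards against floating-point edge cases.
        index = bisect_left(cumulative, rng.random() * total)
        return min(index, n - 2) + 1

    return draw


def build_ring(n, contacts_per_node, rng):
    """For each node, store the forward jump distances of its long-range contacts."""
    draw = harmonic_sampler(n, rng)
    return [[draw() for _ in range(contacts_per_node)] for _ in range(n)]


def greedy_route_length(jumps, source, target):
    """Count hops of greedy routing on the directed ring from source to target."""
    n = len(jumps)
    current, hops = source, 0
    while current != target:
        remaining = (target - current) % n
        # The ring edge covers distance 1; take the longest jump that does not overshoot.
        step = max([1] + [r for r in jumps[current] if r <= remaining])
        current = (current + step) % n
        hops += 1
    return hops
```

Averaging `greedy_route_length` over many random source and target pairs, for several values of `n` and `contacts_per_node`, gives an empirical view of how route length changes with the number of contacts; the sketch makes no claim about reproducing the paper's bounds.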
-
A computational approach to the covert and overt deployment of spatial attention
Authors:
Jérémy Fix,
Nicolas P. Rougier,
Frédéric Alexandre
Abstract:
Popular computational models of visual attention tend to neglect the influence of saccadic eye movements, whereas it has been shown that primates perform on average three of them per second and that the neural substrates for the deployment of attention and the execution of an eye movement might considerably overlap. Here we propose a computational model in which the deployment of attention with or without a subsequent eye movement emerges from local, distributed and numerical computations.
Submitted 26 September, 2008;
originally announced September 2008.
-
A Computational Model of Spatial Memory Anticipation during Visual Search
Authors:
Jérémy Fix,
Julien Vitay,
Nicolas Rougier
Abstract:
Some visual search tasks require memorizing the location of stimuli that have been previously scanned. Considerations about eye movements raise the question of how we are able to maintain a coherent memory despite the frequent drastic changes in perception. In this article, we present a computational model that is able to anticipate the consequences of eye movements on visual perception in order to update a spatial memory.
Submitted 9 October, 2006;
originally announced October 2006.