Sorry, you need to enable JavaScript to visit this website.
Share

Publications

2022

  • Computational models of disfluencies : fillers and discourse markers in spoken language understanding
    • Dinkar Tanvi
    , 2022. People rarely speak in the same manner that they write – they are generally disfluent. Disfluencies can be defined as interruptions in the regular flow of speech, such as pausing silently, repeating words, or interrupting oneself to correct something said previously. Despite being a natural characteristic of spontaneous speech, and the rich linguistic literature that discusses their informativeness, they are often removed as noise in post-processing from the output transcripts of speech recognisers. So far, their consideration in a Spoken Language Understanding (SLU) context has been rarely explored. The aim of this thesis is to develop computational models of disfluencies in SLU. To do so, we take inspiration from psycholinguistic models of disfluencies, which focus on the role that disfluencies play in the production (by the speaker) and comprehension (by the listener) of speech. Specifically, when we use the term ``computational models of disfluencies'', we mean to develop methodologies that automatically process disfluencies to empirically observe 1) their impact on the production and comprehension of speech, and 2) how they interact with the primary signal (the lexical, or what was said in essence). To do so, we focus on two discourse contexts; monologues and task-oriented dialogues.Our results contribute to broader tasks in SLU, and also research relevant to Spoken Dialogue Systems. When studying monologues, we use a combination of traditional and neural models to study the representations and impact of disfluencies on SLU performance. Additionally, we develop methodologies to study disfluencies as a cue for incoming information in the flow of the discourse. In studying task-oriented dialogues, we focus on developing computational models to study the roles of disfluencies in the listener-speaker dynamic. We specifically study disfluencies in the context of verbal alignment; i.e. the alignment of the interlocutors' lexical expressions, and the role of disfluencies in behavioural alignment; a new alignment context that we propose to mean when instructions given by one interlocutor are followed with an action by another interlocutor. We also consider how these disfluencies in local alignment contexts can be associated with discourse level phenomena; such as success in the task. We consider this thesis one of the many first steps that could be undertaken to integrate disfluencies in SLU contexts.
  • Metric learning for video to music recommendation
    • Prétet Laure
    , 2022. Music enhances moving images and allows to efficiently communicate emotion or narrative tension, thanks to cultural codes common to the filmmakers and viewers. A successful communication requires not only a choice of track matching the video's mood and content, but also a temporal synchronization of the audio and visual main events. This is the goal of the music supervision industry, which traditionally carries out the task manually. In this dissertation, we study the automation of tasks related to music supervision. The music supervision problem generally doesn't have a unique solution, as it includes external constraints such as the client's identity or budget. It is thus relevant to proceed by recommendation. As the number of available musical videos is in constant augmentation, it makes sense to use data-driven tools. More precisely, we use the metric learning paradigm to learn the relevant projections of multimodal (video and music) data. First, we address the music similarity problem, which is used to broaden the results of a music search. We implement an efficient content-based imitation of a tag-based similarity metric. To do so, we present a method to train a convolutional neural network from ranked lists. Then, we focus on direct, content-based music recommendation for video. We adapt a simple self-supervised system and we demonstrate a way to improve its performance, by using pretrained audio features and learning their aggregation. We then carry a qualitative and quantitative analysis of official music videos to better understand the temporal organization of musical videos. Results show that official music videos are carefully edited in order to align audio and video events, and that the level of synchronization depends on the music and video genres. With this insight, we propose the first recommendation system designed specifically for music supervision: the Seg-VM-Net, which uses both content and structure to perform the matching of music and video.
  • High-definition video broadcasting with a room-temperature quantum cascade laser emitting in the long-wave infrared domain
    • Didier Pierre
    • Yang Ke
    • Spitz Olivier
    • Guillaume-Manca Alice
    • Liu Junqi
    • Grillot Frédéric
    , 2022, pp.46. Quantum cascade lasers (QCLs) are relevant optical sources for free-space communication because they can emit in the long-wave infrared (LWIR) domain, i.e. in the 8-12 µm region. The advantage of this optical domain is that it combines a high atmosphere transmission1 with a reduced distortion for propagating beams,2 thus the superiority of LWIR lasers in comparison with existing near-infrared systems is very dependent on link availability.3 Furthermore, QCLs are characterized by the absence of relaxation oscillation resonance.4 This peculiarity could imply a very large modulation bandwidth, even if QCL structures still need to be optimized to avoid parasitic effects.5 Recent experimental efforts have highlighted the potential of QCL-based free-space communication systems6–8 and the current 4 Gbits/s record rate is expected to be outperformed in the near future with bandwidth-enhanced structures.9 This work describes a free-space live video broadcasting with a room-temperature QCL emitting at 8.1 µm. The video file is encoded in uncompressed high-definition format (1280 pixels x 720 pixels) and this corresponds to a data rate of 1.485 Gbits/s with on-off keying scheme. This high-speed electrical signal is directly injected in the QCL via the AC port of a bias tee. The modulated optical signal from the QCL is retrieved with a Mercury-Cadmium-Telluride detector and the resulting electrical signal is sent to a TV monitor where the video can be watched in live. The current findings demonstrate the versatility of a communication system with QCLs and this paves the way for real-field applications (10.1117/12.2608511)
    DOI : 10.1117/12.2608511
  • Chaos-based mid-infrared communications
    • Grillot Frédéric
    • Spitz Olivier
    , 2022, 11995, pp.1. The advantage of mid-infrared wavelength is that it is less affected by atmospheric conditions than conventional near-infrared wavelength, and this optical domain is thus envisioned to play a key role in the 6G standard under development. The directivity of the beam, as well as the stealth conferred by the background emission, makes communication systems based on long-wave infrared quantum cascade lasers (QCL) highly desirable. However, some applications require a further level of privacy. Protecting the communication link against eavesdroppers is possible with chaos-based enciphering. Using this concept, a chaotic master QCL is used to conceal the private message while deciphering is achieved with a second, identical, remote QCL that is called the slave. The deciphering process relies on chaos anti-synchronization where the slave only reproduces the reversed chaotic pattern of the master, thus allowing the recovery of the private message by adding the slave signal and the master signal. The privacy of our system is also assessed and shows that an illegitimate receiver would end with a detrimental error rate during translation, even in the unlikely case this eavesdropper knows the coding format of the private message. We believe our private communication system brings a cost-effective, reliable and versatile alternative for free-space data links, especially in harsh environments where mid-infrared lasers strongly outperform their near-infrared counterparts. Features such as room-temperature operation and highspeed transmission further advocates for a large deployment, and we anticipate that this finding can have a significant impact on the development of novel applications based on QCLs. (10.1117/12.2613797)
    DOI : 10.1117/12.2613797
  • Effects of external optical feedback in InAs/InP quantum dot frequency comb lasers on silicon
    • Renaud Thibaut
    • Huang Heming
    • Liang Di
    • Kurczveil Geza
    • Beausoleil Raymond
    • Grillot Frédéric
    , 2022, 11995, pp.17. On-chip integration of semiconductor lasers have shown a growing interest in recent years, especially for the development of photonic integrated circuits (PICs) which are of paramount importance for high-speed communication within and between data centers, and fast on-board data exchanges. For all these applications, a key challenge remains the stability of the laser sources integrated on a PIC in presence of external optical feedback with the view to avoid integrated bulky and costly optical isolation. In this study, the effects of external optical feedback are investigated in hybrid InAs/InP quantum dot comb lasers on silicon. The design of the cavity includes a semiconductor optical amplifier section, a saturable absorber and an on-chip external cavity incorporating a vertical coupler. We measured the resulting feedback properties with respect to the operation conditions (bias current and voltage) and to the length of the saturable absorber. We show that under most operating conditions, the laser remains stable against optical feedback, only few regimes of operation occur, which either improve or degrade the frequency comb and/or the radio-frequency beatnote power of the laser. (10.1117/12.2608949)
    DOI : 10.1117/12.2608949
  • Design territorial, représentations spatiales et participation citoyenne : revue de cas et analyse d’outils
    • Jolivet-Duval Marion
    • Safin Stéphane
    • Huron Samuel
    Sciences du Design, Presses Universitaires de France, 2022, n° 14 (2), pp.55-75. (10.3917/sdd.014.0055)
    DOI : 10.3917/sdd.014.0055
  • Hair Color Digitization through Imaging and Deep Inverse Graphics
    • Kips Robin
    • Bokaris Panagiotis-Alexandros
    • Perrot Matthieu
    • Gori Pietro
    • Bloch Isabelle
    , 2022. Hair appearance is a complex phenomenon due to hair geometry and how the light bounces on different hair fibers. For this reason, reproducing a specific hair color in a rendering environment is a challenging task that requires manual work and expert knowledge in computer graphics to tune the result visually. While current hair capture methods focus on hair shape estimation many applications could benefit from an automated method for capturing the appearance of a physical hair sample, from augmented/virtual reality to hair dying development. Building on recent advances in inverse graphics and material capture using deep neural networks, we introduce a novel method for hair color digitization. Our proposed pipeline allows capturing the color appearance of a physical hair sample and renders synthetic images of hair with a similar appearance, simulating different hair styles and/or lighting environments. Since rendering realistic hair images requires path-tracing rendering, the conventional inverse graphics approach based on differentiable rendering is untractable. Our method is based on the combination of a controlled imaging device, a path-tracing renderer, and an inverse graphics model based on self-supervised machine learning, which does not require to use differentiable rendering to be trained. We illustrate the performance of our hair digitization method on both real and synthetic images and show that our approach can accurately capture and render hair color.
  • Radio frequency exposure analysis in 5G massive MIMO systems
    • Al Hajj Maarouf
    , 2022. This dissertation presents in-situ measurements of a massive MIMO antenna and analyses the different parameters pertinent to the estimation of the EMF exposure in a 5G network. Multiple methods are presented and discussed to estimate the power received in the network while focusing on the advantages and inconveniences in each of them. This dissertation also proposes a new analytical method for studying the average exposure, presented by the total power received, in a 5G mmWave massive MIMO network. Using stochastic geometry, a close-form equation of the exposure is developed and studied by fitting a mmWave channel model using NYUSIM into statistical distributions and by modeling the BSs as a PPP. A sensitivity analysis is performed to quantify the influence of the input variables onto the exposure.Another model for a multi-user massive MIMO network is also developed deploying maximum-ratio combining precoding and max-min fairness downlink power control, and where MTs are distributed following a PPP and can be either LoS or NLoS. A closed-form expression of the expectation of the total power received and the expression of the ratio between the total power and the SIR at the nearest MT to its serving BS, where the exposure is highest. The average exposure is then studied in relation to network parameters taking into account the trade-offs presented by the power control model and antenna gains. Likewise, the ratio between the exposure and SIR is also analyzed to study the increase of exposure per the increase of the SIR at the nearest MT to its BS. And it is shown that the higher the number of antenna elements a massive MIMO antenna has, the more efficient it is in terms of SIR considering the produced exposure.
  • Use of multi-temporal and multi-sensor data for continental water body extraction in the context of the SWOT mission
    • Gasnier Nicolas
    , 2022. Spaceborne remote sensing provides hydrologists and decision-makers with data that are essential for understanding the water cycle and managing the associated resources and risks. The SWOT satellite, which is a collaboration between the French (CNES) and American (NASA, JPL) space agencies, is scheduled for launch in 2022 and will measure the height of lakes, rivers, and oceans with high spatial resolution. It will complement existing sensors, such as the SAR and optical constellations Sentinel-1 and 2, and in situ measurements. SWOT represents a technological breakthrough as it is the first satellite to carry a near-nadir swath altimeter. The estimation of water levels is done by interferometry on the SAR images acquired by SWOT. Detecting water in these images is therefore an essential step in processing SWOT data, but it can be very difficult, especially with low signal-to-noise ratios, or in the presence of unusual radiometries. In this thesis, we seek to develop new methods to make water detection more robust. To this end, we focus on the use of exogenous data to guide detection, the combination of multi-temporal and multi-sensor data and denoising approaches. The first proposed method exploits information from the river database used by SWOT (derived from GRWL) to detect narrow rivers in the image in a way that is robust to both noise in the image, potential errors in the database, and temporal changes. This method relies on a new linear structure detector, a least-cost path algorithm, and a new Conditional Random Field segmentation method that combines data attachment and regularization terms adapted to the problem. We also proposed a method derived from GrabCut that uses an a priori polygon containing a lake to detect it on a SAR image or a time series of SAR images. Within this framework, we also studied the use of a multi-temporal and multi-sensor combination between Sentinel-1 SAR and Sentinel-2 optical images. Finally, as part of a preliminary study on denoising methods applied to water detection, we studied the statistical properties of the geometric temporal mean and proposed an adaptation of the variational method MuLoG to denoise it.
  • Demonstrating Internet Of Secure Elements Server
    • Urien Pascal
    , 2022, pp.923-924. This demonstration presents the first IOSE (Internet Of Secure Elements) server working within a Raspberry Pi, and managing four secure elements. This server executes two TCP daemons. RACS port (Remote APDU Call Secure) is used to download applications in secure elements according to global platform (GP) protocols. TLS port is used to route TLS messages to TLS-SE applications, identified by their server name. TLS-SE is an embedded software for secure element, protected by TLS1.3 secure channel (i.e. TLS server). The demo shows on-demand deployment of TLS-SE App from service provider to IOSE server. It presents the user’s enrollment procedure required to transfer the exclusive TLS-SE ownership to user. Finally it shows a use case in which TLS-SE App is used for trusted and secure Ethereum transaction generation (10.1109/CCNC49033.2022.9700553)
    DOI : 10.1109/CCNC49033.2022.9700553
  • Towards Internet Of Secure Elements
    • Urien Pascal
    , 2022, pp.949-950. This poster introduces the Internet Of Secure Elements (IOSE) infrastructure. IOSE enables the deployment of personal Hardware Secure Module (HSM), based on secure element (SE) grid, providing secure storage and tamper resistant computing resources (10.1109/CCNC49033.2022.9700663)
    DOI : 10.1109/CCNC49033.2022.9700663
  • The Dichotomy of Evaluating Homomorphism-Closed Queries on Probabilistic Graphs
    • Amarilli Antoine
    • Ceylan İsmail İlkan
    Logical Methods in Computer Science, Logical Methods in Computer Science Association, 2022. We study the problem of query evaluation on probabilistic graphs, namely, tuple-independent probabilistic databases over signatures of arity two. We focus on the class of queries closed under homomorphisms, or, equivalently, the infinite unions of conjunctive queries. Our main result states that the probabilistic query evaluation problem is #P-hard for all unbounded queries from this class. As bounded queries from this class are equivalent to a union of conjunctive queries, they are already classified by the dichotomy of Dalvi and Suciu (2012). Hence, our result and theirs imply a complete data complexity dichotomy, between polynomial time and #P-hardness, on evaluating homomorphism-closed queries over probabilistic graphs. This dichotomy covers in particular all fragments of infinite unions of conjunctive queries over arity-two signatures, such as negation-free (disjunctive) Datalog, regular path queries, and a large class of ontology-mediated queries. The dichotomy also applies to a restricted case of probabilistic query evaluation called generalized model counting, where fact probabilities must be 0, 0.5, or 1. We show the main result by reducing from the problem of counting the valuations of positive partitioned 2-DNF formulae, or from the source-to-target reliability problem in an undirected graph, depending on properties of minimal models for the query. (10.46298/lmcs-18(1:2)2022)
    DOI : 10.46298/lmcs-18(1:2)2022
  • An eager splitting strategy for online decision trees in ensembles
    • Manapragada Chaitanya
    • Gomes Heitor Murilo
    • Salehi Mahsa
    • Bifet Albert
    • Webb Geoffrey I.
    Data Mining and Knowledge Discovery, Springer, 2022, 36 (2), pp.566--619. Decision tree ensembles are widely used in practice. In this work, we study in ensemble settings the effectiveness of replacing the split strategy for the state-of-the-art online tree learner, Hoeffding Tree, with a rigorous but more eager splitting strategy that we had previously published as Hoeffding AnyTime Tree. Hoeffding AnyTime Tree (HATT), uses the Hoeffding Test to determine whether the current best candidate split is superior to the current split, with the possibility of revision, while Hoeffding Tree aims to determine whether the top candidate is better than the second best and if a test is selected, fixes it for all posterity. HATT converges to the ideal batch tree while Hoeffding Tree does not. We find that HATT is an efficacious base learner for online bagging and online boosting ensembles. On UCI and synthetic streams, HATT as a base learner outperforms HT at a 0.05 significance level for the majority of tested ensembles on what we believe is the largest and most comprehensive set of testbenches in the online learning literature. Our results indicate that HATT is a superior alternative to Hoeffding Tree in a large number of ensemble settings. (10.1007/S10618-021-00816-X)
    DOI : 10.1007/S10618-021-00816-X
  • Approximating Perfect Recall when Model Checking Strategic Abilities: Theory and Applications
    • Belardinelli Francesco
    • Lomuscio Alessio
    • Malvone Vadim
    • Yu Emily
    Journal of Artificial Intelligence Research, Association for the Advancement of Artificial Intelligence, 2022, 73, pp.897-932. The model checking problem for multi-agent systems against specifications in the alternating-time temporal logic AT L, hence AT L∗ , under perfect recall and imperfect information is known to be undecidable. To tackle this problem, in this paper we investigate a notion of bounded recall under incomplete information. We present a novel three-valued semantics for AT L∗ in this setting and analyse the corresponding model checking problem. We show that the three-valued semantics here introduced is an approximation of the classic two-valued semantics, then give a sound, albeit partial, algorithm for model checking two-valued perfect recall via its approximation as three-valued bounded recall. Finally, we extend MCMAS, an open-source model checker for AT L and other agent specifications, to incorporate bounded recall; we illustrate its use and present experimental results. (10.1613/jair.1.12539)
    DOI : 10.1613/jair.1.12539
  • Competition between Entrainment Phenomenon and Chaos in a Quantum-Cascade Laser under Strong Optical Reinjection
    • Spitz Olivier
    • Durupt Lauréline
    • Grillot Frédéric
    Photonics, MDPI, 2022, 9 (1), pp.1-10. The topic of external optical feedback in quantum-cascade lasers is relevant for stability and beam-properties considerations. Albeit less sensitive to external optical feedback than other lasers, quantum-cascade lasers can exhibit several behaviors under such feedback, and those are relevant for a large panel of applications, from communication to ranging and sensing. This work focused on a packaged Fabry–Perot quantum-cascade laser under strong external optical feedback and shows the influence of the beam-splitter characteristics on the optical power properties of this commercially available laser. The packaged quantum-cascade laser showed extended conditions of operation when subject to strong optical feedback, and the maximum power that can be extracted from the external cavity was also increased. When adding a periodic electrical perturbation, various non-linear dynamics were observed, and this complements previous efforts about the entrainment phenomenon in monomode quantum-cascade lasers, with the view of optimizing private communication based on mid-infrared quantum-cascade lasers. Overall, this work is a step forward in understanding the behavior of the complex quantum-cascade-laser structure when it is subjected to external optical feedback. (10.3390/photonics9010029)
    DOI : 10.3390/photonics9010029
  • One Versus all for deep Neural Network Incertitude (OVNNI) quantification
    • Franchi Gianni
    • Bursuc Andrei
    • Aldea Emanuel
    • Dubuisson Séverine
    • Bloch Isabelle
    IEEE Access, IEEE, 2022. Deep neural networks (DNNs) are powerful learning models yet their results are not always reliable. This is due to the fact that modern DNNs are usually uncalibrated and we cannot characterize their epistemic uncertainty. In this work, we propose a new technique to quantify the epistemic uncertainty of data easily. This method consists in mixing the predictions of an ensemble of DNNs trained to classify One class vs All the other classes (OVA) with predictions from a standard DNN trained to perform All vs All (AVA) classification. On the one hand, the adjustment provided by the AVA DNN to the score of the base classifiers allows for a more fine-grained inter-class separation. On the other hand, the two types of classifiers enforce mutually their detection of out-of-distribution (OOD) samples, circumventing entirely the requirement of using such samples during training. Our method achieves state of the art performance in quantifying OOD data across multiple datasets and architectures while requiring little hyper-parameter tuning. (10.1109/access.2021.3138978)
    DOI : 10.1109/access.2021.3138978
  • Mid-infrared hyperchaos of interband cascade lasers
    • Deng Yu
    • Fan Zhuo-Fei
    • Zhao Bin-Bin
    • Wang Xing-Guang
    • Zhao Shiyuan
    • Wu Jiagui
    • Grillot Frédéric
    • Wang Cheng
    Light: Science and Applications, Nature Publishing Group, 2022, 11 (7), pp.1-10. Abstract Chaos in nonlinear dynamical systems is featured with irregular appearance and with high sensitivity to initial conditions. Near-infrared light chaos based on semiconductor lasers has been extensively studied and has enabled various applications. Here, we report a fully-developed hyperchaos in the mid-infrared regime, which is produced from interband cascade lasers subject to the external optical feedback. Lyapunov spectrum analysis demonstrates that the chaos exhibits three positive Lyapunov exponents. Particularly, the chaotic signal covers a broad frequency range up to the GHz level, which is two to three orders of magnitude broader than existed mid-infrared chaos solutions. The interband cascade lasers produce either periodic oscillations or low-frequency fluctuations before bifurcating to hyperchaos. This hyperchaos source is valuable for developing long-reach secure optical communication links and remote chaotic Lidar systems, taking advantage of the high-transmission windows of the atmosphere in the mid-infrared regime. (10.1038/s41377-021-00697-1)
    DOI : 10.1038/s41377-021-00697-1
  • Compositional Equivalences Based on Open pNets
    • Ameur-Boulifa Rabéa
    • Henrio Ludovic
    • Madelaine Eric
    Journal of Logical and Algebraic Methods in Programming, Elsevier, 2022, 131, pp.100842. Establishing equivalences between programs is crucial both for verifying correctness of programs and for justifying optimisations and program transformations. There exist several equivalence relations for programs, and bisimulations are among the most versatile of these equivalences. Among bisimulations one distinguishes strong bisimulation that requires that each action of a program is simulated by a single action of the equivalent program, and weak bisimulation that allows some of the actions to be invisible, and thus not simulated. pNet is a generalisation of automata that model open systems. They feature variables and hierarchical composition. Open pNets are pNets with holes, i.e. placeholders that can be filled later by subsystems. However, there is no standard tool for defining the semantics of an open system in this context. This article first defines open automata that are labelled transition systems with parameters and holes. Relying on open automata, it then defines bisimilarity relations for the comparison of systems specified as pNets. We first present a strong bisimilarity for open pNets called FH-bisimilarity. Next we offer an equivalence relation similar to the classical weak bisimulation equivalence, and study its properties. Among these properties we are interested in compositionality: if two systems are proven equivalent they will be indistinguishable by their context, and they will also be indistinguishable when their holes are filled with equivalent systems. We identify sufficient conditions to ensure compositionality of strong and weak bisimulation. The contributions of this article are illustrated using a transport protocol as running example. (10.1016/j.jlamp.2022.100842)
    DOI : 10.1016/j.jlamp.2022.100842
  • Unsupervised Audio Source Separation Using Differentiable Parametric Source Models
    • Schulze-Forster Kilian
    • Doire Clement S J
    • Richard Gael
    • Badeau Roland
    Computing Research Repository, ACM / ArXiv, 2022. Supervised deep learning approaches to underdetermined audio source separation achieve state-of-the-art performance but require a dataset of mixtures along with their corresponding isolated source signals. Such datasets can be extremely costly to obtain for musical mixtures. This raises a need for unsupervised methods. We propose a novel unsupervised modelbased deep learning approach to musical source separation. Each source is modelled with a differentiable parametric sourcefilter model. A neural network is trained to reconstruct the observed mixture as a sum of the sources by estimating the source models' parameters given their fundamental frequencies. At test time, soft masks are obtained from the synthesized source signals. The experimental evaluation on a vocal ensemble separation task shows that the proposed method outperforms learning-free methods based on nonnegative matrix factorization and a supervised deep learning baseline. Integrating domain knowledge in the form of source models into a data-driven method leads to high data efficiency: the proposed approach achieves good separation quality even when trained on less than three minutes of audio. This work makes powerful deep learning based separation usable in scenarios where training data with ground truth is expensive or nonexistent.
  • Introduction
    • Bloch Isabelle
    • Euzenat Jérôme
    • Lang Jérôme
    • Schwarzentruber François
    Revue Ouverte d'Intelligence Artificielle, Association pour la diffusion de la recherche francophone en intelligence artificielle, 2022, 3, pp.193-199. no abstract (10.5802/roia.28fr)
    DOI : 10.5802/roia.28fr
  • Post-actes de la Conférence Nationale en Intelligence Artificielle (CNIA 2018-2020)
    • Bloch Isabelle
    • Euzenat Jérôme
    • Lang Jérôme
    • Schwarzentruber François
    Revue Ouverte d'Intelligence Artificielle, Association pour la diffusion de la recherche francophone en intelligence artificielle, 2022, 3, pp.193-413. no abstract
  • High-capacity free-space optical link in the midinfrared thermal atmospheric windows using unipolar quantum devices
    • Didier Pierre
    • Dely Hamza
    • Bonazzi Thomas
    • Spitz Olivier
    • Awwad Elie
    • Rodriguez Étienne
    • Vasanelli Angela
    • Sirtori Carlo
    • Grillot Frédéric
    Advanced photonics, SPIE, 2022, 4 (5), pp.056004. Free-space optical communication is a very promising alternative to fiber communication systems, in terms of ease of deployment and costs. Midinfrared light has several features of utter relevance for free-space applications: low absorption when propagating in the atmosphere even under adverse conditions, robustness of the wavefront during long-distance propagation, and absence of regulations and restrictions for this range of wavelengths. A proof-of-concept of high-speed transmission taking advantage of intersubband devices has recently been demonstrated, but this effort was limited by the short-distance optical path (up to 1 m). In this work, we study the possibility of building a long-range link using unipolar quantum optoelectronics. Two different detectors are used: an uncooled quantum cascade detector and a nitrogen-cooled quantum well-infrared photodetector. We evaluate the maximum data rate of our link in a back-to-back configuration before adding a Herriott cell to increase the length of the light path up to 31 m. By using pulse shaping, pre- and post-processing, we reach a record bitrate of 30 Gbit s − 1 for both two-level (OOK) and four-level (PAM-4) modulation schemes for a 31-m propagation link and a bit error rate compatible with error-correction codes. (10.1117/1.AP.4.5.056004)
    DOI : 10.1117/1.AP.4.5.056004
  • LAKE DETECTION WITH SENTINEL-1 DATA USING A GRAB-CUT METHOD AND ITS MULTI-TEMPORAL EXTENSION
    • Gasnier Nicolas
    • Denis Loïc
    • Fjørtoft Roger
    • Liege Frédéric
    • Tupin Florence
    , 2022. This paper presents a semi-guided method to detect lakes in Sentinel-1 SAR data. The proposed approach is an adaptation of the grab-cut framework developed in [1]. Starting from a coarse bounding box around the lake, an accurate segmentation is extracted using a Conditional Random Field formalism and a graph-cut based optimization. Then an extension of this approach to process jointly a stack of multi-temporal data is presented. A temporal regularization term is introduced to control the joint segmentation. The proposed approach is evaluated on Sentinel-1 datasets. Qualitative and quantitative results demonstrate the interest of the proposed framework and its robustness to the initialization polygon of the lake. (10.1109/IGARSS46834.2022.9883219)
    DOI : 10.1109/IGARSS46834.2022.9883219
  • Combining Embeddings and Rules for Fact Prediction
    • Boschin Armand
    • Jain Nitisha
    • Keretchashvili Gurami
    • Suchanek Fabian M.
    , 2022. Knowledge bases are typically incomplete, meaning that they are missing information that we would expect to be there. Recent years have seen two main approaches to guess missing facts: Rule Mining and Knowledge Graph Embeddings. The first approach is symbolic, and finds rules such as "If two people are married, they most likely live in the same city". These rules can then be used to predict missing statements. Knowledge Graph Embeddings, on the other hand, are trained to predict missing facts for a knowledge base by mapping entities to a vector space. Each of these approaches has their strengths and weaknesses, and this article provides a survey of neuro-symbolic works that combine embeddings and rule mining approaches for fact prediction. (10.4230/OASIcs.AIB.2022.4)
    DOI : 10.4230/OASIcs.AIB.2022.4
  • (Complex) natural gradient optimization for optical quantum circuit design
    • Yao Yuan
    • Cussenot Pierre
    • Wolf Richard A.
    • Miatto Filippo M.
    Physical Review A, American Physical Society, 2022, 105 (5), pp.052402. Few-mode optical quantum circuits can be simulated and optimized using gradient-descent methods, as optical gates can be parametrized by continuous parameters. However, the parameter space as seen by a nonholomorphic cost function is not Euclidean, which means that the Euclidean gradient does not generally point in the direction of steepest ascent. In order to retrieve the true steepest-ascent direction, one must take into account the local metric in what is known as the natural-gradient (NG) method. In this work we implement the NG for the optimization of optical quantum circuits. Specifically, we adapt the NG approach to a complex-valued parameter space, allowing us to apply the NG method directly within the Bargmann formalism, which we rely on for transforming between phase space and Fock space. We then compare the NG approach to vanilla gradient descent and to Adam over two standard state-preparation benchmarks: a single-photon source and a Gottesman-Kitaev-Preskill state source. We observe that the NG approach converges significantly faster (due in part to the possibility of using larger learning rates) and with a smoother decay of the cost function throughout the optimization. This result further improves our ability to design quantum circuits via classical optimization methods. (10.1103/PhysRevA.105.052402)
    DOI : 10.1103/PhysRevA.105.052402