OPUS 4 | Search

Refine

Has Fulltext

no (390) (remove)

390 search hits

171 to 180

Sort by

Year
Year
Title
Title
Author
Author

MSM: Multi-stage Multicuts for Scalable Image Clustering (2021)

Ho, Kalun ; Chatzimichailidis, Avraam ; Keuper, Margret ; Keuper, Janis

Correlation Clustering, also called the minimum cost Multicut problem, is the process of grouping data by pairwise similarities. It has proven to be effective on clustering problems, where the number of classes is unknown. However, not only is the Multicut problem NP-hard, an undirected graph G with n vertices representing single images has at most edges, thus making it challenging to implement correlation clustering for large datasets. In this work, we propose Multi-Stage Multicuts (MSM) as a scalable approach for image clustering. Specifically, we solve minimum cost Multicut problems across multiple distributed compute units. Our approach not only allows to solve problem instances which are too large to fit into the shared memory of a single compute node, but it also achieves significant speedups while preserving the clustering accuracy at the same time. We evaluate our proposed method on the CIFAR10 …

A Two-Stage Minimum Cost Multicut Approach to Self-Supervised Multiple Person Tracking (2021)

Ho, Kalun ; Kardoost, Amirhossein ; Pfreundt, Franz-Josef ; Keuper, Janis ; Keuper, Margret

Multiple Object Tracking (MOT) is a long-standing task in computer vision. Current approaches based on the tracking by detection paradigm either require some sort of domain knowledge or supervision to associate data correctly into tracks. In this work, we present a self-supervised multiple object tracking approach based on visual features and minimum cost lifted multicuts. Our method is based on straight-forward spatio-temporal cues that can be extracted from neighboring frames in an image sequences without supervision. Clustering based on these cues enables us to learn the required appearance invariances for the tracking task at hand and train an AutoEncoder to generate suitable latent representations. Thus, the resulting latent representations can serve as robust appearance cues for tracking even over large temporal distances where no reliable spatio-temporal features can be extracted. We show that, despite being trained without using the provided annotations, our model provides competitive results on the challenging MOT Benchmark for pedestrian tracking.

Learning Embeddings for Image Clustering: An Empirical Study of Triplet Loss Approaches (2020)

Ho, Kalun ; Keuper, Janis ; Pfreundt, Franz-Josef ; Keuper, Margret

In this work, we evaluate two different image clustering objectives, k-means clustering and correlation clustering, in the context of Triplet Loss induced feature space embeddings. Specifically, we train a convolutional neural network to learn discriminative features by optimizing two popular versions of the Triplet Loss in order to study their clustering properties under the assumption of noisy labels. Additionally, we propose a new, simple Triplet Loss formulation, which shows desirable properties with respect to formal clustering objectives and outperforms the existing methods. We evaluate all three Triplet loss formulations for K-means and correlation clustering on the CIFAR-10 image classification dataset.

Estimating the Robustness of Classification Models by the Structure of the Learned Feature-Space (2022)

Ho, Kalun ; Pfreundt, Franz-Josef ; Keuper, Janis ; Keuper, Margret

Over the last decade, the development of deep image classification networks has mostly been driven by the search for the best performance in terms of classification accuracy on standardized benchmarks like ImageNet. More recently, this focus has been expanded by the notion of model robustness, \ie the generalization abilities of models towards previously unseen changes in the data distribution. While new benchmarks, like ImageNet-C, have been introduced to measure robustness properties, we argue that fixed testsets are only able to capture a small portion of possible data variations and are thus limited and prone to generate new overfitted solutions. To overcome these drawbacks, we suggest to estimate the robustness of a model directly from the structure of its learned feature-space. We introduce robustness indicators which are obtained via unsupervised clustering of latent representations from a trained classifier and show very high correlations to the model performance on corrupted test data.

Learning Embeddings for Image Clustering: An Empirical Study of Triplet Loss Approaches (2021)

Ho, Kalun ; Pfreundt, Franz-Josef ; Keuper, Margret ; Keuper, Janis

Sample efficient localization and stage prediction with autoencoders (2021)

Hoch, Sebastian ; Lange, Sascha ; Keuper, Janis

Engineering, construction and operation of complex machines involves a wide range of complicated, simultaneous tasks, which potentially could be automated. In this work, we focus on perception tasks in such systems, investigating deep learning approaches for multi-task transfer learning with limited training data. We show an approach that takes advantage of a technical systems’ focus on selected objects and their properties. We create focused representations and simultaneously solve joint objectives in a system through multi-task learning with convolutional autoencoders. The focused representations are used as a starting point for the data-saving solution of the additional tasks. The efficiency of this approach is demonstrated using images and tasks of an autonomous circular crane with a grapple.

Erweiterte Realität in der Medizin (2019)

Hoppe, Harald ; Hazubski, Simon ; Strzeletz, Simon

Das Labor für Computerassistierte Medizin entwickelt Anwendungen, die Ärzte dabei unterstützen, chirurgische Eingriffe schneller und patientenschonender zu realisieren

Method for controlling a device, in particular, a prosthetic hand or a robotic arm (US20200327705A1) (2020)

Hoppe, Harald ; Otte, Andreas ; Hazubski, Simon

A method for controlling a device, in particular a prosthetic hand or a robotic arm, includes using an operator-mounted camera to detect at least one marker positioned on or in relation to the device. Starting from the detection of the at least one marker, a predefined movement of the operator together with the camera is detected and is used to trigger a corresponding action of the device. The predefined movement of the operator is detected in the form of a line of sight by means of camera tracking. A system for controlling a device, in particular a prosthetic hand or a robotic arm, includes a pair of AR glasses adapted to detect the at least one marker and to detect the predefined movement of the operator.

High-Resolution Capillary Printing of Eutectic Gallium Alloys for Printed Electronics (2021)

Hussain, Navid ; Fu, Tongtong ; Marques, Gabriel ; Das, Chittaranjan ; Scherer, Torsten ; Aghassi-Hagmann, Jasmin ; Bog, Uwe ; Berner, Lukas ; Wacker, Irene ; Schröder, Rasmus R. ; Hirtz, Michael

A versatile liquid metal (LM) printing process enabling the fabrication of various fully printed devices such as intra- and interconnect wires, resistors, diodes, transistors, and basic circuit elements such as inverters which are process compatible with other digital printing and thin film structuring methods for integration is presented. For this, a glass capillary-based direct-write method for printing LMs such as eutectic gallium alloys, exploring the potential for fully printed LM-enabled devices is demonstrated. Examples for successful device fabrication include resistors, p–n diodes, and field effect transistors. The device functionality and easiness of one integrated fabrication flow shows that the potential of LM printing is far exceeding the use of interconnecting conventional electronic devices in printed electronics.

Online Identification of Semiconductor Switching Times in Inverters with Inductive Load Using an FPGA and Potential Separated Comparators (2023)

Hähnlein, Stefan ; Degel, Jan Philipp ; Klöffer, Christian

The nonlinear behavior of inverters is largely impacted by the interlocking and switching times. A method for online identifying the switching times of semiconductors in inverters is presented in the following work. By being able to identify these times, it is possible to compensate for the nonlinear behavior, reduce interlocking time, and use the information for diagnostic purposes. The method is first theoretically derived by examining different inverter switching cases and determining potential identification possibilities. It is then modified to consider the entire module for more robust identification. The methodology, including limitations and boundary conditions, is investigated and a comparison of two methods of measurement acquisition is provided. Subsequently the developed hardware is described and the implementation in an FPGA is carried out. Finally, the results are presented, discussed, and potential challenges are encountered.

171 to 180

Open Access

Refine

Author

Year of publication

Document Type

Conference Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Institute

Open Access

390 search hits