Bridging the gap: dual perception attention and local-global similarity fusion for cross-modal image-text matching

Abstract

Current image-text matching methods implicitly align visual-semantic segments within images, and employ cross-modal attention mechanisms to discover fine-grained cross-modal semantic correspondences. Although region-word pairs constitute local matches across modalities, they may lead to inaccurate measurements of relevance when viewed from a global perspective of image-text relationships. Additionally, cross-modal attention mechanisms may introduce redundant or irrelevant region-word alignments, which can reduce retrieval accuracy and limit efficiency. To address these challenges, we propose a Dual perception Attention and local-global Similarity Fusion framework(DASF). Specifically, We combine two types of similarity matching, global and local, to establish a more accurate correspondence between images and text by simultaneously considering global semantics and local details during the matching process. Simultaneously, we integrate dual-perception attention mechanisms to learn the relationship between images and text, utilizing attention polarity to determine the degree of matching and better consider contextual and semantic information, thereby reducing interference from irrelevant regions. Extensive experiments on two benchmark datasets, Flickr30K and MSCOCO, demonstrate the superior effectiveness of our DASF, achieving state-of-the-art performance.

Rethinking diaspora remittances in the post-Mugabe era in Zimbabwe

Abstract

Based on a qualitative study of Zimbabwean migrants based in South Africa, who regularly remitted goods and money to Zimbabwe between 2010 and 2020, this paper suggests that at a local level, remittances alleviated poverty with very limited if any transformation of the political economy at the national level. Such remittances promoted consumerism without sustainable investment that can structurally transform the economy. In addition, the dependence on remittances entrenches the culture of migration at the local level, which also contributes to or promotes ethno-tribal fissiparity. In rethinking diaspora remittances in the post-Mugabe era, it is advanced that the seemingly intractable economic and political quagmire in Zimbabwe must be resolved to inspire confidence in the diaspora to pull remittances together for a national socio-economic cause and not local-level band-aid accomplishments which remittances currently do.

Computing changes in regular square grids: towards integration of pixel and edge level analyses

Abstract

The aim of the paper is to introduce appropriate tools for quantifying the changes occurred at edge level when analyzing land cover changes. The research is conducted in two directions. Firstly, the binary change index at edge level is introduced and its relationship to the already existing binary change edge (computed at pixel level) is studied. It is proved that a certain inequality between the values of the two indices holds. The theoretical part is completed with real-world examples. The second direction investigated in the paper is the possibility of applying Markov type models, a basic tool in land cover change analysis, in quantifying the changes occurred at edge level. This combined approach is illustrated by a case study and it is shown that it provides complementary information related to the spatial extension of the land cover types and to their distribution. Thus, this type of analysis can improve the methodology of measuring the extent and the implications of land cover changes.

Male sling adjustability: does it truly matter?

Abstract

Purpose

Patients with post prostatectomy incontinence (PPI) seem to have different needs. Therefore, device post-operative readjustability may be a beneficial feature in PPI management, even though it lacks study support. The purpose of this study is to describe our surgical technique for male sling (MS) implantation, assess outcomes, and the impact of readjustability.

Methods

We performed a retrospective analysis of 89 consecutive patients who underwent PPI correction with MS Argus-T™ (Promedon, Córdoba, Argentina) from 2009 to 2021. The median follow-up was 48 months (12–120). Data were collected in a dedicated database. Perioperative variables were assessed. A descriptive statistical analysis was performed. Clinical and urodynamic variables were correlated with the need for readjustments and success.

Results

In this cohort, objective success was achieved in 80.5% of the patients (65.9% cured and 14.6% improved). A total of 85.4% of the patients met the criteria for subjective success (74.4% cured and 11% improved). For the subgroup of patients who received previous treatment for urethral stricture (US), 79% achieved objective success (63.2% cured, 15.8% improved), and 84.2% achieved subjective success (78.9% cured, 5.3% improved). For the subgroup of patients who received previous radiotherapy (RT) before sling surgery, 68.7% achieved objective success (37.5% cured, 31.2% improved), and 75% achieved subjective success (37.5% cured, 37.5% improved). Procedures for device readjustment were necessary for 27.7% of patients in the total study population. RT and previous US treatment were predictive factors for the need of readjustment, with rates of 66.7% and 61.1% (OR: 8.46; CI: 2.46–29.00; p = 0.001/OR: 6.41; CI: 2.05–20.03; p = 0.001, respectively).

Conclusions

MS adjustability improved success rates, especially among irradiated patients and those with previous US. RT was an adverse predictor of total continence status even after readjustments.

Mapping of Temporally Dynamic Tropical Forest and Plantations Canopy Height in Borneo Utilizing TanDEM-X InSAR and Multi-sensor Remote Sensing Data

Abstract

This study explores the potential of TDX InSAR data from 2011, 2017, and 2019 for estimating and mapping canopy heights in unique forest and plantations landscape in Sabah, Malaysian Borneo. The findings offer crucial insights for sustainable forest and plantation management. The methodology encompassed the SINC forest height inversion model and two machine learning (ML) models Random Forest (RF) and Symbolic Regression (SR) augmented with diverse predictor variables and height references. Training the ML models with 70% of ICESat-2 ATL08 data and validating with the remaining 30%, we achieved an out-of-bag (OOB) RMSE of 5.4 m for RF and 5.96 m for SR. The overall validation RMSEs were 6.06 m (2011 SR), 10.36 m (2017 SR), and 7.58 m (2019 RF). For specific LULC classes, accuracies ranged from 3.92 m (2011 Mangrove RF) to 6.11 m (2017 Mangrove SR) and 4.35 m (2019 Rubber RF). Field inventory data validation in 2011 and 2019 yielded RMSEs between 4.06 m and 8.69 m, with SR as the top-performing model. Spatial distribution and canopy height classes revealed non-uniform variations in 2011, with SINC overestimating. In contrast, 2017 and 2019 showed uniform height patterns, indicating an increase in canopy heights across forest and plantation LULC, particularly in the 15–20 m range for oil palm, secondary forest, acacia mangium, and rubber. Our findings highlight the potential of InSAR-based canopy height estimation and mapping for tropical forest and plantations, which also can be applied to other areas at local scales considering the LULC landscapes dynamics.

Teaching–learning environmental conflicts through case studies and experiential immersion: introducing students to transdisciplinary research

Abstract

Environmental conflicts have increased considerably in recent years and approaches from situated and complex perspectives have gained relevance for their study. However, disciplinary approaches are still dominant to the detriment of interdisciplinary and transdisciplinary studies. This article analyzes the relevance of a transdisciplinary approach applied to case studies coupled with what we have called “experiential immersion”, as a pedagogical strategy to better understand the complexity associated with environmental conflicts, which are a key challenge to achieve sustainability. We analyzed and compared two pedagogical experiences—one in Argentina and another one in Mexico—that we developed in undergraduate courses to teach about environmental conflicts. We used transdisciplinary approaches, situated cognition and different strategies of pedagogy for sustainability, promoting the experiential immersion of students in real cases and promoting dialogs and knowledge exchange with the actors involved in the conflicts analyzed. Our experience shows that using a transdisciplinary approach applied to case studies coupled with experiential immersion can be an effective strategy for teaching–learning the inherent complexity of environmental conflicts. Implementation of our pedagogical strategy poses, however, several challenges for all the participants: lecturers, students and the social actors engaged in the conflict. Despite several limitations identified in our experiences, we suggest that the pedagogical strategies we have developed and implemented may also be effective for teaching–learning the multiple complex relations and trade-offs associated with transitions to sustainability.

Understanding the mechanisms of meaning-making for transformations toward sustainability: contributions from Personal Knowledge Theory

Abstract

The concept of meaning-making is increasingly identified as a crucial process and an entry point for sustainability transformations in a wide range of contexts and approaches, but it has not yet been studied in this field as an independent concept. In other literature, meaning-making has recently been focused on, yielding valuable information on how to better conceptualize and design events to trigger transformations. Furthermore, that study indicated the presence of underlying mechanisms of meaning-making, which might provide further design insights and theoretical underpinning. Here we investigate those underlying mechanisms, in a case which spans the two literatures. Village leaders in Botswana underwent the specialist shared-values crystallization group process within the WeValue InSitu approach and underwent a sustainability transformation, producing a significantly superior climate change adaptation plan. Using micro-concepts from Personal Knowledge Theory for line-by-line fine-toothed analysis, we reveal mechanisms underlying meaning-making by individuals and the group. The findings show two distinct types of micro-meaningmaking sequences were found: one was assimilative and a rarer one adaptive, involving participants modifying some premises. This distinction allows the micromoment of individual transformation to be identified, allowing ex and ante study to understand better what happened beforehand to cause it, and how it led onward to group and wider transformations. Another finding was that paired cognitive and communicative processes make up iterative meaning-making sequences where individuals take in new stimuli, understand tacitly, articulate the new meaning moreexplicitly, and repeat. Micro-meaning-making thus appears to be micro-integration between aspects of knowledge: tacit/explicit; external/internal. Design implications involve better considerations on assisting participants to access their own tacit spaces; to ensure they have shared experiences which allow intersubjective interactions to trigger and accelerate individual and collective meaning-making; that this space is protected from interruptions such as latecomers, stop–starting the session, and facilitators inserting personal content.

Understanding the mechanisms of meaning-making for transformations toward sustainability: contributions from Personal Knowledge Theory

Abstract

The concept of meaning-making is increasingly identified as a crucial process and an entry point for sustainability transformations in a wide range of contexts and approaches, but it has not yet been studied in this field as an independent concept. In other literature, meaning-making has recently been focused on, yielding valuable information on how to better conceptualize and design events to trigger transformations. Furthermore, that study indicated the presence of underlying mechanisms of meaning-making, which might provide further design insights and theoretical underpinning. Here we investigate those underlying mechanisms, in a case which spans the two literatures. Village leaders in Botswana underwent the specialist shared-values crystallization group process within the WeValue InSitu approach and underwent a sustainability transformation, producing a significantly superior climate change adaptation plan. Using micro-concepts from Personal Knowledge Theory for line-by-line fine-toothed analysis, we reveal mechanisms underlying meaning-making by individuals and the group. The findings show two distinct types of micro-meaningmaking sequences were found: one was assimilative and a rarer one adaptive, involving participants modifying some premises. This distinction allows the micromoment of individual transformation to be identified, allowing ex and ante study to understand better what happened beforehand to cause it, and how it led onward to group and wider transformations. Another finding was that paired cognitive and communicative processes make up iterative meaning-making sequences where individuals take in new stimuli, understand tacitly, articulate the new meaning moreexplicitly, and repeat. Micro-meaning-making thus appears to be micro-integration between aspects of knowledge: tacit/explicit; external/internal. Design implications involve better considerations on assisting participants to access their own tacit spaces; to ensure they have shared experiences which allow intersubjective interactions to trigger and accelerate individual and collective meaning-making; that this space is protected from interruptions such as latecomers, stop–starting the session, and facilitators inserting personal content.

A comprehensive exploration of deep learning approaches for pulmonary nodule classification and segmentation in chest CT images

Abstract

Accurately determining whether nodules on CT images of the lung are benign or malignant plays an important role in the early diagnosis and treatment of tumors. In this study, the classification and segmentation of benign and malignant nodules on CT images of the lung were performed using deep learning models. A new approach, C+EffxNet, is used for classification. With this approach, the features are extracted from CT images and then classified with different classifiers. In other phases of the study, a segmentation between benign and malignant was performed and, for the first time, a comparison of nodes was made during segmentation. The deep learning models InceptionV3, DenseNet121, and SeResNet101 were used as backbone models for feature extraction in the segmentation phase. In the classification phase, an accuracy of 0.9798, a precision of 0.9802, a recognition of 0.9798, an F1 score of 0.9798, and a kappa value of 0.9690 were achieved. During segmentation, the highest values of 0.8026 Jacard index and 0.8877 Dice coefficient were achieved.

Weakly supervised target detection based on spatial attention

Abstract

Due to the lack of annotations in target bounding boxes, most methods for weakly supervised target detection transform the problem of object detection into a classification problem of candidate regions, making it easy for weakly supervised target detectors to locate significant and highly discriminative local areas of objects. We propose a weak monitoring method that combines attention and erasure mechanisms. The supervised target detection method uses attention maps to search for areas with higher discrimination within candidate regions, and then uses an erasure mechanism to erase the region, forcing the model to enhance its learning of features in areas with weaker discrimination. To improve the positioning ability of the detector, we cascade the weakly supervised target detection network and the fully supervised target detection network, and jointly train the weakly supervised target detection network and the fully supervised target detection network through multi-task learning. Based on the validation trials, the category mean average precision (mAP) and the correct localization (CorLoc) on the two datasets, i.e., VOC2007 and VOC2012, are 55.2% and 53.8%, respectively. In regard to the mAP and CorLoc, this approach significantly outperforms previous approaches, which creates opportunities for additional investigations into weakly supervised target identification algorithms.