Implicit function declarations: flex's use of "reallocarray"

April 22, 2019
Arjun Shankar

    Several months ago, I took over the maintenance of the flex package in Fedora and decided to kick the tires by rebasing the package in Fedora Rawhide. I downloaded and hashed the latest tarball at the time, flex-2.6.4, tweaked the spec file, and fired up a local build. Unfortunately, it failed with a SIGSEGV at build time:

    ./stage1flex -o stage1scan.c ./scan.l
    make[2]: *** [Makefile:1695: stage1scan.c] Segmentation fault (core dumped)
    

    Some debugging with gdb led me to the conclusion that the segmentation fault occurred when flex, during initialization, wrote to a block of memory returned by the reallocarray function. In this article, I'll describe the issue further and explain the changes made to address it.

    Here is a simplified snippet of my gdb session:

    (gdb) bt
    #0 check_mul_overflow_size_t (right=1, left=2048, left@entry=0)
    #1 __GI___libc_reallocarray (optr=0x0, nmemb=2048, elem_size=1)
    #2 allocate_array at misc.c:147
    #3 flexinit at main.c:974
    #4 flex_main at main.c:168
    #5 __libc_start_main
    (gdb) fin
    Run till exit from #0 check_mul_overflow_size_t
    __GI___libc_reallocarray
    33              return realloc (optr, bytes);
    (gdb) fin
    Run till exit from #0 __GI___libc_reallocarray
    in allocate_array
    147             mem = reallocarray(NULL, (size_t) size, element_size);
    Value returned is $1 = (void *) 0x5555557c6420
    (gdb) fin
    Run till exit from #0 allocate_array
    in flexinit
    974             action_array = allocate_character_array (action_size);
    Value returned is $2 = (void *) 0x557c6420
    (gdb) n
    975             defs1_offset = prolog_offset = action_offset = action_index = 0;
    (gdb) n
    976             action_array[0] = '\0';
    (gdb) n
    Program received signal SIGSEGV, Segmentation fault.
    

    I didn't notice anything off here right up to the point at which the segfault occurs, but maybe you already did. All I saw was that the returned pointer was non-NULL on line 974, but writing to it on line 976 resulted in a segfault. It began to look like a malloc bug.

    On a whim, I built the same tarball outside of the Fedora build system. This time, the typical ./configure && make command line didn't segfault at build time. So apparently the difference lay in the build options used by rpmbuild. Some trial and error led me to the cause: -pie, the linker flag that produces a position-independent executable. Building with -pie caused the segmentation fault.

    Armed with this "reproducer" and advice from my colleagues at Red Hat, I set about doing a git-bisect on the flex sources. HEAD was building cleanly on the upstream master branch at that point even with -pie, so it was just a matter of finding the commit that fixed the build. The commit in question was the fix for the following issue reported against flex upstream:

    #241: "implicit declaration of function reallocarray is invalid in C99"

    So, the flex sources didn't define _GNU_SOURCE, which meant the compiler saw no declaration of the reallocarray function. In such cases, the compiler creates an implicit function declaration with the default return type (int) and generates code accordingly. On 64-bit Intel machines, the int type is only 32 bits wide while pointers are 64 bits wide, so only the lower 32 bits of the returned pointer survive. Going back and looking at the gdb session, it then became clear to me that the pointer gets truncated:

    147             mem = reallocarray(NULL, (size_t) size, element_size);
    Value returned is $1 = (void *) 0x5555557c6420
    (gdb) fin
    Run till exit from #0  allocate_array
    in flexinit
    974             action_array = allocate_character_array (action_size);
    Value returned is $2 = (void *) 0x557c6420
    

    This only happens in position-independent executables because the heap gets mapped to a part of the address space where pointers are larger than INT_MAX, exposing the above flex bug. GCC actually warns of the presence of implicit function declarations via the -Wimplicit-function-declaration option. It appears that there was a fairly recent proposal to enable this warning in Fedora builds, but it was eventually shelved. If enabled, the warning would still cause the flex build to fail, but earlier and at a point where the problem was clear.

    At this point, getting the build to compile successfully was a simple matter of backporting the corresponding flex patch that defines _GNU_SOURCE and exposes the reallocarray prototype to the compiler.

    But we didn't just stop there. One of my colleagues, Florian Weimer, a regular contributor to glibc, thought that all this could have been avoided if reallocarray had been exposed by glibc via the more general _DEFAULT_SOURCE feature test macro. The change has now been committed to glibc upstream and has been available since glibc 2.29.

    With this change, we hope to avoid similar situations in other components in Fedora and the glibc user community. glibc now provides the reallocarray function prototype unless the user explicitly requires stricter conformance to a given standard.

    Last updated: April 17, 2019
